Richard Zhang

Research Scientist
Adobe Research
San Francisco, CA

rizhang at adobe.com
[GitHub] [Google Scholar]
[Resume/CV] [Twitter] [Bio]


My research interests are in computer vision, machine learning, deep learning, graphics, and image processing. I obtained a PhD at UC Berkeley, advised by Prof. Alexei (Alyosha) Efros. I obtained BS and MEng degrees from Cornell University in ECE. I often collaborate with academic researchers, either through internships or university collaboration.


News

[Sept 2020] Few updates regarding Antialiasing CNNs [ICML 2019], which can stabilize and improve the backbone for your application:
- Easy installation: pip install antialiased-cnns and import antialiased_cnns; model = antialiased_cnns.resnet50(pretrained= True)
- For more information, including "What is Aliasing?", see my guest lecture [15 min] in SFU CMPT 361, Intro to Vision, Sampling and Aliasing lecture.
- A nice followup work, Delving Deeper into Antialiasing in Convnets by Zou, Xiao, Yu, & Lee, won best paper at BMVC 2020. Check it out!
[Aug 2020] I gave a talk on Detecting Generated Imagery, Deep and Shallow (35 min) at the Sensing Humans workshop at ECCV.
[Aug 2020] I gave a talk on Style and Structure Disentanglement for Image Manipulation (30 min) at the Advances in Image Manipulation workshop at ECCV.
[Aug 2020] I gave a talk on Analyzing Artifacts in Discriminative and Generative Models (40 min) at the GAMES webinar.
[July 2020] Our work on using contrastive learning for unpaired translation was accepted to ECCV.
[July 2020] Our work on inverting GANs was accepted to ECCV as an oral.
[July 2020] See our new work on Swapping Autoencoders below.
[July 2020] Our work on audio perceptual metrics was accepted to Intespeech.
[Feb 2020] I served as an Area Chair for CVPR 2020 and spoke on Analyzing CNN Artifacts in Discriminative and Generative Models (11 min). The second half includes our "Detecting CNN-generated images" work, just accepted to CVPR.


Internship

If you have similar interests and are interested in collaborating during a summer 2021 internship, I would be happy to hear from you. Timing-wise, please contact me after the CVPR deadline (Nov 15), unless you have expiring offers. This is because I am still focusing on current projects and do not know headcount for next year. Tell me about your past research experience and what you would potentially like to do. The goal of an internship is a publication, usually CVPR or SIGGRAPH. Interns are typically PhD students; the number of slots is limited, so we unfortunately cannot accept everyone.


Publications

    Swapping Autoencoder for Deep Image Manipulation
Taesung Park, Jun-Yan Zhu, Oliver Wang, Jingwan Lu, Eli Shechtman, Alexei A. Efros, Richard Zhang
To appear in NeurIPS, 2020.
[Paper] [Webpage] [Video] [Bibtex]
  Contrastive Learning for Unpaired Image-to-Image Translation
Taesung Park, Alexei A. Efros, Richard Zhang, Jun-Yan Zhu
In ECCV, 2020.
[Paper] [Webpage] [GitHub] [Teaser] [Video] [Bibtex]
Transforming and Projecting Images into Class-conditional Generative Networks
Minyoung Huh, Richard Zhang, Jun-Yan Zhu, Sylvain Paris, Aaron Hertzmann
In ECCV (oral), 2020.
[Paper] [Webpage] [GitHub] [Bibtex]
A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences
Pranay Manocha, Adam Finkelstein, Richard Zhang, Nicholas J. Bryan, Gautham J. Mysore, Zeyu Jin
In Interspeech, 2020.
[Paper] [Webpage] [GitHub] [Bibtex]
CNN-generated images are surprisingly easy to spot...for now
Sheng-Yu Wang, Oliver Wang, Richard Zhang, Andrew Owens, Alexei A. Efros
In CVPR, 2020 (oral).
[Paper] [Webpage] [GitHub] [Talk] [Bibtex]
Deep Parametric Shape Predictions using Distance Fields
Dmitriy Smirnov, Matthew Fisher, Vladimir G. Kim, Richard Zhang, Justin Solomon
In CVPR, 2020.
[Paper] [Webpage] [GitHub] [Video] [Bibtex]
Image Morphing with Perceptual Constraints and STN Alignment
Noa Fish, Richard Zhang, Lilach Perry, Daniel Cohen-Or, Eli Shechtman, Connelly Barnes
In CGF, 2020.
[Paper] [GitHub] [Bibtex]
Making Convolutional Networks Shift-Invariant Again
Richard Zhang
In ICML, 2019.
[Paper] [Webpage] [GitHub] [Talk] [Slides (122mb)] [Poster] [Bibtex]
Detecting Photoshopped Faces by Scripting Photoshop
Sheng-Yu Wang, Oliver Wang, Andrew Owens, Richard Zhang, Alexei A. Efros
In ICCV, 2019.
[Paper] [Webpage] [GitHub] [Video] [Poster] [Adobe Max] [Adobe Blog] [Bibtex]
Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation
Arnab Ghosh, Richard Zhang, Puneet Dokania, Oliver Wang, Alexei A. Efros, Philip H.S. Torr, Eli Shechtman In ICCV, 2019.
[Paper] [Webpage] [GitHub] [Video] [Bibtex]
The Unreasonable Effectiveness of Deep Features as a Perceptual Metric
Richard Zhang, Phillip Isola, Alexei A. Efros, Eli Shechtman, Oliver Wang
In CVPR, 2018.
[Paper] [Webpage] [GitHub] [Poster] [Adobe Blog] [Two Min Papers] [Bibtex]
Stochastic Adversarial Video Prediction
Alex X. Lee, Richard Zhang, Frederik Ebert, Pieter Abbeel, Chelsea Finn, Sergey Levine
In ArXiv, 2018.
[Paper] [Webpage] [GitHub] [Bibtex]
Toward Multimodal Image-to-Image Translation
Jun-Yan Zhu, Richard Zhang, Deepak Pathak, Trevor Darrell, Alexei A. Efros, Oliver Wang, Eli Shechtman
In NIPS, 2017.
[Paper] [Webpage] [GitHub] [Video (YouTube)(mp4)] [Poster] [Two Min Papers] [Bibtex]
Real-Time User-Guided Image Colorization with Learned Deep Priors
Richard Zhang*, Jun-Yan Zhu*, Phillip Isola, Xinyang Geng, Angela S. Lin, Tianhe Yu, Alexei A. Efros
(*indicates equal contribution)
In SIGGRAPH, 2017.
[Paper] [Webpage] [Fastforward] [Talk] [Video (YouTube)(mp4)] [PSE 2020] [GitHub] [Slides (141mb)] [Bibtex]
Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction
Richard Zhang, Phillip Isola, Alexei A. Efros
In CVPR, 2017.
[Paper] [Webpage] [GitHub] [Poster] [Seminar Talk] [Bibtex]
Colorful Image Colorization
Richard Zhang, Phillip Isola, Alexei A. Efros
In ECCV, 2016 (oral).
[Paper] [Webpage] [GitHub] [Talk] [Slides (138mb)] [Poster] [Bibtex]
Sensor Fusion for Semantic Segmentation of Urban Scenes
Richard Zhang, Stefan Candra, Kai Vetter, Avideh Zakhor
In ICRA, 2015.
[Paper (pdf)(official)] [Slides] [Poster] [Talk] [Annotations (tar)(zip) ] [Bibtex]
Automatic Identification of Window Regions on Indoor Point Clouds Using LiDAR and Cameras
Richard Zhang, Avideh Zakhor
In WACV, 2014.
[Paper (pdf)(official)] [Bibtex]

Thesis

Image Synthesis for Self-Supervised Visual Representation Learning
Richard Zhang
Spring 2018.
[Thesis] [Dissertation Talk] [Fast Forward] [Slides (396 MB)] [Bibtex]

Awards

Reviewer recognitions, CVPR 2019, NeurIPS 2019, ECCV 2020
Thesis Fast Forward, Best Presentation, SIGGRAPH 2018
Adobe Research Fellowship 2017

Student collaborators/interns

I have gotten to work with some wonderful collaborators.

@Adobe
PhD [interns]
Lucy Chai, MIT
Taesung Park, UC Berkeley (Fellowship winner, 2020)
Ji Lin, MIT
William (Bill) Peebles, MIT
Alex Andonian, MIT
Steven Liu, MIT
Utkarsh Ojha, UC Davis
Arnab Ghosh, Oxford
Minyoung (Jacob) Huh, MIT
Tamar Rott Shaham, Technion (Fellowship winner, 2020)
Peiye Zhuang, UIUC
Dima Smirnov, MIT
Noa Fish, Tel Aviv

Masters [interns]
Seungjoo Yoo, Korea Univ (WIT scholarship winner, 2019)

PhD [university collaborators]
Pranay Manocha, Princeton
Rawan Alghofaili, George Mason
Alvin Wan, UC Berkeley

Undergrad [university collaborators]
Sheng-Yu Wang, UC Berkeley
@Berkeley
Undergraduates
Xin Qin, now @ USC
Hemang Jangle
Angela S. Lin, now @ UT Austin
Xinyang Geng, now @ UC Berkeley
Tianhe Yu, now @ Stanford
Stefan A. Candra

Teaching

Introduction to Artificial Intelligence (CS 188), UC Berkeley
Graduate Student Instructor (GSI) with Prof. Anca Dragan
Spring 2017

Computer Vision (CS 280), UC Berkeley
Graduate Student Instructor (GSI) with Prof. Alexei A. Efros, Prof. Trevor Darrell
Spring 2016

Introduction to Circuits (ECE 2100), Cornell University
Teaching Assistant (TA) with Prof. Alyosha Molnar
Spring 2010

My Name

Confused by the contents of this page? Well, you may have been looking for Professor Richard Zhang or Professor Richard Zhang. My Chinese name is 章睿嘉.