Picture for Dongdong Chen

Dongdong Chen

Vector Quantized Diffusion Model for Text-to-Image Synthesis

Add code
Dec 20, 2021
Figure 1 for Vector Quantized Diffusion Model for Text-to-Image Synthesis
Figure 2 for Vector Quantized Diffusion Model for Text-to-Image Synthesis
Figure 3 for Vector Quantized Diffusion Model for Text-to-Image Synthesis
Figure 4 for Vector Quantized Diffusion Model for Text-to-Image Synthesis
Viaarxiv icon

3D Question Answering

Add code
Dec 15, 2021
Figure 1 for 3D Question Answering
Figure 2 for 3D Question Answering
Figure 3 for 3D Question Answering
Figure 4 for 3D Question Answering
Viaarxiv icon

HairCLIP: Design Your Hair by Text and Reference Image

Add code
Dec 09, 2021
Figure 1 for HairCLIP: Design Your Hair by Text and Reference Image
Figure 2 for HairCLIP: Design Your Hair by Text and Reference Image
Figure 3 for HairCLIP: Design Your Hair by Text and Reference Image
Figure 4 for HairCLIP: Design Your Hair by Text and Reference Image
Viaarxiv icon

CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields

Add code
Dec 09, 2021
Figure 1 for CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields
Figure 2 for CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields
Figure 3 for CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields
Figure 4 for CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields
Viaarxiv icon

General Facial Representation Learning in a Visual-Linguistic Manner

Add code
Dec 06, 2021
Figure 1 for General Facial Representation Learning in a Visual-Linguistic Manner
Figure 2 for General Facial Representation Learning in a Visual-Linguistic Manner
Figure 3 for General Facial Representation Learning in a Visual-Linguistic Manner
Figure 4 for General Facial Representation Learning in a Visual-Linguistic Manner
Viaarxiv icon

BEVT: BERT Pretraining of Video Transformers

Add code
Dec 02, 2021
Figure 1 for BEVT: BERT Pretraining of Video Transformers
Figure 2 for BEVT: BERT Pretraining of Video Transformers
Figure 3 for BEVT: BERT Pretraining of Video Transformers
Figure 4 for BEVT: BERT Pretraining of Video Transformers
Viaarxiv icon

Robust Equivariant Imaging: a fully unsupervised framework for learning to image from noisy and partial measurements

Add code
Nov 25, 2021
Figure 1 for Robust Equivariant Imaging: a fully unsupervised framework for learning to image from noisy and partial measurements
Figure 2 for Robust Equivariant Imaging: a fully unsupervised framework for learning to image from noisy and partial measurements
Figure 3 for Robust Equivariant Imaging: a fully unsupervised framework for learning to image from noisy and partial measurements
Figure 4 for Robust Equivariant Imaging: a fully unsupervised framework for learning to image from noisy and partial measurements
Viaarxiv icon

PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers

Add code
Nov 24, 2021
Figure 1 for PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers
Figure 2 for PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers
Figure 3 for PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers
Figure 4 for PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers
Viaarxiv icon

Florence: A New Foundation Model for Computer Vision

Add code
Nov 22, 2021
Figure 1 for Florence: A New Foundation Model for Computer Vision
Figure 2 for Florence: A New Foundation Model for Computer Vision
Figure 3 for Florence: A New Foundation Model for Computer Vision
Figure 4 for Florence: A New Foundation Model for Computer Vision
Viaarxiv icon

Unsupervised Finetuning

Add code
Oct 18, 2021
Figure 1 for Unsupervised Finetuning
Figure 2 for Unsupervised Finetuning
Figure 3 for Unsupervised Finetuning
Figure 4 for Unsupervised Finetuning
Viaarxiv icon