Xiaodan Liang

Contrastive Instruction-Trajectory Learning for Vision-Language Navigation

Dec 09, 2021
Via arXiv

Towards Scalable Unpaired Virtual Try-On via Patch-Routed Spatially-Adaptive GAN

Nov 20, 2021
Via arXiv

FILIP: Fine-grained Interactive Language-Image Pre-Training

Nov 09, 2021
Via arXiv

IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning

Nov 07, 2021
Via arXiv

UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model

Oct 28, 2021
Via arXiv

Image Comes Dancing with Collaborative Parsing-Flow Video Synthesis

Oct 28, 2021
Via arXiv

Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition

Oct 09, 2021
Via arXiv

DS-Net++: Dynamic Weight Slicing for Efficient Inference in CNNs and Transformers

Sep 21, 2021
Via arXiv

EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation

Sep 16, 2021
Via arXiv

Voxel Transformer for 3D Object Detection

Sep 13, 2021
Via arXiv