Picture for Xiaodan Liang

Xiaodan Liang

DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection

Add code
Sep 20, 2022
Figure 1 for DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
Figure 2 for DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
Figure 3 for DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
Figure 4 for DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
Viaarxiv icon

Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving

Add code
Sep 19, 2022
Figure 1 for Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving
Figure 2 for Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving
Figure 3 for Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving
Figure 4 for Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving
Viaarxiv icon

ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion Design

Add code
Aug 11, 2022
Figure 1 for ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion Design
Figure 2 for ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion Design
Figure 3 for ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion Design
Figure 4 for ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion Design
Viaarxiv icon

Composable Text Control Operations in Latent Space with Ordinary Differential Equations

Add code
Aug 01, 2022
Figure 1 for Composable Text Control Operations in Latent Space with Ordinary Differential Equations
Figure 2 for Composable Text Control Operations in Latent Space with Ordinary Differential Equations
Figure 3 for Composable Text Control Operations in Latent Space with Ordinary Differential Equations
Figure 4 for Composable Text Control Operations in Latent Space with Ordinary Differential Equations
Viaarxiv icon

PASTA-GAN++: A Versatile Framework for High-Resolution Unpaired Virtual Try-on

Add code
Jul 27, 2022
Figure 1 for PASTA-GAN++: A Versatile Framework for High-Resolution Unpaired Virtual Try-on
Figure 2 for PASTA-GAN++: A Versatile Framework for High-Resolution Unpaired Virtual Try-on
Figure 3 for PASTA-GAN++: A Versatile Framework for High-Resolution Unpaired Virtual Try-on
Figure 4 for PASTA-GAN++: A Versatile Framework for High-Resolution Unpaired Virtual Try-on
Viaarxiv icon

SiRi: A Simple Selective Retraining Mechanism for Transformer-based Visual Grounding

Add code
Jul 27, 2022
Figure 1 for SiRi: A Simple Selective Retraining Mechanism for Transformer-based Visual Grounding
Figure 2 for SiRi: A Simple Selective Retraining Mechanism for Transformer-based Visual Grounding
Figure 3 for SiRi: A Simple Selective Retraining Mechanism for Transformer-based Visual Grounding
Figure 4 for SiRi: A Simple Selective Retraining Mechanism for Transformer-based Visual Grounding
Viaarxiv icon

Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding

Add code
Jul 19, 2022
Figure 1 for Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding
Figure 2 for Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding
Figure 3 for Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding
Figure 4 for Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding
Viaarxiv icon

Discourse-Aware Graph Networks for Textual Logical Reasoning

Add code
Jul 04, 2022
Figure 1 for Discourse-Aware Graph Networks for Textual Logical Reasoning
Figure 2 for Discourse-Aware Graph Networks for Textual Logical Reasoning
Figure 3 for Discourse-Aware Graph Networks for Textual Logical Reasoning
Figure 4 for Discourse-Aware Graph Networks for Textual Logical Reasoning
Viaarxiv icon

Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval

Add code
Jun 17, 2022
Figure 1 for Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval
Figure 2 for Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval
Figure 3 for Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval
Figure 4 for Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval
Viaarxiv icon

Cross-modal Clinical Graph Transformer for Ophthalmic Report Generation

Add code
Jun 04, 2022
Figure 1 for Cross-modal Clinical Graph Transformer for Ophthalmic Report Generation
Figure 2 for Cross-modal Clinical Graph Transformer for Ophthalmic Report Generation
Figure 3 for Cross-modal Clinical Graph Transformer for Ophthalmic Report Generation
Figure 4 for Cross-modal Clinical Graph Transformer for Ophthalmic Report Generation
Viaarxiv icon