Picture for Xiaodan Liang

Xiaodan Liang

Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards

Add code
Jun 01, 2023
Figure 1 for Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards
Figure 2 for Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards
Figure 3 for Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards
Figure 4 for Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards
Viaarxiv icon

Boosting Visual-Language Models by Exploiting Hard Samples

Add code
May 09, 2023
Viaarxiv icon

Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining

Add code
Apr 26, 2023
Figure 1 for Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining
Figure 2 for Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining
Figure 3 for Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining
Figure 4 for Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining
Viaarxiv icon

LiDAR-NeRF: Novel LiDAR View Synthesis via Neural Radiance Fields

Add code
Apr 20, 2023
Viaarxiv icon

DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment

Add code
Apr 10, 2023
Figure 1 for DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment
Figure 2 for DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment
Figure 3 for DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment
Figure 4 for DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment
Viaarxiv icon

CLIP$^2$: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data

Add code
Mar 26, 2023
Figure 1 for CLIP$^2$: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data
Figure 2 for CLIP$^2$: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data
Figure 3 for CLIP$^2$: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data
Figure 4 for CLIP$^2$: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data
Viaarxiv icon

GP-VTON: Towards General Purpose Virtual Try-on via Collaborative Local-Flow Global-Parsing Learning

Add code
Mar 24, 2023
Viaarxiv icon

Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation

Add code
Mar 18, 2023
Figure 1 for Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation
Figure 2 for Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation
Figure 3 for Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation
Figure 4 for Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation
Viaarxiv icon

CapDet: Unifying Dense Captioning and Open-World Detection Pretraining

Add code
Mar 15, 2023
Figure 1 for CapDet: Unifying Dense Captioning and Open-World Detection Pretraining
Figure 2 for CapDet: Unifying Dense Captioning and Open-World Detection Pretraining
Figure 3 for CapDet: Unifying Dense Captioning and Open-World Detection Pretraining
Figure 4 for CapDet: Unifying Dense Captioning and Open-World Detection Pretraining
Viaarxiv icon

Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving

Add code
Mar 03, 2023
Viaarxiv icon