Xiaodan Liang

MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation

Aug 09, 2023

FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration

Jul 31, 2023

Fashion Matrix: Editing Photos by Just Talking

Jul 25, 2023

RM-PRT: Realistic Robotic Manipulation Simulator and Benchmark with Progressive Reasoning Tasks

Jun 21, 2023

MO-VLN: A Multi-Task Benchmark for Open-set Zero-Shot Vision-and-Language Navigation

Jun 17, 2023

UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning

Jun 01, 2023

Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards

Jun 01, 2023

Boosting Visual-Language Models by Exploiting Hard Samples

May 09, 2023

Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining

Apr 26, 2023

LiDAR-NeRF: Novel LiDAR View Synthesis via Neural Radiance Fields

Apr 20, 2023