Alert button
Picture for Xiaodan Liang

Xiaodan Liang

Alert button

DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability

Add code
Bookmark button
Alert button
Aug 18, 2023
Runhui Huang, Jianhua Han, Guansong Lu, Xiaodan Liang, Yihan Zeng, Wei Zhang, Hang Xu

Figure 1 for DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability
Figure 2 for DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability
Figure 3 for DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability
Figure 4 for DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability
Viaarxiv icon

CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation

Add code
Bookmark button
Alert button
Aug 14, 2023
Hongguang Zhu, Yunchao Wei, Xiaodan Liang, Chunjie Zhang, Yao Zhao

Figure 1 for CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation
Figure 2 for CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation
Figure 3 for CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation
Figure 4 for CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation
Viaarxiv icon

LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts

Add code
Bookmark button
Alert button
Aug 13, 2023
Binbin Yang, Yi Luo, Ziliang Chen, Guangrun Wang, Xiaodan Liang, Liang Lin

Figure 1 for LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Figure 2 for LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Figure 3 for LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Figure 4 for LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Viaarxiv icon

MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation

Add code
Bookmark button
Alert button
Aug 09, 2023
Kaixin Cai, Pengzhen Ren, Yi Zhu, Hang Xu, Jianzhuang Liu, Changlin Li, Guangrun Wang, Xiaodan Liang

Figure 1 for MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation
Figure 2 for MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation
Figure 3 for MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation
Figure 4 for MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation
Viaarxiv icon

FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration

Add code
Bookmark button
Alert button
Jul 31, 2023
Zhijian Huang, Sihao Lin, Guiyu Liu, Mukun Luo, Chaoqiang Ye, Hang Xu, Xiaojun Chang, Xiaodan Liang

Figure 1 for FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Figure 2 for FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Figure 3 for FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Figure 4 for FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Viaarxiv icon

Fashion Matrix: Editing Photos by Just Talking

Add code
Bookmark button
Alert button
Jul 25, 2023
Zheng Chong, Xujie Zhang, Fuwei Zhao, Zhenyu Xie, Xiaodan Liang

Figure 1 for Fashion Matrix: Editing Photos by Just Talking
Figure 2 for Fashion Matrix: Editing Photos by Just Talking
Figure 3 for Fashion Matrix: Editing Photos by Just Talking
Figure 4 for Fashion Matrix: Editing Photos by Just Talking
Viaarxiv icon

RM-PRT: Realistic Robotic Manipulation Simulator and Benchmark with Progressive Reasoning Tasks

Add code
Bookmark button
Alert button
Jun 21, 2023
Pengzhen Ren, Kaidong Zhang, Hetao Zheng, Zixuan Li, Yuhang Wen, Fengda Zhu, Mas Ma, Xiaodan Liang

Figure 1 for RM-PRT: Realistic Robotic Manipulation Simulator and Benchmark with Progressive Reasoning Tasks
Figure 2 for RM-PRT: Realistic Robotic Manipulation Simulator and Benchmark with Progressive Reasoning Tasks
Figure 3 for RM-PRT: Realistic Robotic Manipulation Simulator and Benchmark with Progressive Reasoning Tasks
Figure 4 for RM-PRT: Realistic Robotic Manipulation Simulator and Benchmark with Progressive Reasoning Tasks
Viaarxiv icon

MO-VLN: A Multi-Task Benchmark for Open-set Zero-Shot Vision-and-Language Navigation

Add code
Bookmark button
Alert button
Jun 17, 2023
Xiwen Liang, Liang Ma, Shanshan Guo, Jianhua Han, Hang Xu, Shikui Ma, Xiaodan Liang

Figure 1 for MO-VLN: A Multi-Task Benchmark for Open-set Zero-Shot Vision-and-Language Navigation
Figure 2 for MO-VLN: A Multi-Task Benchmark for Open-set Zero-Shot Vision-and-Language Navigation
Figure 3 for MO-VLN: A Multi-Task Benchmark for Open-set Zero-Shot Vision-and-Language Navigation
Figure 4 for MO-VLN: A Multi-Task Benchmark for Open-set Zero-Shot Vision-and-Language Navigation
Viaarxiv icon

UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning

Add code
Bookmark button
Alert button
Jun 01, 2023
Xiao Dong, Runhui Huang, Xiaoyong Wei, Zequn Jie, Jianxing Yu, Jian Yin, Xiaodan Liang

Figure 1 for UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning
Figure 2 for UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning
Figure 3 for UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning
Figure 4 for UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning
Viaarxiv icon

Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards

Add code
Bookmark button
Alert button
Jun 01, 2023
Guian Fang, Zutao Jiang, Jianhua Han, Guansong Lu, Hang Xu, Xiaodan Liang

Figure 1 for Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards
Figure 2 for Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards
Figure 3 for Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards
Figure 4 for Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards
Viaarxiv icon