Alert button
Picture for Hang Xu

Hang Xu

Alert button

DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability

Add code
Bookmark button
Alert button
Aug 18, 2023
Runhui Huang, Jianhua Han, Guansong Lu, Xiaodan Liang, Yihan Zeng, Wei Zhang, Hang Xu

Figure 1 for DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability
Figure 2 for DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability
Figure 3 for DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability
Figure 4 for DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability
Viaarxiv icon

MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation

Add code
Bookmark button
Alert button
Aug 09, 2023
Kaixin Cai, Pengzhen Ren, Yi Zhu, Hang Xu, Jianzhuang Liu, Changlin Li, Guangrun Wang, Xiaodan Liang

Figure 1 for MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation
Figure 2 for MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation
Figure 3 for MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation
Figure 4 for MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation
Viaarxiv icon

PARTNER: Level up the Polar Representation for LiDAR 3D Object Detection

Add code
Bookmark button
Alert button
Aug 08, 2023
Ming Nie, Yujing Xue, Chunwei Wang, Chaoqiang Ye, Hang Xu, Xinge Zhu, Qingqiu Huang, Michael Bi Mi, Xinchao Wang, Li Zhang

Figure 1 for PARTNER: Level up the Polar Representation for LiDAR 3D Object Detection
Figure 2 for PARTNER: Level up the Polar Representation for LiDAR 3D Object Detection
Figure 3 for PARTNER: Level up the Polar Representation for LiDAR 3D Object Detection
Figure 4 for PARTNER: Level up the Polar Representation for LiDAR 3D Object Detection
Viaarxiv icon

FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration

Add code
Bookmark button
Alert button
Jul 31, 2023
Zhijian Huang, Sihao Lin, Guiyu Liu, Mukun Luo, Chaoqiang Ye, Hang Xu, Xiaojun Chang, Xiaodan Liang

Figure 1 for FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Figure 2 for FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Figure 3 for FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Figure 4 for FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Viaarxiv icon

SUIT: Learning Significance-guided Information for 3D Temporal Detection

Add code
Bookmark button
Alert button
Jul 04, 2023
Zheyuan Zhou, Jiachen Lu, Yihan Zeng, Hang Xu, Li Zhang

Figure 1 for SUIT: Learning Significance-guided Information for 3D Temporal Detection
Figure 2 for SUIT: Learning Significance-guided Information for 3D Temporal Detection
Figure 3 for SUIT: Learning Significance-guided Information for 3D Temporal Detection
Figure 4 for SUIT: Learning Significance-guided Information for 3D Temporal Detection
Viaarxiv icon

MO-VLN: A Multi-Task Benchmark for Open-set Zero-Shot Vision-and-Language Navigation

Add code
Bookmark button
Alert button
Jun 17, 2023
Xiwen Liang, Liang Ma, Shanshan Guo, Jianhua Han, Hang Xu, Shikui Ma, Xiaodan Liang

Figure 1 for MO-VLN: A Multi-Task Benchmark for Open-set Zero-Shot Vision-and-Language Navigation
Figure 2 for MO-VLN: A Multi-Task Benchmark for Open-set Zero-Shot Vision-and-Language Navigation
Figure 3 for MO-VLN: A Multi-Task Benchmark for Open-set Zero-Shot Vision-and-Language Navigation
Figure 4 for MO-VLN: A Multi-Task Benchmark for Open-set Zero-Shot Vision-and-Language Navigation
Viaarxiv icon

Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards

Add code
Bookmark button
Alert button
Jun 01, 2023
Guian Fang, Zutao Jiang, Jianhua Han, Guansong Lu, Hang Xu, Xiaodan Liang

Figure 1 for Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards
Figure 2 for Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards
Figure 3 for Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards
Figure 4 for Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards
Viaarxiv icon

DetGPT: Detect What You Need via Reasoning

Add code
Bookmark button
Alert button
May 24, 2023
Renjie Pi, Jiahui Gao, Shizhe Diao, Rui Pan, Hanze Dong, Jipeng Zhang, Lewei Yao, Jianhua Han, Hang Xu, Lingpeng Kong, Tong Zhang

Figure 1 for DetGPT: Detect What You Need via Reasoning
Figure 2 for DetGPT: Detect What You Need via Reasoning
Figure 3 for DetGPT: Detect What You Need via Reasoning
Figure 4 for DetGPT: Detect What You Need via Reasoning
Viaarxiv icon

Rethinking Boundary Discontinuity Problem for Oriented Object Detection

Add code
Bookmark button
Alert button
May 17, 2023
Hang Xu, Xinyuan Liu, Haonan Xu, Yike Ma, Zunjie Zhu, Chenggang Yan, Feng Dai

Figure 1 for Rethinking Boundary Discontinuity Problem for Oriented Object Detection
Figure 2 for Rethinking Boundary Discontinuity Problem for Oriented Object Detection
Figure 3 for Rethinking Boundary Discontinuity Problem for Oriented Object Detection
Figure 4 for Rethinking Boundary Discontinuity Problem for Oriented Object Detection
Viaarxiv icon

Boosting Visual-Language Models by Exploiting Hard Samples

Add code
Bookmark button
Alert button
May 09, 2023
Haonan Wang, Minbin Huang, Runhui Huang, Lanqing Hong, Hang Xu, Tianyang Hu, Xiaodan Liang, Zhenguo Li

Figure 1 for Boosting Visual-Language Models by Exploiting Hard Samples
Figure 2 for Boosting Visual-Language Models by Exploiting Hard Samples
Figure 3 for Boosting Visual-Language Models by Exploiting Hard Samples
Figure 4 for Boosting Visual-Language Models by Exploiting Hard Samples
Viaarxiv icon