Alert button
Picture for Yongming Rao

Yongming Rao

Alert button

Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models

Mar 21, 2024
Zuyan Liu, Yuhao Dong, Yongming Rao, Jie Zhou, Jiwen Lu

Viaarxiv icon

Generative Multimodal Models are In-Context Learners

Dec 20, 2023
Quan Sun, Yufeng Cui, Xiaosong Zhang, Fan Zhang, Qiying Yu, Zhengxiong Luo, Yueze Wang, Yongming Rao, Jingjing Liu, Tiejun Huang, Xinlong Wang

Viaarxiv icon

Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior

Dec 11, 2023
Fangfu Liu, Diankun Wu, Yi Wei, Yongming Rao, Yueqi Duan

Viaarxiv icon

TCOVIS: Temporally Consistent Online Video Instance Segmentation

Sep 21, 2023
Junlong Li, Bingyao Yu, Yongming Rao, Jie Zhou, Jiwen Lu

Figure 1 for TCOVIS: Temporally Consistent Online Video Instance Segmentation
Figure 2 for TCOVIS: Temporally Consistent Online Video Instance Segmentation
Figure 3 for TCOVIS: Temporally Consistent Online Video Instance Segmentation
Figure 4 for TCOVIS: Temporally Consistent Online Video Instance Segmentation
Viaarxiv icon

Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models

Jul 27, 2023
Ziyi Wang, Xumin Yu, Yongming Rao, Jie Zhou, Jiwen Lu

Figure 1 for Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models
Figure 2 for Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models
Figure 3 for Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models
Figure 4 for Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models
Viaarxiv icon

Unleashing Text-to-Image Diffusion Models for Visual Perception

Mar 03, 2023
Wenliang Zhao, Yongming Rao, Zuyan Liu, Benlin Liu, Jie Zhou, Jiwen Lu

Figure 1 for Unleashing Text-to-Image Diffusion Models for Visual Perception
Figure 2 for Unleashing Text-to-Image Diffusion Models for Visual Perception
Figure 3 for Unleashing Text-to-Image Diffusion Models for Visual Perception
Figure 4 for Unleashing Text-to-Image Diffusion Models for Visual Perception
Viaarxiv icon

UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models

Feb 12, 2023
Wenliang Zhao, Lujia Bai, Yongming Rao, Jie Zhou, Jiwen Lu

Figure 1 for UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models
Figure 2 for UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models
Figure 3 for UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models
Figure 4 for UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models
Viaarxiv icon

AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers

Jan 11, 2023
Xumin Yu, Yongming Rao, Ziyi Wang, Jiwen Lu, Jie Zhou

Figure 1 for AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers
Figure 2 for AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers
Figure 3 for AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers
Figure 4 for AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers
Viaarxiv icon

FLAG3D: A 3D Fitness Activity Dataset with Language Instruction

Dec 09, 2022
Yansong Tang, Jinpeng Liu, Aoyang Liu, Bin Yang, Wenxun Dai, Yongming Rao, Jiwen Lu, Jie Zhou, Xiu Li

Figure 1 for FLAG3D: A 3D Fitness Activity Dataset with Language Instruction
Figure 2 for FLAG3D: A 3D Fitness Activity Dataset with Language Instruction
Figure 3 for FLAG3D: A 3D Fitness Activity Dataset with Language Instruction
Figure 4 for FLAG3D: A 3D Fitness Activity Dataset with Language Instruction
Viaarxiv icon