Alert button
Picture for Yongming Rao

Yongming Rao

Alert button

Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models

Add code
Bookmark button
Alert button
Mar 21, 2024
Zuyan Liu, Yuhao Dong, Yongming Rao, Jie Zhou, Jiwen Lu

Figure 1 for Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Figure 2 for Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Figure 3 for Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Figure 4 for Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Viaarxiv icon

Generative Multimodal Models are In-Context Learners

Add code
Bookmark button
Alert button
Dec 20, 2023
Quan Sun, Yufeng Cui, Xiaosong Zhang, Fan Zhang, Qiying Yu, Zhengxiong Luo, Yueze Wang, Yongming Rao, Jingjing Liu, Tiejun Huang, Xinlong Wang

Viaarxiv icon

Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior

Add code
Bookmark button
Alert button
Dec 11, 2023
Fangfu Liu, Diankun Wu, Yi Wei, Yongming Rao, Yueqi Duan

Viaarxiv icon

TCOVIS: Temporally Consistent Online Video Instance Segmentation

Add code
Bookmark button
Alert button
Sep 21, 2023
Junlong Li, Bingyao Yu, Yongming Rao, Jie Zhou, Jiwen Lu

Figure 1 for TCOVIS: Temporally Consistent Online Video Instance Segmentation
Figure 2 for TCOVIS: Temporally Consistent Online Video Instance Segmentation
Figure 3 for TCOVIS: Temporally Consistent Online Video Instance Segmentation
Figure 4 for TCOVIS: Temporally Consistent Online Video Instance Segmentation
Viaarxiv icon

Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models

Add code
Bookmark button
Alert button
Jul 27, 2023
Ziyi Wang, Xumin Yu, Yongming Rao, Jie Zhou, Jiwen Lu

Figure 1 for Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models
Figure 2 for Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models
Figure 3 for Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models
Figure 4 for Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models
Viaarxiv icon

Unleashing Text-to-Image Diffusion Models for Visual Perception

Add code
Bookmark button
Alert button
Mar 03, 2023
Wenliang Zhao, Yongming Rao, Zuyan Liu, Benlin Liu, Jie Zhou, Jiwen Lu

Figure 1 for Unleashing Text-to-Image Diffusion Models for Visual Perception
Figure 2 for Unleashing Text-to-Image Diffusion Models for Visual Perception
Figure 3 for Unleashing Text-to-Image Diffusion Models for Visual Perception
Figure 4 for Unleashing Text-to-Image Diffusion Models for Visual Perception
Viaarxiv icon

UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models

Add code
Bookmark button
Alert button
Feb 12, 2023
Wenliang Zhao, Lujia Bai, Yongming Rao, Jie Zhou, Jiwen Lu

Figure 1 for UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models
Figure 2 for UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models
Figure 3 for UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models
Figure 4 for UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models
Viaarxiv icon

AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers

Add code
Bookmark button
Alert button
Jan 11, 2023
Xumin Yu, Yongming Rao, Ziyi Wang, Jiwen Lu, Jie Zhou

Figure 1 for AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers
Figure 2 for AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers
Figure 3 for AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers
Figure 4 for AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers
Viaarxiv icon

FLAG3D: A 3D Fitness Activity Dataset with Language Instruction

Add code
Bookmark button
Alert button
Dec 09, 2022
Yansong Tang, Jinpeng Liu, Aoyang Liu, Bin Yang, Wenxun Dai, Yongming Rao, Jiwen Lu, Jie Zhou, Xiu Li

Figure 1 for FLAG3D: A 3D Fitness Activity Dataset with Language Instruction
Figure 2 for FLAG3D: A 3D Fitness Activity Dataset with Language Instruction
Figure 3 for FLAG3D: A 3D Fitness Activity Dataset with Language Instruction
Figure 4 for FLAG3D: A 3D Fitness Activity Dataset with Language Instruction
Viaarxiv icon