Peng Gao

No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation

Apr 05, 2024
Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Han Xiao, Chaoyou Fu, Hao Dong, Peng Gao

Multi-Robot Collaborative Navigation with Formation Adaptation

Apr 02, 2024
Zihao Deng, Peng Gao, Williard Joshua Jose, Hao Zhang

Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want

Apr 01, 2024
Weifeng Lin, Xinyu Wei, Ruichuan An, Peng Gao, Bocheng Zou, Yulin Luo, Siyuan Huang, Shanghang Zhang, Hongsheng Li

CT Synthesis with Conditional Diffusion Models for Abdominal Lymph Node Segmentation

Mar 26, 2024
Yongrui Yu, Hanyu Chen, Zitian Zhang, Qiong Xiao, Wenhui Lei, Linrui Dai, Yu Fu, Hui Tan, Guan Wang, Peng Gao, Xiaofan Zhang

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Mar 21, 2024
Renrui Zhang, Dongzhi Jiang, Yichi Zhang, Haokun Lin, Ziyu Guo, Pengshuo Qiu, Aojun Zhou, Pan Lu, Kai-Wei Chang, Peng Gao, Hongsheng Li

ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models

Mar 17, 2024
Siyuan Huang, Iaroslav Ponomarenko, Zhengkai Jiang, Xiaoqi Li, Xiaobin Hu, Peng Gao, Hongsheng Li, Hao Dong

Masked AutoDecoder is Effective Multi-Task Vision Generalist

Mar 14, 2024
Han Qiu, Jiaxing Huang, Peng Gao, Lewei Lu, Xiaoqin Zhang, Shijian Lu

In Defense and Revival of Bayesian Filtering for Thermal Infrared Object Tracking

Feb 27, 2024
Peng Gao, Shi-Min Li, Feng Gao, Fei Wang, Ru-Yue Yuan, Hamido Fujita

Searching a Lightweight Network Architecture for Thermal Infrared Pedestrian Tracking

Feb 26, 2024
Peng Gao, Xiao Liu, Yu Wang, Ru-Yue Yuan

YOLO-TLA: An Efficient and Lightweight Small Object Detection Model based on YOLOv5

Feb 22, 2024
Peng Gao, Chun-Lin Ji, Tao Yu, Ru-Yue Yuan
