Alert button
Picture for Peng Gao

Peng Gao

Alert button

University of Massachusetts Amherst

ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning

Jan 10, 2024
Fanqing Meng, Wenqi Shao, Quanfeng Lu, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo

Viaarxiv icon

LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding

Dec 21, 2023
Senqiao Yang, Jiaming Liu, Ray Zhang, Mingjie Pan, Zoey Guo, Xiaoqi Li, Zehui Chen, Peng Gao, Yandong Guo, Shanghang Zhang

Viaarxiv icon

Compositional Zero-Shot Learning for Attribute-Based Object Reference in Human-Robot Interaction

Dec 21, 2023
Peng Gao, Ahmed Jaafar, Brian Reily, Christopher Reardon, Hao Zhang

Viaarxiv icon

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

Dec 20, 2023
Chaoyou Fu, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun

Viaarxiv icon

3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V

Dec 15, 2023
Dingning Liu, Xiaomeng Dong, Renrui Zhang, Xu Luo, Peng Gao, Xiaoshui Huang, Yongshun Gong, Zhihui Wang

Figure 1 for 3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V
Figure 2 for 3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V
Figure 3 for 3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V
Figure 4 for 3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V
Viaarxiv icon

Digital Life Project: Autonomous 3D Characters with Social Intelligence

Dec 07, 2023
Zhongang Cai, Jianping Jiang, Zhongfei Qing, Xinying Guo, Mingyuan Zhang, Zhengyu Lin, Haiyi Mei, Chen Wei, Ruisi Wang, Wanqi Yin, Xiangyu Fan, Han Du, Liang Pan, Peng Gao, Zhitao Yang, Yang Gao, Jiaqi Li, Tianxiang Ren, Yukun Wei, Xiaogang Wang, Chen Change Loy, Lei Yang, Ziwei Liu

Figure 1 for Digital Life Project: Autonomous 3D Characters with Social Intelligence
Figure 2 for Digital Life Project: Autonomous 3D Characters with Social Intelligence
Figure 3 for Digital Life Project: Autonomous 3D Characters with Social Intelligence
Figure 4 for Digital Life Project: Autonomous 3D Characters with Social Intelligence
Viaarxiv icon

OneLLM: One Framework to Align All Modalities with Language

Dec 06, 2023
Jiaming Han, Kaixiong Gong, Yiyuan Zhang, Jiaqi Wang, Kaipeng Zhang, Dahua Lin, Yu Qiao, Peng Gao, Xiangyu Yue

Figure 1 for OneLLM: One Framework to Align All Modalities with Language
Figure 2 for OneLLM: One Framework to Align All Modalities with Language
Figure 3 for OneLLM: One Framework to Align All Modalities with Language
Figure 4 for OneLLM: One Framework to Align All Modalities with Language
Viaarxiv icon

ChatIllusion: Efficient-Aligning Interleaved Generation ability with Visual Instruction Model

Nov 29, 2023
Xiaowei Chi, Yijiang Liu, Zhengkai Jiang, Rongyu Zhang, Ziyi Lin, Renrui Zhang, Peng Gao, Chaoyou Fu, Shanghang Zhang, Qifeng Liu, Yike Guo

Viaarxiv icon

SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models

Nov 13, 2023
Ziyi Lin, Chris Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Chen Lin, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Hongsheng Li, Yu Qiao

Viaarxiv icon