Alert button
Picture for Renrui Zhang

Renrui Zhang

Alert button

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Mar 21, 2024
Renrui Zhang, Dongzhi Jiang, Yichi Zhang, Haokun Lin, Ziyu Guo, Pengshuo Qiu, Aojun Zhou, Pan Lu, Kai-Wei Chang, Peng Gao, Hongsheng Li

Viaarxiv icon

OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning

Mar 14, 2024
Lingyi Hong, Shilin Yan, Renrui Zhang, Wanyun Li, Xinyu Zhou, Pinxue Guo, Kaixun Jiang, Yiting Chen, Jinglun Li, Zhaoyu Chen, Wenqiang Zhang

Viaarxiv icon

SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

Feb 08, 2024
Peng Gao, Renrui Zhang, Chris Liu, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Hongsheng Li, Yu Qiao

Viaarxiv icon

Language-Assisted 3D Scene Understanding

Dec 31, 2023
Yanmin Wu, Qiankun Gao, Renrui Zhang, Jian Zhang

Viaarxiv icon

ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation

Dec 24, 2023
Xiaoqi Li, Mingxu Zhang, Yiran Geng, Haoran Geng, Yuxing Long, Yan Shen, Renrui Zhang, Jiaming Liu, Hao Dong

Viaarxiv icon

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

Dec 20, 2023
Chaoyou Fu, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun

Viaarxiv icon

Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation

Dec 19, 2023
Jiaming Liu, Ran Xu, Senqiao Yang, Renrui Zhang, Qizhe Zhang, Zehui Chen, Yandong Guo, Shanghang Zhang

Viaarxiv icon

Gradient-based Parameter Selection for Efficient Fine-Tuning

Dec 15, 2023
Zhi Zhang, Qizhe Zhang, Zijun Gao, Renrui Zhang, Ekaterina Shutova, Shiji Zhou, Shanghang Zhang

Viaarxiv icon

3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V

Dec 15, 2023
Dingning Liu, Xiaomeng Dong, Renrui Zhang, Xu Luo, Peng Gao, Xiaoshui Huang, Yongshun Gong, Zhihui Wang

Figure 1 for 3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V
Figure 2 for 3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V
Figure 3 for 3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V
Figure 4 for 3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V
Viaarxiv icon