Alert button
Picture for Hongsheng Li

Hongsheng Li

Alert button

SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction

Add code
Bookmark button
Alert button
Mar 19, 2024
Yang Zhou, Hao Shao, Letian Wang, Steven L. Waslander, Hongsheng Li, Yu Liu

Figure 1 for SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
Figure 2 for SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
Figure 3 for SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
Figure 4 for SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
Viaarxiv icon

ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models

Add code
Bookmark button
Alert button
Mar 17, 2024
Siyuan Huang, Iaroslav Ponomarenko, Zhengkai Jiang, Xiaoqi Li, Xiaobin Hu, Peng Gao, Hongsheng Li, Hao Dong

Figure 1 for ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Figure 2 for ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Figure 3 for ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Figure 4 for ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Viaarxiv icon

GiT: Towards Generalist Vision Transformer through Universal Language Interface

Add code
Bookmark button
Alert button
Mar 14, 2024
Haiyang Wang, Hao Tang, Li Jiang, Shaoshuai Shi, Muhammad Ferjad Naeem, Hongsheng Li, Bernt Schiele, Liwei Wang

Figure 1 for GiT: Towards Generalist Vision Transformer through Universal Language Interface
Figure 2 for GiT: Towards Generalist Vision Transformer through Universal Language Interface
Figure 3 for GiT: Towards Generalist Vision Transformer through Universal Language Interface
Figure 4 for GiT: Towards Generalist Vision Transformer through Universal Language Interface
Viaarxiv icon

Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures

Add code
Bookmark button
Alert button
Mar 07, 2024
Yuchen Duan, Weiyun Wang, Zhe Chen, Xizhou Zhu, Lewei Lu, Tong Lu, Yu Qiao, Hongsheng Li, Jifeng Dai, Wenhai Wang

Figure 1 for Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
Figure 2 for Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
Figure 3 for Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
Figure 4 for Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
Viaarxiv icon

MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs

Add code
Bookmark button
Alert button
Feb 26, 2024
Zimu Lu, Aojun Zhou, Houxing Ren, Ke Wang, Weikang Shi, Junting Pan, Mingjie Zhan, Hongsheng Li

Viaarxiv icon

Measuring Multimodal Mathematical Reasoning with MATH-Vision Dataset

Add code
Bookmark button
Alert button
Feb 22, 2024
Ke Wang, Junting Pan, Weikang Shi, Zimu Lu, Mingjie Zhan, Hongsheng Li

Viaarxiv icon

Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models

Add code
Bookmark button
Alert button
Feb 22, 2024
Xudong Lu, Qi Liu, Yuhui Xu, Aojun Zhou, Siyuan Huang, Bo Zhang, Junchi Yan, Hongsheng Li

Viaarxiv icon

SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

Add code
Bookmark button
Alert button
Feb 08, 2024
Peng Gao, Renrui Zhang, Chris Liu, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Hongsheng Li, Yu Qiao

Viaarxiv icon

AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning

Add code
Bookmark button
Alert button
Feb 01, 2024
Fu-Yun Wang, Zhaoyang Huang, Xiaoyu Shi, Weikang Bian, Guanglu Song, Yu Liu, Hongsheng Li

Viaarxiv icon

Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling

Add code
Bookmark button
Alert button
Jan 31, 2024
Xiaoyu Shi, Zhaoyang Huang, Fu-Yun Wang, Weikang Bian, Dasong Li, Yi Zhang, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li

Viaarxiv icon