Alert button
Picture for Bo Zhang

Bo Zhang

Alert button

mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding

Mar 19, 2024
Anwen Hu, Haiyang Xu, Jiabo Ye, Ming Yan, Liang Zhang, Bo Zhang, Chen Li, Ji Zhang, Qin Jin, Fei Huang, Jingren Zhou

Viaarxiv icon

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Mar 11, 2024
Haoyu Lu, Wen Liu, Bo Zhang, Bingxuan Wang, Kai Dong, Bo Liu, Jingxiang Sun, Tongzheng Ren, Zhuoshu Li, Hao Yang, Yaofeng Sun, Chengqi Deng, Hanwei Xu, Zhenda Xie, Chong Ruan

Figure 1 for DeepSeek-VL: Towards Real-World Vision-Language Understanding
Figure 2 for DeepSeek-VL: Towards Real-World Vision-Language Understanding
Figure 3 for DeepSeek-VL: Towards Real-World Vision-Language Understanding
Figure 4 for DeepSeek-VL: Towards Real-World Vision-Language Understanding
Viaarxiv icon

Lemur: Log Parsing with Entropy Sampling and Chain-of-Thought Merging

Mar 02, 2024
Wei Zhang, Hongcheng Guo, Anjie Le, Jian Yang, Jiaheng Liu, Zhoujun Li, Tieqiao Zheng, Shi Xu, Runqiang Zang, Liangfan Zheng, Bo Zhang

Viaarxiv icon

VisionLLaMA: A Unified LLaMA Interface for Vision Tasks

Mar 01, 2024
Xiangxiang Chu, Jianlin Su, Bo Zhang, Chunhua Shen

Figure 1 for VisionLLaMA: A Unified LLaMA Interface for Vision Tasks
Figure 2 for VisionLLaMA: A Unified LLaMA Interface for Vision Tasks
Figure 3 for VisionLLaMA: A Unified LLaMA Interface for Vision Tasks
Figure 4 for VisionLLaMA: A Unified LLaMA Interface for Vision Tasks
Viaarxiv icon

Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models

Feb 22, 2024
Xudong Lu, Qi Liu, Yuhui Xu, Aojun Zhou, Siyuan Huang, Bo Zhang, Junchi Yan, Hongsheng Li

Viaarxiv icon

ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning

Feb 19, 2024
Renqiu Xia, Bo Zhang, Hancheng Ye, Xiangchao Yan, Qi Liu, Hongbin Zhou, Zijun Chen, Min Dou, Botian Shi, Junchi Yan, Yu Qiao

Viaarxiv icon

OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving

Feb 06, 2024
Guohang Yan, Jiahao Pi, Jianfei Guo, Zhaotong Luo, Min Dou, Nianchen Deng, Qiusheng Huang, Daocheng Fu, Licheng Wen, Pinlong Cai, Xing Gao, Xinyu Cai, Bo Zhang, Xuemeng Yang, Yeqi Bai, Hongbin Zhou, Botian Shi

Viaarxiv icon

MobileVLM V2: Faster and Stronger Baseline for Vision Language Model

Feb 06, 2024
Xiangxiang Chu, Limeng Qiao, Xinyu Zhang, Shuang Xu, Fei Wei, Yang Yang, Xiaofei Sun, Yiming Hu, Xinyang Lin, Bo Zhang, Chunhua Shen

Viaarxiv icon

Cross-Task Linearity Emerges in the Pretraining-Finetuning Paradigm

Feb 06, 2024
Zhanpeng Zhou, Zijun Chen, Yilan Chen, Bo Zhang, Junchi Yan

Viaarxiv icon