Alert button
Picture for Wengang Zhou

Wengang Zhou

Alert button

Sinkhorn Distance Minimization for Knowledge Distillation

Add code
Bookmark button
Alert button
Feb 27, 2024
Xiao Cui, Yulei Qin, Yuting Gao, Enwei Zhang, Zihan Xu, Tong Wu, Ke Li, Xing Sun, Wengang Zhou, Houqiang Li

Viaarxiv icon

Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation

Add code
Bookmark button
Alert button
Feb 25, 2024
Xiaohan Lei, Min Wang, Wengang Zhou, Li Li, Houqiang Li

Viaarxiv icon

Exploiting GPT-4 Vision for Zero-shot Point Cloud Understanding

Add code
Bookmark button
Alert button
Jan 15, 2024
Qi Sun, Xiao Cui, Wengang Zhou, Houqiang Li

Viaarxiv icon

DanZero+: Dominating the GuanDan Game through Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 05, 2023
Youpeng Zhao, Yudong Lu, Jian Zhao, Wengang Zhou, Houqiang Li

Viaarxiv icon

DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding

Add code
Bookmark button
Alert button
Nov 30, 2023
Hao Feng, Qi Liu, Hao Liu, Wengang Zhou, Houqiang Li, Can Huang

Figure 1 for DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding
Figure 2 for DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding
Figure 3 for DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding
Figure 4 for DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding
Viaarxiv icon

Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs

Add code
Bookmark button
Alert button
Nov 22, 2023
Yonghui Wang, Wengang Zhou, Hao Feng, Keyi Zhou, Houqiang Li

Viaarxiv icon

Progressive Recurrent Network for Shadow Removal

Add code
Bookmark button
Alert button
Nov 01, 2023
Yonghui Wang, Wengang Zhou, Hao Feng, Li Li, Houqiang Li

Viaarxiv icon

State Sequences Prediction via Fourier Transform for Representation Learning

Add code
Bookmark button
Alert button
Oct 24, 2023
Mingxuan Ye, Yufei Kuang, Jie Wang, Rui Yang, Wengang Zhou, Houqiang Li, Feng Wu

Viaarxiv icon

I$^2$MD: 3D Action Representation Learning with Inter- and Intra-modal Mutual Distillation

Add code
Bookmark button
Alert button
Oct 24, 2023
Yunyao Mao, Jiajun Deng, Wengang Zhou, Zhenbo Lu, Wanli Ouyang, Houqiang Li

Viaarxiv icon