Wengang Zhou

UniDoc: A Universal Large Multimodal Model for Simultaneous Text Detection, Recognition, Spotting and Understanding

Sep 02, 2023
Hao Feng, Zijian Wang, Jingqun Tang, Jinghui Lu, Wengang Zhou, Houqiang Li, Can Huang

Sign Language Translation with Iterative Prototype

Aug 23, 2023
Huijie Yao, Wengang Zhou, Hao Feng, Hezhen Hu, Hao Zhou, Houqiang Li

SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning

Aug 17, 2023
Hao Feng, Wendi Wang, Jiajun Deng, Wengang Zhou, Li Li, Houqiang Li

Text-Only Training for Visual Storytelling

Aug 17, 2023
Yuechen Wang, Wengang Zhou, Zhenbo Lu, Houqiang Li

Masked Motion Predictors are Strong 3D Action Representation Learners

Aug 14, 2023
Yunyao Mao, Jiajun Deng, Wengang Zhou, Yao Fang, Wanli Ouyang, Houqiang Li

Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection

Aug 11, 2023
Yufei Yin, Jiajun Deng, Wengang Zhou, Li Li, Houqiang Li

Exploiting Spatial-Temporal Context for Interacting Hand Reconstruction on Monocular RGB Video

Aug 08, 2023
Weichao Zhao, Hezhen Hu, Wengang Zhou, Li Li, Houqiang Li

AltFreezing for More General Video Face Forgery Detection

Jul 17, 2023
Zhendong Wang, Jianmin Bao, Wengang Zhou, Weilun Wang, Houqiang Li

Exploring Effective Mask Sampling Modeling for Neural Image Compression

Jun 09, 2023
Lin Liu, Mingming Zhao, Shanxin Yuan, Wenlong Lyu, Wengang Zhou, Houqiang Li, Yanfeng Wang, Qi Tian
