Wengang Zhou

I$^2$MD: 3D Action Representation Learning with Inter- and Intra-modal Mutual Distillation

Oct 24, 2023

State Sequences Prediction via Fourier Transform for Representation Learning

Oct 24, 2023

UniDoc: A Universal Large Multimodal Model for Simultaneous Text Detection, Recognition, Spotting and Understanding

Sep 02, 2023

Sign Language Translation with Iterative Prototype

Aug 23, 2023

Text-Only Training for Visual Storytelling

Aug 17, 2023

SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning

Aug 17, 2023

Masked Motion Predictors are Strong 3D Action Representation Learners

Aug 14, 2023

Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection

Aug 11, 2023

Exploiting Spatial-Temporal Context for Interacting Hand Reconstruction on Monocular RGB Video

Aug 08, 2023

AltFreezing for More General Video Face Forgery Detection

Jul 17, 2023