Wengang Zhou

MASA: Motion-aware Masked Autoencoder with Semantic Alignment for Sign Language Recognition

May 31, 2024

EG4D: Explicit Generation of 4D Object without Score Distillation

May 28, 2024

Learning Generalizable Human Motion Generator with Reinforcement Learning

May 24, 2024

Progressive Multi-modal Conditional Prompt Tuning

Apr 18, 2024

TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding

Apr 15, 2024

Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution

Mar 25, 2024

GaussNav: Gaussian Splatting for Visual Navigation

Mar 20, 2024

Cross-Lingual Transfer for Natural Language Inference via Multilingual Prompt Translator

Mar 19, 2024

Motion-aware 3D Gaussian Splatting for Efficient Dynamic Scene Reconstruction

Mar 18, 2024

Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval

Mar 03, 2024