Weichao Zhao

DocR1: Evidence Page-Guided GRPO for Multi-Page Document Understanding

Aug 10, 2025

SLRTP2025 Sign Language Production Challenge: Methodology, Results, and Future Work

Aug 09, 2025

Cross-Modal Consistency Learning for Sign Language Recognition

Mar 16, 2025

Uni-Sign: Toward Unified Sign Language Understanding at Scale

Jan 25, 2025

Scaling up Multimodal Pre-training for Sign Language Understanding

Aug 16, 2024

TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy

Jun 03, 2024

MASA: Motion-aware Masked Autoencoder with Semantic Alignment for Sign Language Recognition

May 31, 2024

Exploiting Spatial-Temporal Context for Interacting Hand Reconstruction on Monocular RGB Video

Aug 08, 2023

SignBERT+: Hand-model-aware Self-supervised Pre-training for Sign Language Understanding

May 08, 2023

BEST: BERT Pre-Training for Sign Language Recognition with Coupling Tokenization

Feb 13, 2023