Wengang Zhou

Structural Action Transformer for 3D Dexterous Manipulation

Mar 04, 2026
Via arXiv

StepVAR: Structure-Texture Guided Pruning for Visual Autoregressive Models

Mar 02, 2026
Via arXiv

Primary-Fine Decoupling for Action Generation in Robotic Imitation

Feb 25, 2026
Via arXiv

BookNet: Book Image Rectification via Cross-Page Attention Network

Jan 29, 2026
Via arXiv

Make-It-Poseable: Feed-forward Latent Posing Model for 3D Humanoid Character Animation

Dec 18, 2025
Via arXiv

Gait-Adaptive Perceptive Humanoid Locomotion with Real-Time Under-Base Terrain Reconstruction

Dec 08, 2025
Via arXiv

Enhancing the Outcome Reward-based RL Training of MLLMs with Self-Consistency Sampling

Nov 13, 2025
Via arXiv

DocR1: Evidence Page-Guided GRPO for Multi-Page Document Understanding

Aug 10, 2025
Via arXiv

SLRTP2025 Sign Language Production Challenge: Methodology, Results, and Future Work

Aug 09, 2025
Via arXiv

Self-Classification Enhancement and Correction for Weakly Supervised Object Detection

May 22, 2025
Via arXiv