Picture for Wengang Zhou

Wengang Zhou

DocR1: Evidence Page-Guided GRPO for Multi-Page Document Understanding

Add code
Aug 10, 2025
Viaarxiv icon

SLRTP2025 Sign Language Production Challenge: Methodology, Results, and Future Work

Add code
Aug 09, 2025
Viaarxiv icon

Self-Classification Enhancement and Correction for Weakly Supervised Object Detection

Add code
May 22, 2025
Viaarxiv icon

Multi-Level Aware Preference Learning: Enhancing RLHF for Complex Multi-Instruction Tasks

Add code
May 19, 2025
Viaarxiv icon

Bias Fitting to Mitigate Length Bias of Reward Model in RLHF

Add code
May 19, 2025
Viaarxiv icon

Mitigating Hallucination in VideoLLMs via Temporal-Aware Activation Engineering

Add code
May 19, 2025
Viaarxiv icon

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Add code
Apr 21, 2025
Viaarxiv icon

Multi-Scale Invertible Neural Network for Wide-Range Variable-Rate Learned Image Compression

Add code
Mar 27, 2025
Figure 1 for Multi-Scale Invertible Neural Network for Wide-Range Variable-Rate Learned Image Compression
Figure 2 for Multi-Scale Invertible Neural Network for Wide-Range Variable-Rate Learned Image Compression
Figure 3 for Multi-Scale Invertible Neural Network for Wide-Range Variable-Rate Learned Image Compression
Figure 4 for Multi-Scale Invertible Neural Network for Wide-Range Variable-Rate Learned Image Compression
Viaarxiv icon

Cross-Modal Consistency Learning for Sign Language Recognition

Add code
Mar 16, 2025
Viaarxiv icon

Multi-Cue Adaptive Visual Token Pruning for Large Vision-Language Models

Add code
Mar 11, 2025
Viaarxiv icon