Picture for Wenhao Zhang

Wenhao Zhang

VideoRouter: Query-Adaptive Dual Routing for Efficient Long-Video Understanding

Add code
May 07, 2026
Viaarxiv icon

Dynamic Pondering Sparsity-aware Mixture-of-Experts Transformer for Event Stream based Visual Object Tracking

Add code
May 07, 2026
Viaarxiv icon

TCOD: Exploring Temporal Curriculum in On-Policy Distillation for Multi-turn Autonomous Agents

Add code
Apr 28, 2026
Viaarxiv icon

Polynomial Expansion Rank Adaptation: Enhancing Low-Rank Fine-Tuning with High-Order Interactions

Add code
Apr 12, 2026
Viaarxiv icon

HiMAC: Hierarchical Macro-Micro Learning for Long-Horizon LLM Agents

Add code
Mar 01, 2026
Viaarxiv icon

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

Add code
Feb 03, 2026
Viaarxiv icon

IntentRL: Training Proactive User-intent Agents for Open-ended Deep Research via Reinforcement Learning

Add code
Feb 03, 2026
Viaarxiv icon

VideoCuRL: Video Curriculum Reinforcement Learning with Orthogonal Difficulty Decomposition

Add code
Dec 31, 2025
Viaarxiv icon

StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision

Add code
Dec 26, 2025
Figure 1 for StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision
Figure 2 for StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision
Figure 3 for StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision
Figure 4 for StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision
Viaarxiv icon

DiSE: A diffusion probabilistic model for automatic structure elucidation of organic compounds

Add code
Oct 30, 2025
Viaarxiv icon