Picture for Wenhao Yang

Wenhao Yang

On-the-Fly VLA Adaptation via Test-Time Reinforcement Learning

Add code
Jan 13, 2026
Viaarxiv icon

Deep But Reliable: Advancing Multi-turn Reasoning for Thinking with Images

Add code
Dec 19, 2025
Figure 1 for Deep But Reliable: Advancing Multi-turn Reasoning for Thinking with Images
Figure 2 for Deep But Reliable: Advancing Multi-turn Reasoning for Thinking with Images
Figure 3 for Deep But Reliable: Advancing Multi-turn Reasoning for Thinking with Images
Figure 4 for Deep But Reliable: Advancing Multi-turn Reasoning for Thinking with Images
Viaarxiv icon

Listening for "You": Enhancing Speech Image Retrieval via Target Speaker Extraction

Add code
Sep 11, 2025
Viaarxiv icon

Improved Analysis for Sign-based Methods with Momentum Updates

Add code
Jul 16, 2025
Figure 1 for Improved Analysis for Sign-based Methods with Momentum Updates
Figure 2 for Improved Analysis for Sign-based Methods with Momentum Updates
Figure 3 for Improved Analysis for Sign-based Methods with Momentum Updates
Figure 4 for Improved Analysis for Sign-based Methods with Momentum Updates
Viaarxiv icon

Discounted Online Convex Optimization: Uniform Regret Across a Continuous Interval

Add code
May 26, 2025
Viaarxiv icon

Cream of the Crop: Harvesting Rich, Scalable and Transferable Multi-Modal Data for Instruction Fine-Tuning

Add code
Mar 17, 2025
Viaarxiv icon

Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics

Add code
Nov 22, 2024
Figure 1 for Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics
Figure 2 for Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics
Figure 3 for Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics
Figure 4 for Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics
Viaarxiv icon

Limit Theorems for Stochastic Gradient Descent with Infinite Variance

Add code
Oct 21, 2024
Viaarxiv icon

Neuron-based Personality Trait Induction in Large Language Models

Add code
Oct 16, 2024
Viaarxiv icon

You Only Speak Once to See

Add code
Sep 27, 2024
Figure 1 for You Only Speak Once to See
Figure 2 for You Only Speak Once to See
Figure 3 for You Only Speak Once to See
Figure 4 for You Only Speak Once to See
Viaarxiv icon