Picture for Dong Yu

Dong Yu

Measure Twice, Click Once: Co-evolving Proposer and Visual Critic via Reinforcement Learning for GUI Grounding

Add code
Apr 23, 2026
Viaarxiv icon

Too Correct to Learn: Reinforcement Learning on Saturated Reasoning Data

Add code
Apr 20, 2026
Viaarxiv icon

Audio-DeepThinker: Progressive Reasoning-Aware Reinforcement Learning for High-Quality Chain-of-Thought Emergence in Audio Language Models

Add code
Apr 20, 2026
Viaarxiv icon

SPAGBias: Uncovering and Tracing Structured Spatial Gender Bias in Large Language Models

Add code
Apr 16, 2026
Viaarxiv icon

Unlocking Strong Supervision: A Data-Centric Study of General-Purpose Audio Pre-Training Methods

Add code
Mar 26, 2026
Viaarxiv icon

Covo-Audio Technical Report

Add code
Feb 10, 2026
Viaarxiv icon

Locas: Your Models are Principled Initializers of Locally-Supported Parametric Memories

Add code
Feb 04, 2026
Viaarxiv icon

Verified Critical Step Optimization for LLM Agents

Add code
Feb 03, 2026
Viaarxiv icon

Save the Good Prefix: Precise Error Penalization via Process-Supervised RL to Enhance LLM Reasoning

Add code
Jan 26, 2026
Viaarxiv icon

SpatialEmb: Extract and Encode Spatial Information for 1-Stage Multi-channel Multi-speaker ASR on Arbitrary Microphone Arrays

Add code
Jan 25, 2026
Viaarxiv icon