Yuqing Yang

Rebellious Student: Reversing Teacher Signals for Reasoning Exploration with Self-Distilled RLVR

May 11, 2026
Via arXiv

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Apr 27, 2026
Via arXiv

MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation

Apr 16, 2026
Via arXiv

Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks

Apr 13, 2026
Via arXiv

AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation

Apr 09, 2026
Via arXiv

BizGenEval: A Systematic Benchmark for Commercial Visual Content Generation

Mar 26, 2026
Via arXiv

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Mar 25, 2026
Via arXiv

SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling

Mar 24, 2026
Via arXiv

Act While Thinking: Accelerating LLM Agents via Pattern-Aware Speculative Tool Execution

Mar 19, 2026
Via arXiv

Understanding Reasoning in LLMs through Strategic Information Allocation under Uncertainty

Mar 16, 2026
Via arXiv