Picture for Dong Yu

Dong Yu

Audio-Thinker: Guiding Audio Language Model When and How to Think via Reinforcement Learning

Add code
Aug 12, 2025
Viaarxiv icon

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Add code
Aug 07, 2025
Viaarxiv icon

Towards Hallucination-Free Music: A Reinforcement Learning Preference Optimization Framework for Reliable Song Generation

Add code
Aug 07, 2025
Viaarxiv icon

Efficient Scaling for LLM-based ASR

Add code
Aug 06, 2025
Viaarxiv icon

LeVo: High-Quality Song Generation with Multi-Preference Alignment

Add code
Jun 09, 2025
Viaarxiv icon

WAKE: Watermarking Audio with Key Enrichment

Add code
Jun 06, 2025
Viaarxiv icon

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

Add code
May 29, 2025
Viaarxiv icon

Mitigating Audiovisual Mismatch in Visual-Guide Audio Captioning

Add code
May 28, 2025
Viaarxiv icon

VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models

Add code
May 28, 2025
Viaarxiv icon

InComeS: Integrating Compression and Selection Mechanisms into LLMs for Efficient Model Editing

Add code
May 28, 2025
Viaarxiv icon