Shan Yang

MMAE: A Massive Multitask Audio Editing Benchmark

Jun 05, 2026
Via arXiv

AnyAudio-Judge: A Dynamic Rubric-Based Benchmark and Evaluator for Audio Instruction Following

Jun 02, 2026
Via arXiv

Physics-R1: An Audited Olympiad Corpus and Recipe for Visual Physics Reasoning

May 13, 2026
Via arXiv

WavCube: Unifying Speech Representation for Understanding and Generation via Semantic-Acoustic Joint Modeling

May 07, 2026
Via arXiv

TMD-Bench: A Multi-Level Evaluation Paradigm for Music-Dance Co-Generation

May 03, 2026
Via arXiv

Agentic Application in Power Grid Static Analysis: Automatic Code Generation and Error Correction

Apr 11, 2026
Via arXiv

Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control

Mar 10, 2026
Via arXiv

Descent-Guided Policy Gradient for Scalable Cooperative Multi-Agent Learning

Feb 23, 2026
Via arXiv

Covo-Audio Technical Report

Feb 10, 2026
Via arXiv

PhyAVBench: A Challenging Audio Physics-Sensitivity Benchmark for Physically Grounded Text-to-Audio-Video Generation

Dec 30, 2025
Via arXiv