Picture for Ruiyang Xu

Ruiyang Xu

Native Active Perception as Reasoning for Omni-Modal Understanding

Add code
Jun 17, 2026
Viaarxiv icon

MMAE: A Massive Multitask Audio Editing Benchmark

Add code
Jun 05, 2026
Viaarxiv icon

LifeBench: A Benchmark for Long-Horizon Multi-Source Memory

Add code
Mar 04, 2026
Viaarxiv icon

The Interspeech 2026 Audio Reasoning Challenge: Evaluating Reasoning Process Quality for Audio Reasoning Models and Agents

Add code
Feb 15, 2026
Viaarxiv icon

SLAM-LLM: A Modular, Open-Source Multimodal Large Language Model Framework and Best Practice for Speech, Language, Audio and Music Processing

Add code
Jan 14, 2026
Viaarxiv icon

Omni-Captioner: Data Pipeline, Models, and Benchmark for Omni Detailed Perception

Add code
Oct 14, 2025
Viaarxiv icon

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

Add code
May 19, 2025
Figure 1 for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
Figure 2 for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
Figure 3 for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
Figure 4 for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
Viaarxiv icon

Interleaved Speech-Text Language Models are Simple Streaming Text to Speech Synthesizers

Add code
Dec 23, 2024
Viaarxiv icon

SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training

Add code
Dec 20, 2024
Figure 1 for SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training
Figure 2 for SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training
Figure 3 for SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training
Figure 4 for SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training
Viaarxiv icon

CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution

Add code
Aug 23, 2024
Figure 1 for CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution
Figure 2 for CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution
Figure 3 for CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution
Figure 4 for CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution
Viaarxiv icon