Xuerui Yang

Step-Audio-EditX Technical Report

Nov 05, 2025

Mind-Paced Speaking: A Dual-Brain Approach to Real-Time Reasoning in Spoken Language Models

Oct 10, 2025

Step-Audio 2 Technical Report

Jul 24, 2025

Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model

Jun 10, 2025

Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

Feb 18, 2025

BART based semantic correction for Mandarin automatic speech recognition system

Mar 26, 2021