Picture for Pengyuan Xie

Pengyuan Xie

Speaker-Reasoner: Scaling Interaction Turns and Reasoning Patterns for Timestamped Speaker-Attributed ASR

Add code
Apr 03, 2026
Viaarxiv icon

OmniCodec: Low Frame Rate Universal Audio Codec with Semantic-Acoustic Disentanglement

Add code
Mar 21, 2026
Viaarxiv icon

VoiceSculptor: Your Voice, Designed By You

Add code
Jan 15, 2026
Viaarxiv icon

NVSpeech: An Integrated and Scalable Pipeline for Human-Like Speech Modeling with Paralinguistic Vocalizations

Add code
Aug 06, 2025
Viaarxiv icon