Picture for Xueyi Pu

Xueyi Pu

Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer

Add code
May 29, 2026
Viaarxiv icon

WavAlign: Enhancing Intelligence and Expressiveness in Spoken Dialogue Models via Adaptive Hybrid Post-Training

Add code
Apr 16, 2026
Viaarxiv icon

Modeling and Benchmarking Spoken Dialogue Rewards with Modality and Colloquialness

Add code
Mar 16, 2026
Viaarxiv icon