Picture for Jiexuan Zhang

Jiexuan Zhang

Audio-Omni: Extending Multi-modal Understanding to Versatile Audio Generation and Editing

Add code
Apr 12, 2026
Viaarxiv icon