Audio Generation


Physics-Informed Direction-Aware Neural Acoustic Fields

Jul 09, 2025
Via arXiv

SecureSpeech: Prompt-based Speaker and Content Protection

Jul 10, 2025
Via arXiv

Re-Bottleneck: Latent Re-Structuring for Neural Audio Autoencoders

Jul 10, 2025
Via arXiv

Exploring State-Space-Model based Language Model in Music Generation

Jul 09, 2025
Via arXiv

Scaling RL to Long Videos

Jul 10, 2025
Via arXiv

Democratizing High-Fidelity Co-Speech Gesture Video Generation

Jul 09, 2025
Via arXiv

VP-SelDoA: Visual-prompted Selective DoA Estimation of Target Sound via Semantic-Spatial Matching

Jul 10, 2025
Via arXiv

SonicMotion: Dynamic Spatial Audio Soundscapes with Latent Diffusion Models

Jul 09, 2025
Via arXiv

MEDTalk: Multimodal Controlled 3D Facial Animation with Dynamic Emotions by Disentangled Embedding

Jul 08, 2025
Via arXiv

MixAssist: An Audio-Language Dataset for Co-Creative AI Assistance in Music Mixing

Jul 08, 2025
Via arXiv