Picture for Varun Nagaraja

Varun Nagaraja

High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching

Add code
Jul 04, 2024
Viaarxiv icon

On The Open Prompt Challenge In Conditional Audio Generation

Add code
Nov 01, 2023
Figure 1 for On The Open Prompt Challenge In Conditional Audio Generation
Figure 2 for On The Open Prompt Challenge In Conditional Audio Generation
Figure 3 for On The Open Prompt Challenge In Conditional Audio Generation
Figure 4 for On The Open Prompt Challenge In Conditional Audio Generation
Viaarxiv icon

FoleyGen: Visually-Guided Audio Generation

Add code
Sep 19, 2023
Figure 1 for FoleyGen: Visually-Guided Audio Generation
Figure 2 for FoleyGen: Visually-Guided Audio Generation
Figure 3 for FoleyGen: Visually-Guided Audio Generation
Figure 4 for FoleyGen: Visually-Guided Audio Generation
Viaarxiv icon

Stack-and-Delay: a new codebook pattern for music generation

Add code
Sep 15, 2023
Figure 1 for Stack-and-Delay: a new codebook pattern for music generation
Figure 2 for Stack-and-Delay: a new codebook pattern for music generation
Figure 3 for Stack-and-Delay: a new codebook pattern for music generation
Figure 4 for Stack-and-Delay: a new codebook pattern for music generation
Viaarxiv icon

Enhance audio generation controllability through representation similarity regularization

Add code
Sep 15, 2023
Figure 1 for Enhance audio generation controllability through representation similarity regularization
Figure 2 for Enhance audio generation controllability through representation similarity regularization
Figure 3 for Enhance audio generation controllability through representation similarity regularization
Figure 4 for Enhance audio generation controllability through representation similarity regularization
Viaarxiv icon

Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution

Add code
Oct 07, 2021
Figure 1 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Figure 2 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Figure 3 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Figure 4 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Viaarxiv icon

Collaborative Training of Acoustic Encoders for Speech Recognition

Add code
Jul 13, 2021
Figure 1 for Collaborative Training of Acoustic Encoders for Speech Recognition
Figure 2 for Collaborative Training of Acoustic Encoders for Speech Recognition
Figure 3 for Collaborative Training of Acoustic Encoders for Speech Recognition
Viaarxiv icon

Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency

Add code
Apr 05, 2021
Figure 1 for Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency
Figure 2 for Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency
Figure 3 for Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency
Figure 4 for Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency
Viaarxiv icon