Rithesh Kumar

PromptSep: Generative Audio Separation via Multimodal Prompting

Nov 06, 2025

DiTSE: High-Fidelity Generative Speech Enhancement via Latent Diffusion Transformers

Apr 13, 2025

SpeakEasy: Enhancing Text-to-Speech Interactions for Expressive Content Creation

Apr 07, 2025

DMDSpeech: Distilled Diffusion Model Surpassing The Teacher in Zero-shot Speech Synthesis via Direct Metric Optimization

Oct 14, 2024

VampNet: Music Generation via Masked Acoustic Token Modeling

Jul 12, 2023

High-Fidelity Audio Compression with Improved RVQGAN

Jun 11, 2023

Chunked Autoregressive GAN for Conditional Waveform Synthesis

Oct 19, 2021

NU-GAN: High resolution neural upsampling with GAN

Oct 22, 2020

MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis

Oct 28, 2019

Maximum Entropy Generators for Energy-Based Models

Jan 24, 2019