Picture for Ambuj Mehrish

Ambuj Mehrish

Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training

Jun 03, 2024
Viaarxiv icon

HyperTTS: Parameter Efficient Adaptation in Text to Speech using Hypernetworks

Add code
Apr 06, 2024
Figure 1 for HyperTTS: Parameter Efficient Adaptation in Text to Speech using Hypernetworks
Figure 2 for HyperTTS: Parameter Efficient Adaptation in Text to Speech using Hypernetworks
Figure 3 for HyperTTS: Parameter Efficient Adaptation in Text to Speech using Hypernetworks
Figure 4 for HyperTTS: Parameter Efficient Adaptation in Text to Speech using Hypernetworks
Viaarxiv icon

CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models

Add code
Mar 31, 2024
Figure 1 for CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models
Figure 2 for CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models
Figure 3 for CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models
Figure 4 for CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models
Viaarxiv icon

ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation

Add code
May 29, 2023
Figure 1 for ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation
Figure 2 for ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation
Figure 3 for ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation
Figure 4 for ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation
Viaarxiv icon

A Review of Deep Learning Techniques for Speech Processing

May 02, 2023
Figure 1 for A Review of Deep Learning Techniques for Speech Processing
Figure 2 for A Review of Deep Learning Techniques for Speech Processing
Figure 3 for A Review of Deep Learning Techniques for Speech Processing
Figure 4 for A Review of Deep Learning Techniques for Speech Processing
Viaarxiv icon

Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model

Add code
Apr 24, 2023
Figure 1 for Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model
Figure 2 for Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model
Figure 3 for Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model
Figure 4 for Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model
Viaarxiv icon

Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding

Add code
Mar 02, 2023
Figure 1 for Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding
Figure 2 for Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding
Figure 3 for Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding
Figure 4 for Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding
Viaarxiv icon

Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder

Add code
Nov 07, 2022
Figure 1 for Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder
Figure 2 for Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder
Figure 3 for Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder
Figure 4 for Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder
Viaarxiv icon