Ambuj Mehrish

Leveraging Parameter-Efficient Transfer Learning for Multi-Lingual Text-to-Speech Adaptation

Jun 25, 2024
Via arXiv

Reward Steering with Evolutionary Heuristics for Decoding-time Alignment

Jun 25, 2024
Via arXiv

Improving Text-To-Audio Models with Synthetic Captions

Jun 18, 2024
Via arXiv

Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training

Jun 03, 2024
Via arXiv

HyperTTS: Parameter Efficient Adaptation in Text to Speech using Hypernetworks

Apr 06, 2024
Via arXiv

CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models

Mar 31, 2024
Via arXiv

ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation

May 29, 2023
Via arXiv

A Review of Deep Learning Techniques for Speech Processing

May 02, 2023
Via arXiv

Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model

Apr 24, 2023
Via arXiv

Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding

Mar 02, 2023
Via arXiv