Alert button
Picture for Rafael Valle

Rafael Valle

Alert button

Audio Dialogues: Dialogues dataset for audio and music understanding

Add code
Bookmark button
Alert button
Apr 11, 2024
Arushi Goel, Zhifeng Kong, Rafael Valle, Bryan Catanzaro

Viaarxiv icon

Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities

Add code
Bookmark button
Alert button
Feb 02, 2024
Zhifeng Kong, Arushi Goel, Rohan Badlani, Wei Ping, Rafael Valle, Bryan Catanzaro

Viaarxiv icon

Scaling NVIDIA's Multi-speaker Multi-lingual TTS Systems with Zero-Shot TTS to Indic Languages

Add code
Bookmark button
Alert button
Jan 29, 2024
Akshit Arora, Rohan Badlani, Sungwon Kim, Rafael Valle, Bryan Catanzaro

Viaarxiv icon

SelfVC: Voice Conversion With Iterative Refinement using Self Transformations

Add code
Bookmark button
Alert button
Oct 14, 2023
Paarth Neekhara, Shehzeen Hussain, Rafael Valle, Boris Ginsburg, Rishabh Ranjan, Shlomo Dubnov, Farinaz Koushanfar, Julian McAuley

Figure 1 for SelfVC: Voice Conversion With Iterative Refinement using Self Transformations
Figure 2 for SelfVC: Voice Conversion With Iterative Refinement using Self Transformations
Figure 3 for SelfVC: Voice Conversion With Iterative Refinement using Self Transformations
Figure 4 for SelfVC: Voice Conversion With Iterative Refinement using Self Transformations
Viaarxiv icon

VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation

Add code
Bookmark button
Alert button
Mar 14, 2023
Rohan Badlani, Akshit Arora, Subhankar Ghosh, Rafael Valle, Kevin J. Shih, João Felipe Santos, Boris Ginsburg, Bryan Catanzaro

Figure 1 for VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation
Viaarxiv icon

Multilingual Multiaccented Multispeaker TTS with RADTTS

Add code
Bookmark button
Alert button
Jan 24, 2023
Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro

Figure 1 for Multilingual Multiaccented Multispeaker TTS with RADTTS
Figure 2 for Multilingual Multiaccented Multispeaker TTS with RADTTS
Figure 3 for Multilingual Multiaccented Multispeaker TTS with RADTTS
Figure 4 for Multilingual Multiaccented Multispeaker TTS with RADTTS
Viaarxiv icon

SPACE: Speech-driven Portrait Animation with Controllable Expression

Add code
Bookmark button
Alert button
Dec 07, 2022
Siddharth Gururani, Arun Mallya, Ting-Chun Wang, Rafael Valle, Ming-Yu Liu

Figure 1 for SPACE: Speech-driven Portrait Animation with Controllable Expression
Figure 2 for SPACE: Speech-driven Portrait Animation with Controllable Expression
Figure 3 for SPACE: Speech-driven Portrait Animation with Controllable Expression
Figure 4 for SPACE: Speech-driven Portrait Animation with Controllable Expression
Viaarxiv icon

SPACEx: Speech-driven Portrait Animation with Controllable Expression

Add code
Bookmark button
Alert button
Nov 17, 2022
Siddharth Gururani, Arun Mallya, Ting-Chun Wang, Rafael Valle, Ming-Yu Liu

Figure 1 for SPACEx: Speech-driven Portrait Animation with Controllable Expression
Figure 2 for SPACEx: Speech-driven Portrait Animation with Controllable Expression
Figure 3 for SPACEx: Speech-driven Portrait Animation with Controllable Expression
Figure 4 for SPACEx: Speech-driven Portrait Animation with Controllable Expression
Viaarxiv icon

Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows

Add code
Bookmark button
Alert button
Mar 07, 2022
Kevin J. Shih, Rafael Valle, Rohan Badlani, João Felipe Santos, Bryan Catanzaro

Figure 1 for Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows
Figure 2 for Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows
Figure 3 for Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows
Figure 4 for Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows
Viaarxiv icon

One TTS Alignment To Rule Them All

Add code
Bookmark button
Alert button
Aug 23, 2021
Rohan Badlani, Adrian Łancucki, Kevin J. Shih, Rafael Valle, Wei Ping, Bryan Catanzaro

Figure 1 for One TTS Alignment To Rule Them All
Figure 2 for One TTS Alignment To Rule Them All
Figure 3 for One TTS Alignment To Rule Them All
Figure 4 for One TTS Alignment To Rule Them All
Viaarxiv icon