Alert button
Picture for Kevin J. Shih

Kevin J. Shih

Alert button

VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation

Add code
Bookmark button
Alert button
Mar 14, 2023
Rohan Badlani, Akshit Arora, Subhankar Ghosh, Rafael Valle, Kevin J. Shih, João Felipe Santos, Boris Ginsburg, Bryan Catanzaro

Figure 1 for VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation
Viaarxiv icon

Multilingual Multiaccented Multispeaker TTS with RADTTS

Add code
Bookmark button
Alert button
Jan 24, 2023
Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro

Figure 1 for Multilingual Multiaccented Multispeaker TTS with RADTTS
Figure 2 for Multilingual Multiaccented Multispeaker TTS with RADTTS
Figure 3 for Multilingual Multiaccented Multispeaker TTS with RADTTS
Figure 4 for Multilingual Multiaccented Multispeaker TTS with RADTTS
Viaarxiv icon

Collecting The Puzzle Pieces: Disentangled Self-Driven Human Pose Transfer by Permuting Textures

Add code
Bookmark button
Alert button
Oct 06, 2022
Nannan Li, Kevin J. Shih, Bryan A. Plummer

Figure 1 for Collecting The Puzzle Pieces: Disentangled Self-Driven Human Pose Transfer by Permuting Textures
Figure 2 for Collecting The Puzzle Pieces: Disentangled Self-Driven Human Pose Transfer by Permuting Textures
Figure 3 for Collecting The Puzzle Pieces: Disentangled Self-Driven Human Pose Transfer by Permuting Textures
Figure 4 for Collecting The Puzzle Pieces: Disentangled Self-Driven Human Pose Transfer by Permuting Textures
Viaarxiv icon

Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows

Add code
Bookmark button
Alert button
Mar 07, 2022
Kevin J. Shih, Rafael Valle, Rohan Badlani, João Felipe Santos, Bryan Catanzaro

Figure 1 for Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows
Figure 2 for Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows
Figure 3 for Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows
Figure 4 for Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows
Viaarxiv icon

One TTS Alignment To Rule Them All

Add code
Bookmark button
Alert button
Aug 23, 2021
Rohan Badlani, Adrian Łancucki, Kevin J. Shih, Rafael Valle, Wei Ping, Bryan Catanzaro

Figure 1 for One TTS Alignment To Rule Them All
Figure 2 for One TTS Alignment To Rule Them All
Figure 3 for One TTS Alignment To Rule Them All
Figure 4 for One TTS Alignment To Rule Them All
Viaarxiv icon

Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos

Add code
Bookmark button
Alert button
Jan 26, 2020
Aysegul Dundar, Kevin J. Shih, Animesh Garg, Robert Pottorf, Andrew Tao, Bryan Catanzaro

Figure 1 for Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos
Figure 2 for Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos
Figure 3 for Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos
Figure 4 for Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos
Viaarxiv icon

Video Interpolation and Prediction with Unsupervised Landmarks

Add code
Bookmark button
Alert button
Sep 06, 2019
Kevin J. Shih, Aysegul Dundar, Animesh Garg, Robert Pottorf, Andrew Tao, Bryan Catanzaro

Figure 1 for Video Interpolation and Prediction with Unsupervised Landmarks
Figure 2 for Video Interpolation and Prediction with Unsupervised Landmarks
Figure 3 for Video Interpolation and Prediction with Unsupervised Landmarks
Figure 4 for Video Interpolation and Prediction with Unsupervised Landmarks
Viaarxiv icon

Unsupervised Video Interpolation Using Cycle Consistency

Add code
Bookmark button
Alert button
Jun 13, 2019
Fitsum A. Reda, Deqing Sun, Aysegul Dundar, Mohammad Shoeybi, Guilin Liu, Kevin J. Shih, Andrew Tao, Jan Kautz, Bryan Catanzaro

Figure 1 for Unsupervised Video Interpolation Using Cycle Consistency
Figure 2 for Unsupervised Video Interpolation Using Cycle Consistency
Figure 3 for Unsupervised Video Interpolation Using Cycle Consistency
Figure 4 for Unsupervised Video Interpolation Using Cycle Consistency
Viaarxiv icon

Graphical Contrastive Losses for Scene Graph Generation

Add code
Bookmark button
Alert button
Mar 28, 2019
Ji Zhang, Kevin J. Shih, Ahmed Elgammal, Andrew Tao, Bryan Catanzaro

Figure 1 for Graphical Contrastive Losses for Scene Graph Generation
Figure 2 for Graphical Contrastive Losses for Scene Graph Generation
Figure 3 for Graphical Contrastive Losses for Scene Graph Generation
Figure 4 for Graphical Contrastive Losses for Scene Graph Generation
Viaarxiv icon