Bryan Catanzaro

Hierarchical Multi-Scale Attention for Semantic Segmentation

May 21, 2020
Andrew Tao, Karan Sapra, Bryan Catanzaro

Via arXiv

Large Scale Multi-Actor Generative Dialog Modeling

May 13, 2020
Alex Boyd, Raul Puri, Mohammad Shoeybi, Mostofa Patwary, Bryan Catanzaro

Via arXiv

Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis

May 13, 2020
Rafael Valle, Kevin Shih, Ryan Prenger, Bryan Catanzaro

Via arXiv

Panoptic-based Image Synthesis

Apr 21, 2020
Aysegul Dundar, Karan Sapra, Guilin Liu, Andrew Tao, Bryan Catanzaro

Via arXiv

Training Question Answering Models From Synthetic Data

Feb 22, 2020
Raul Puri, Ryan Spring, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro

Via arXiv

Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos

Jan 26, 2020
Aysegul Dundar, Kevin J. Shih, Animesh Garg, Robert Pottorf, Andrew Tao, Bryan Catanzaro

Via arXiv

Neural ODEs for Image Segmentation with Level Sets

Dec 25, 2019
Rafael Valle, Fitsum Reda, Mohammad Shoeybi, Patrick Legresley, Andrew Tao, Bryan Catanzaro

Via arXiv

Zero-shot Text Classification With Generative Language Models

Dec 10, 2019
Raul Puri, Bryan Catanzaro

Via arXiv

Few-shot Video-to-Video Synthesis

Oct 28, 2019
Ting-Chun Wang, Ming-Yu Liu, Andrew Tao, Guilin Liu, Jan Kautz, Bryan Catanzaro

Via arXiv

Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokens

Oct 26, 2019
Rafael Valle, Jason Li, Ryan Prenger, Bryan Catanzaro

Via arXiv