Alert button
Picture for Yonghui Wu

Yonghui Wu

Alert button

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

Add code
Bookmark button
Alert button
Mar 03, 2023
Yu Zhang, Wei Han, James Qin, Yongqiang Wang, Ankur Bapna, Zhehuai Chen, Nanxin Chen, Bo Li, Vera Axelrod, Gary Wang, Zhong Meng, Ke Hu, Andrew Rosenberg, Rohit Prabhavalkar, Daniel S. Park, Parisa Haghani, Jason Riesa, Ginger Perng, Hagen Soltau, Trevor Strohman, Bhuvana Ramabhadran, Tara Sainath, Pedro Moreno, Chung-Cheng Chiu, Johan Schalkwyk, Françoise Beaufays, Yonghui Wu

Figure 1 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 2 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 3 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 4 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Viaarxiv icon

AnyTOD: A Programmable Task-Oriented Dialog System

Add code
Bookmark button
Alert button
Dec 20, 2022
Jeffrey Zhao, Yuan Cao, Raghav Gupta, Harrison Lee, Abhinav Rastogi, Mingqiu Wang, Hagen Soltau, Izhak Shafran, Yonghui Wu

Figure 1 for AnyTOD: A Programmable Task-Oriented Dialog System
Figure 2 for AnyTOD: A Programmable Task-Oriented Dialog System
Figure 3 for AnyTOD: A Programmable Task-Oriented Dialog System
Figure 4 for AnyTOD: A Programmable Task-Oriented Dialog System
Viaarxiv icon

Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners

Add code
Bookmark button
Alert button
Dec 09, 2022
Shen Yan, Tao Zhu, Zirui Wang, Yuan Cao, Mi Zhang, Soham Ghosh, Yonghui Wu, Jiahui Yu

Figure 1 for Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners
Figure 2 for Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners
Figure 3 for Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners
Figure 4 for Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners
Viaarxiv icon

SODA: A Natural Language Processing Package to Extract Social Determinants of Health for Cancer Studies

Add code
Bookmark button
Alert button
Dec 06, 2022
Zehao Yu, Xi Yang, Chong Dang, Prakash Adekkanattu, Braja Gopal Patra, Yifan Peng, Jyotishman Pathak, Debbie L. Wilson, Ching-Yuan Chang, Wei-Hsuan Lo-Ciganic, Thomas J. George, William R. Hogan, Yi Guo, Jiang Bian, Yonghui Wu

Figure 1 for SODA: A Natural Language Processing Package to Extract Social Determinants of Health for Cancer Studies
Figure 2 for SODA: A Natural Language Processing Package to Extract Social Determinants of Health for Cancer Studies
Figure 3 for SODA: A Natural Language Processing Package to Extract Social Determinants of Health for Cancer Studies
Figure 4 for SODA: A Natural Language Processing Package to Extract Social Determinants of Health for Cancer Studies
Viaarxiv icon

Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks

Add code
Bookmark button
Alert button
Aug 28, 2022
Lev Finkelstein, Heiga Zen, Norman Casagrande, Chun-an Chan, Ye Jia, Tom Kenter, Alexey Petelin, Jonathan Shen, Vincent Wan, Yu Zhang, Yonghui Wu, Rob Clark

Figure 1 for Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks
Figure 2 for Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks
Figure 3 for Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks
Figure 4 for Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks
Viaarxiv icon

N-Grammer: Augmenting Transformers with latent n-grams

Add code
Bookmark button
Alert button
Jul 13, 2022
Aurko Roy, Rohan Anil, Guangda Lai, Benjamin Lee, Jeffrey Zhao, Shuyuan Zhang, Shibo Wang, Ye Zhang, Shen Wu, Rigel Swavely, Tao, Yu, Phuong Dao, Christopher Fifty, Zhifeng Chen, Yonghui Wu

Figure 1 for N-Grammer: Augmenting Transformers with latent n-grams
Figure 2 for N-Grammer: Augmenting Transformers with latent n-grams
Figure 3 for N-Grammer: Augmenting Transformers with latent n-grams
Figure 4 for N-Grammer: Augmenting Transformers with latent n-grams
Viaarxiv icon

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation

Add code
Bookmark button
Alert button
Jun 22, 2022
Jiahui Yu, Yuanzhong Xu, Jing Yu Koh, Thang Luong, Gunjan Baid, Zirui Wang, Vijay Vasudevan, Alexander Ku, Yinfei Yang, Burcu Karagol Ayan, Ben Hutchinson, Wei Han, Zarana Parekh, Xin Li, Han Zhang, Jason Baldridge, Yonghui Wu

Figure 1 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Figure 2 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Figure 3 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Figure 4 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Viaarxiv icon

Building Machine Translation Systems for the Next Thousand Languages

Add code
Bookmark button
Alert button
May 16, 2022
Ankur Bapna, Isaac Caswell, Julia Kreutzer, Orhan Firat, Daan van Esch, Aditya Siddhant, Mengmeng Niu, Pallavi Baljekar, Xavier Garcia, Wolfgang Macherey, Theresa Breiner, Vera Axelrod, Jason Riesa, Yuan Cao, Mia Xu Chen, Klaus Macherey, Maxim Krikun, Pidong Wang, Alexander Gutkin, Apurva Shah, Yanping Huang, Zhifeng Chen, Yonghui Wu, Macduff Hughes

Figure 1 for Building Machine Translation Systems for the Next Thousand Languages
Figure 2 for Building Machine Translation Systems for the Next Thousand Languages
Figure 3 for Building Machine Translation Systems for the Next Thousand Languages
Figure 4 for Building Machine Translation Systems for the Next Thousand Languages
Viaarxiv icon

CoCa: Contrastive Captioners are Image-Text Foundation Models

Add code
Bookmark button
Alert button
May 04, 2022
Jiahui Yu, Zirui Wang, Vijay Vasudevan, Legg Yeung, Mojtaba Seyedhosseini, Yonghui Wu

Figure 1 for CoCa: Contrastive Captioners are Image-Text Foundation Models
Figure 2 for CoCa: Contrastive Captioners are Image-Text Foundation Models
Figure 3 for CoCa: Contrastive Captioners are Image-Text Foundation Models
Figure 4 for CoCa: Contrastive Captioners are Image-Text Foundation Models
Viaarxiv icon