Ruoming Pang

A comparison of end-to-end models for long-form speech recognition

Nov 06, 2019
Chung-Cheng Chiu, Wei Han, Yu Zhang, Ruoming Pang, Sergey Kishchenko, Patrick Nguyen, Arun Narayanan, Hank Liao, Shuyuan Zhang, Anjuli Kannan, Rohit Prabhavalkar, Zhifeng Chen, Tara Sainath, Yonghui Wu

Two-Pass End-to-End Speech Recognition

Aug 29, 2019
Tara N. Sainath, Ruoming Pang, David Rybach, Yanzhang He, Rohit Prabhavalkar, Wei Li, Mirkó Visontai, Qiao Liang, Trevor Strohman, Yonghui Wu, Ian McGraw, Chung-Cheng Chiu

Monotonic Infinite Lookback Attention for Simultaneous Machine Translation

Jun 12, 2019
Naveen Arivazhagan, Colin Cherry, Wolfgang Macherey, Chung-Cheng Chiu, Semih Yavuz, Ruoming Pang, Wei Li, Colin Raffel

Searching for MobileNetV3

May 14, 2019
Andrew Howard, Mark Sandler, Grace Chu, Liang-Chieh Chen, Bo Chen, Mingxing Tan, Weijun Wang, Yukun Zhu, Ruoming Pang, Vijay Vasudevan, Quoc V. Le, Hartwig Adam

NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection

Apr 16, 2019
Golnaz Ghiasi, Tsung-Yi Lin, Ruoming Pang, Quoc V. Le

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

Feb 21, 2019
Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia X. Chen, Ye Jia, Anjuli Kannan, Tara Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob, Bowen Liang, HyoukJoong Lee, Ciprian Chelba, Sébastien Jean, Bo Li, Melvin Johnson, Rohan Anil, Rajat Tibrewal, Xiaobing Liu, Akiko Eriguchi, Navdeep Jaitly, Naveen Ari, Colin Cherry, Parisa Haghani, Otavio Good, Youlong Cheng, Raziel Alvarez, Isaac Caswell, Wei-Ning Hsu, Zongheng Yang, Kuan-Chieh Wang, Ekaterina Gonina, Katrin Tomanek, Ben Vanik, Zelin Wu, Llion Jones, Mike Schuster, Yanping Huang, Dehao Chen, Kazuki Irie, George Foster, John Richardson, Klaus Macherey, Antoine Bruguier, Heiga Zen, Colin Raffel, Shankar Kumar, Kanishka Rao, David Rybach, Matthew Murray, Vijayaditya Peddinti, Maxim Krikun, Michiel A. U. Bacchiani, Thomas B. Jablin, Rob Suderman, Ian Williams, Benjamin Lee, Deepti Bhatia, Justin Carlson, Semih Yavuz, Yu Zhang, Ian McGraw, Max Galkin, Qi Ge, Golan Pundak, Chad Whipkey, Todd Wang, Uri Alon, Dmitry Lepikhin, Ye Tian, Sara Sabour, William Chan, Shubham Toshniwal, Baohua Liao, Michael Nirschl, Pat Rondon

Domain Adaptive Transfer Learning with Specialist Models

Dec 11, 2018
Jiquan Ngiam, Daiyi Peng, Vijay Vasudevan, Simon Kornblith, Quoc V. Le, Ruoming Pang

Streaming End-to-end Speech Recognition For Mobile Devices

Nov 15, 2018
Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-yiin Chang, Kanishka Rao, Alexander Gruenstein

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis

Nov 05, 2018
Ye Jia, Yu Zhang, Ron J. Weiss, Quan Wang, Jonathan Shen, Fei Ren, Zhifeng Chen, Patrick Nguyen, Ruoming Pang, Ignacio Lopez Moreno, Yonghui Wu

Hierarchical Generative Modeling for Controllable Speech Synthesis

Oct 16, 2018
Wei-Ning Hsu, Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Yuxuan Wang, Yuan Cao, Ye Jia, Zhifeng Chen, Jonathan Shen, Patrick Nguyen, Ruoming Pang
