Alert button
Picture for Ruoming Pang

Ruoming Pang

Alert button

Cascaded encoders for unifying streaming and non-streaming ASR

Add code
Bookmark button
Alert button
Oct 27, 2020
Arun Narayanan, Tara N. Sainath, Ruoming Pang, Jiahui Yu, Chung-Cheng Chiu, Rohit Prabhavalkar, Ehsan Variani, Trevor Strohman

Figure 1 for Cascaded encoders for unifying streaming and non-streaming ASR
Figure 2 for Cascaded encoders for unifying streaming and non-streaming ASR
Figure 3 for Cascaded encoders for unifying streaming and non-streaming ASR
Figure 4 for Cascaded encoders for unifying streaming and non-streaming ASR
Viaarxiv icon

Unsupervised Learning of Disentangled Speech Content and Style Representation

Add code
Bookmark button
Alert button
Oct 24, 2020
Andros Tjandra, Ruoming Pang, Yu Zhang, Shigeki Karita

Figure 1 for Unsupervised Learning of Disentangled Speech Content and Style Representation
Figure 2 for Unsupervised Learning of Disentangled Speech Content and Style Representation
Figure 3 for Unsupervised Learning of Disentangled Speech Content and Style Representation
Figure 4 for Unsupervised Learning of Disentangled Speech Content and Style Representation
Viaarxiv icon

Improving Streaming Automatic Speech Recognition With Non-Streaming Model Distillation On Unsupervised Data

Add code
Bookmark button
Alert button
Oct 22, 2020
Thibault Doutre, Wei Han, Min Ma, Zhiyun Lu, Chung-Cheng Chiu, Ruoming Pang, Arun Narayanan, Ananya Misra, Yu Zhang, Liangliang Cao

Figure 1 for Improving Streaming Automatic Speech Recognition With Non-Streaming Model Distillation On Unsupervised Data
Figure 2 for Improving Streaming Automatic Speech Recognition With Non-Streaming Model Distillation On Unsupervised Data
Figure 3 for Improving Streaming Automatic Speech Recognition With Non-Streaming Model Distillation On Unsupervised Data
Figure 4 for Improving Streaming Automatic Speech Recognition With Non-Streaming Model Distillation On Unsupervised Data
Viaarxiv icon

FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization

Add code
Bookmark button
Alert button
Oct 21, 2020
Jiahui Yu, Chung-Cheng Chiu, Bo Li, Shuo-yiin Chang, Tara N. Sainath, Yanzhang He, Arun Narayanan, Wei Han, Anmol Gulati, Yonghui Wu, Ruoming Pang

Figure 1 for FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization
Figure 2 for FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization
Figure 3 for FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization
Figure 4 for FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization
Viaarxiv icon

Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Oct 20, 2020
Yu Zhang, James Qin, Daniel S. Park, Wei Han, Chung-Cheng Chiu, Ruoming Pang, Quoc V. Le, Yonghui Wu

Figure 1 for Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition
Figure 2 for Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition
Figure 3 for Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition
Figure 4 for Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition
Viaarxiv icon

Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling

Add code
Bookmark button
Alert button
Oct 12, 2020
Jiahui Yu, Wei Han, Anmol Gulati, Chung-Cheng Chiu, Bo Li, Tara N. Sainath, Yonghui Wu, Ruoming Pang

Figure 1 for Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling
Figure 2 for Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling
Figure 3 for Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling
Figure 4 for Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling
Viaarxiv icon

Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition

Add code
Bookmark button
Alert button
Sep 02, 2020
Wei Li, James Qin, Chung-Cheng Chiu, Ruoming Pang, Yanzhang He

Figure 1 for Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Figure 2 for Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Figure 3 for Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Figure 4 for Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Viaarxiv icon

Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus

Add code
Bookmark button
Alert button
Aug 25, 2020
Cal Peyser, Sepand Mavandadi, Tara N. Sainath, James Apfel, Ruoming Pang, Shankar Kumar

Figure 1 for Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus
Figure 2 for Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus
Figure 3 for Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus
Figure 4 for Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus
Viaarxiv icon

RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions

Add code
Bookmark button
Alert button
May 17, 2020
Chung-Cheng Chiu, Arun Narayanan, Wei Han, Rohit Prabhavalkar, Yu Zhang, Navdeep Jaitly, Ruoming Pang, Tara N. Sainath, Patrick Nguyen, Liangliang Cao, Yonghui Wu

Figure 1 for RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions
Figure 2 for RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions
Figure 3 for RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions
Figure 4 for RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions
Viaarxiv icon