Alert button
Picture for W. Ronny Huang

W. Ronny Huang

Alert button

Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study

Add code
Bookmark button
Alert button
Jan 23, 2024
W. Ronny Huang, Cyril Allauzen, Tongzhou Chen, Kilol Gupta, Ke Hu, James Qin, Yu Zhang, Yongqiang Wang, Shuo-Yiin Chang, Tara N. Sainath

Viaarxiv icon

Large-scale Language Model Rescoring on Long-form Data

Add code
Bookmark button
Alert button
Jun 13, 2023
Tongzhou Chen, Cyril Allauzen, Yinghui Huang, Daniel Park, David Rybach, W. Ronny Huang, Rodrigo Cabrera, Kartik Audhkhasi, Bhuvana Ramabhadran, Pedro J. Moreno, Michael Riley

Figure 1 for Large-scale Language Model Rescoring on Long-form Data
Figure 2 for Large-scale Language Model Rescoring on Long-form Data
Figure 3 for Large-scale Language Model Rescoring on Long-form Data
Figure 4 for Large-scale Language Model Rescoring on Long-form Data
Viaarxiv icon

Semantic Segmentation with Bidirectional Language Models Improves Long-form ASR

Add code
Bookmark button
Alert button
May 28, 2023
W. Ronny Huang, Hao Zhang, Shankar Kumar, Shuo-yiin Chang, Tara N. Sainath

Figure 1 for Semantic Segmentation with Bidirectional Language Models Improves Long-form ASR
Figure 2 for Semantic Segmentation with Bidirectional Language Models Improves Long-form ASR
Figure 3 for Semantic Segmentation with Bidirectional Language Models Improves Long-form ASR
Figure 4 for Semantic Segmentation with Bidirectional Language Models Improves Long-form ASR
Viaarxiv icon

E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model

Add code
Bookmark button
Alert button
Nov 28, 2022
W. Ronny Huang, Shuo-Yiin Chang, Tara N. Sainath, Yanzhang He, David Rybach, Robert David, Rohit Prabhavalkar, Cyril Allauzen, Cal Peyser, Trevor D. Strohman

Figure 1 for E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model
Figure 2 for E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model
Figure 3 for E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model
Figure 4 for E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model
Viaarxiv icon

Modular Hybrid Autoregressive Transducer

Add code
Bookmark button
Alert button
Oct 31, 2022
Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno

Figure 1 for Modular Hybrid Autoregressive Transducer
Figure 2 for Modular Hybrid Autoregressive Transducer
Figure 3 for Modular Hybrid Autoregressive Transducer
Figure 4 for Modular Hybrid Autoregressive Transducer
Viaarxiv icon

E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR

Add code
Bookmark button
Alert button
Apr 22, 2022
W. Ronny Huang, Shuo-yiin Chang, David Rybach, Rohit Prabhavalkar, Tara N. Sainath, Cyril Allauzen, Cal Peyser, Zhiyun Lu

Figure 1 for E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR
Figure 2 for E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR
Figure 3 for E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR
Figure 4 for E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR
Viaarxiv icon

Detecting Unintended Memorization in Language-Model-Fused ASR

Add code
Bookmark button
Alert button
Apr 20, 2022
W. Ronny Huang, Steve Chien, Om Thakkar, Rajiv Mathews

Figure 1 for Detecting Unintended Memorization in Language-Model-Fused ASR
Figure 2 for Detecting Unintended Memorization in Language-Model-Fused ASR
Figure 3 for Detecting Unintended Memorization in Language-Model-Fused ASR
Figure 4 for Detecting Unintended Memorization in Language-Model-Fused ASR
Viaarxiv icon

Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition

Add code
Bookmark button
Alert button
Mar 09, 2022
W. Ronny Huang, Cal Peyser, Tara N. Sainath, Ruoming Pang, Trevor Strohman, Shankar Kumar

Figure 1 for Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition
Figure 2 for Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition
Figure 3 for Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition
Figure 4 for Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition
Viaarxiv icon

Capitalization Normalization for Language Modeling with an Accurate and Efficient Hierarchical RNN Model

Add code
Bookmark button
Alert button
Feb 16, 2022
Hao Zhang, You-Chi Cheng, Shankar Kumar, W. Ronny Huang, Mingqing Chen, Rajiv Mathews

Figure 1 for Capitalization Normalization for Language Modeling with an Accurate and Efficient Hierarchical RNN Model
Figure 2 for Capitalization Normalization for Language Modeling with an Accurate and Efficient Hierarchical RNN Model
Figure 3 for Capitalization Normalization for Language Modeling with an Accurate and Efficient Hierarchical RNN Model
Figure 4 for Capitalization Normalization for Language Modeling with an Accurate and Efficient Hierarchical RNN Model
Viaarxiv icon

Scaling End-to-End Models for Large-Scale Multilingual ASR

Add code
Bookmark button
Alert button
Apr 30, 2021
Bo Li, Ruoming Pang, Tara N. Sainath, Anmol Gulati, Yu Zhang, James Qin, Parisa Haghani, W. Ronny Huang, Min Ma

Figure 1 for Scaling End-to-End Models for Large-Scale Multilingual ASR
Figure 2 for Scaling End-to-End Models for Large-Scale Multilingual ASR
Figure 3 for Scaling End-to-End Models for Large-Scale Multilingual ASR
Viaarxiv icon