Alert button
Picture for David Rybach

David Rybach

Alert button

Large-scale Language Model Rescoring on Long-form Data

Add code
Bookmark button
Alert button
Jun 13, 2023
Tongzhou Chen, Cyril Allauzen, Yinghui Huang, Daniel Park, David Rybach, W. Ronny Huang, Rodrigo Cabrera, Kartik Audhkhasi, Bhuvana Ramabhadran, Pedro J. Moreno, Michael Riley

Figure 1 for Large-scale Language Model Rescoring on Long-form Data
Figure 2 for Large-scale Language Model Rescoring on Long-form Data
Figure 3 for Large-scale Language Model Rescoring on Long-form Data
Figure 4 for Large-scale Language Model Rescoring on Long-form Data
Viaarxiv icon

Alignment Entropy Regularization

Add code
Bookmark button
Alert button
Dec 22, 2022
Ehsan Variani, Ke Wu, David Rybach, Cyril Allauzen, Michael Riley

Figure 1 for Alignment Entropy Regularization
Figure 2 for Alignment Entropy Regularization
Figure 3 for Alignment Entropy Regularization
Figure 4 for Alignment Entropy Regularization
Viaarxiv icon

E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model

Add code
Bookmark button
Alert button
Nov 28, 2022
W. Ronny Huang, Shuo-Yiin Chang, Tara N. Sainath, Yanzhang He, David Rybach, Robert David, Rohit Prabhavalkar, Cyril Allauzen, Cal Peyser, Trevor D. Strohman

Figure 1 for E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model
Figure 2 for E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model
Figure 3 for E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model
Figure 4 for E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model
Viaarxiv icon

Global Normalization for Streaming Speech Recognition in a Modular Framework

Add code
Bookmark button
Alert button
May 26, 2022
Ehsan Variani, Ke Wu, Michael Riley, David Rybach, Matt Shannon, Cyril Allauzen

Figure 1 for Global Normalization for Streaming Speech Recognition in a Modular Framework
Figure 2 for Global Normalization for Streaming Speech Recognition in a Modular Framework
Figure 3 for Global Normalization for Streaming Speech Recognition in a Modular Framework
Figure 4 for Global Normalization for Streaming Speech Recognition in a Modular Framework
Viaarxiv icon

E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR

Add code
Bookmark button
Alert button
Apr 22, 2022
W. Ronny Huang, Shuo-yiin Chang, David Rybach, Rohit Prabhavalkar, Tara N. Sainath, Cyril Allauzen, Cal Peyser, Zhiyun Lu

Figure 1 for E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR
Figure 2 for E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR
Figure 3 for E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR
Figure 4 for E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR
Viaarxiv icon

Improving Rare Word Recognition with LM-aware MWER Training

Add code
Bookmark button
Alert button
Apr 15, 2022
Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prabhavalkar, Ronny Huang, Bhuvana Ramabhadran, Neeraj Gaur, Sepand Mavandadi, Cal Peyser, Trevor Strohman, Yanzhang He, David Rybach

Figure 1 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 2 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 3 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 4 for Improving Rare Word Recognition with LM-aware MWER Training
Viaarxiv icon

Handling Compounding in Mobile Keyboard Input

Add code
Bookmark button
Alert button
Jan 17, 2022
Andreas Kabel, Keith Hall, Tom Ouyang, David Rybach, Daan van Esch, Françoise Beaufays

Viaarxiv icon

Lookup-Table Recurrent Language Models for Long Tail Speech Recognition

Add code
Bookmark button
Alert button
Apr 09, 2021
W. Ronny Huang, Tara N. Sainath, Cal Peyser, Shankar Kumar, David Rybach, Trevor Strohman

Figure 1 for Lookup-Table Recurrent Language Models for Long Tail Speech Recognition
Figure 2 for Lookup-Table Recurrent Language Models for Long Tail Speech Recognition
Figure 3 for Lookup-Table Recurrent Language Models for Long Tail Speech Recognition
Figure 4 for Lookup-Table Recurrent Language Models for Long Tail Speech Recognition
Viaarxiv icon

Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging

Add code
Bookmark button
Alert button
Dec 12, 2020
Rohit Prabhavalkar, Yanzhang He, David Rybach, Sean Campbell, Arun Narayanan, Trevor Strohman, Tara N. Sainath

Figure 1 for Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Figure 2 for Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Figure 3 for Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Figure 4 for Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Viaarxiv icon

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency

Add code
Bookmark button
Alert button
Mar 28, 2020
Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-yiin Chang, Wei Li, Raziel Alvarez, Zhifeng Chen, Chung-Cheng Chiu, David Garcia, Alex Gruenstein, Ke Hu, Minho Jin, Anjuli Kannan, Qiao Liang, Ian McGraw, Cal Peyser, Rohit Prabhavalkar, Golan Pundak, David Rybach, Yuan Shangguan, Yash Sheth, Trevor Strohman, Mirko Visontai, Yonghui Wu, Yu Zhang, Ding Zhao

Figure 1 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 2 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 3 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 4 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Viaarxiv icon