Alert button
Picture for Trevor Strohman

Trevor Strohman

Alert button

Improving Rare Word Recognition with LM-aware MWER Training

Apr 15, 2022
Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prabhavalkar, Ronny Huang, Bhuvana Ramabhadran, Neeraj Gaur, Sepand Mavandadi, Cal Peyser, Trevor Strohman, Yanzhang He, David Rybach

Figure 1 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 2 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 3 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 4 for Improving Rare Word Recognition with LM-aware MWER Training
Viaarxiv icon

A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes

Apr 13, 2022
Shaojin Ding, Weiran Wang, Ding Zhao, Tara N. Sainath, Yanzhang He, Robert David, Rami Botros, Xin Wang, Rina Panigrahy, Qiao Liang, Dongseong Hwang, Ian McGraw, Rohit Prabhavalkar, Trevor Strohman

Figure 1 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Figure 2 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Figure 3 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Figure 4 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Viaarxiv icon

Pseudo Label Is Better Than Human Label

Mar 28, 2022
Dongseong Hwang, Khe Chai Sim, Zhouyuan Huo, Trevor Strohman

Figure 1 for Pseudo Label Is Better Than Human Label
Figure 2 for Pseudo Label Is Better Than Human Label
Figure 3 for Pseudo Label Is Better Than Human Label
Figure 4 for Pseudo Label Is Better Than Human Label
Viaarxiv icon

Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition

Mar 09, 2022
W. Ronny Huang, Cal Peyser, Tara N. Sainath, Ruoming Pang, Trevor Strohman, Shankar Kumar

Figure 1 for Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition
Figure 2 for Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition
Figure 3 for Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition
Figure 4 for Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition
Viaarxiv icon

Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning

Oct 13, 2021
Dongseong Hwang, Ananya Misra, Zhouyuan Huo, Nikhil Siddhartha, Shefali Garg, David Qiu, Khe Chai Sim, Trevor Strohman, Françoise Beaufays, Yanzhang He

Figure 1 for Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning
Figure 2 for Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning
Figure 3 for Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning
Figure 4 for Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning
Viaarxiv icon

Input Length Matters: An Empirical Study Of RNN-T And MWER Training For Long-form Telephony Speech Recognition

Oct 08, 2021
Zhiyun Lu, Yanwei Pan, Thibault Doutre, Liangliang Cao, Rohit Prabhavalkar, Chao Zhang, Trevor Strohman

Figure 1 for Input Length Matters: An Empirical Study Of RNN-T And MWER Training For Long-form Telephony Speech Recognition
Figure 2 for Input Length Matters: An Empirical Study Of RNN-T And MWER Training For Long-form Telephony Speech Recognition
Figure 3 for Input Length Matters: An Empirical Study Of RNN-T And MWER Training For Long-form Telephony Speech Recognition
Figure 4 for Input Length Matters: An Empirical Study Of RNN-T And MWER Training For Long-form Telephony Speech Recognition
Viaarxiv icon

Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition

Oct 07, 2021
Tsendsuren Munkhdalai, Khe Chai Sim, Angad Chandorkar, Fan Gao, Mason Chua, Trevor Strohman, Françoise Beaufays

Figure 1 for Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition
Figure 2 for Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition
Figure 3 for Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition
Figure 4 for Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition
Viaarxiv icon

Large-scale ASR Domain Adaptation by Self- and Semi-supervised Learning

Oct 01, 2021
Dongseong Hwang, Ananya Misra, Zhouyuan Huo, Nikhil Siddhartha, Shefali Garg, David Qiu, Khe Chai Sim, Trevor Strohman, Françoise Beaufays, Yanzhang He

Figure 1 for Large-scale ASR Domain Adaptation by Self- and Semi-supervised Learning
Figure 2 for Large-scale ASR Domain Adaptation by Self- and Semi-supervised Learning
Figure 3 for Large-scale ASR Domain Adaptation by Self- and Semi-supervised Learning
Figure 4 for Large-scale ASR Domain Adaptation by Self- and Semi-supervised Learning
Viaarxiv icon

Incremental Layer-wise Self-Supervised Learning for Efficient Speech Domain Adaptation On Device

Oct 01, 2021
Zhouyuan Huo, Dongseong Hwang, Khe Chai Sim, Shefali Garg, Ananya Misra, Nikhil Siddhartha, Trevor Strohman, Françoise Beaufays

Figure 1 for Incremental Layer-wise Self-Supervised Learning for Efficient Speech Domain Adaptation On Device
Figure 2 for Incremental Layer-wise Self-Supervised Learning for Efficient Speech Domain Adaptation On Device
Figure 3 for Incremental Layer-wise Self-Supervised Learning for Efficient Speech Domain Adaptation On Device
Figure 4 for Incremental Layer-wise Self-Supervised Learning for Efficient Speech Domain Adaptation On Device
Viaarxiv icon

Lookup-Table Recurrent Language Models for Long Tail Speech Recognition

Apr 09, 2021
W. Ronny Huang, Tara N. Sainath, Cal Peyser, Shankar Kumar, David Rybach, Trevor Strohman

Figure 1 for Lookup-Table Recurrent Language Models for Long Tail Speech Recognition
Figure 2 for Lookup-Table Recurrent Language Models for Long Tail Speech Recognition
Figure 3 for Lookup-Table Recurrent Language Models for Long Tail Speech Recognition
Figure 4 for Lookup-Table Recurrent Language Models for Long Tail Speech Recognition
Viaarxiv icon