Ehsan Variani

LAST: Scalable Lattice-Based Speech Modelling in JAX

Apr 25, 2023
Ke Wu, Ehsan Variani, Tom Bagby, Michael Riley

JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition

Feb 16, 2023
Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran

Alignment Entropy Regularization

Dec 22, 2022
Ehsan Variani, Ke Wu, David Rybach, Cyril Allauzen, Michael Riley

Modular Hybrid Autoregressive Transducer

Oct 31, 2022
Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno

UserLibri: A Dataset for ASR Personalization Using Only Text

Jul 02, 2022
Theresa Breiner, Swaroop Ramaswamy, Ehsan Variani, Shefali Garg, Rajiv Mathews, Khe Chai Sim, Kilol Gupta, Mingqing Chen, Lara McConnaughey

Global Normalization for Streaming Speech Recognition in a Modular Framework

May 26, 2022
Ehsan Variani, Ke Wu, Michael Riley, David Rybach, Matt Shannon, Cyril Allauzen

Improving Rare Word Recognition with LM-aware MWER Training

Apr 15, 2022
Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prabhavalkar, Ronny Huang, Bhuvana Ramabhadran, Neeraj Gaur, Sepand Mavandadi, Cal Peyser, Trevor Strohman, Yanzhang He, David Rybach

Cascaded encoders for unifying streaming and non-streaming ASR

Oct 27, 2020
Arun Narayanan, Tara N. Sainath, Ruoming Pang, Jiahui Yu, Chung-Cheng Chiu, Rohit Prabhavalkar, Ehsan Variani, Trevor Strohman

Hybrid Autoregressive Transducer (hat)

Mar 12, 2020
Ehsan Variani, David Rybach, Cyril Allauzen, Michael Riley

A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition

Feb 28, 2020
Erik McDermott, Hasim Sak, Ehsan Variani
