Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition



Niko Moritz , Frank Seide , Duc Le , Jay Mahadeokar , Christian Fuegen

* Submitted to Interspeech 2022 

   Access Paper or Ask Questions

Federated Domain Adaptation for ASR with Full Self-Supervision



Junteng Jia , Jay Mahadeokar , Weiyi Zheng , Yuan Shangguan , Ozlem Kalinli , Frank Seide


   Access Paper or Ask Questions

Streaming parallel transducer beam search with fast-slow cascaded encoders



Jay Mahadeokar , Yangyang Shi , Ke Li , Duc Le , Jiedan Zhu , Vikas Chandra , Ozlem Kalinli , Michael L Seltzer

* 5 pages, 2 figures, Interspeech 2022 submission 

   Access Paper or Ask Questions

TorchAudio: Building Blocks for Audio and Speech Processing



Yao-Yuan Yang , Moto Hira , Zhaoheng Ni , Anjali Chourdia , Artyom Astafurov , Caroline Chen , Ching-Feng Yeh , Christian Puhrsch , David Pollack , Dmitriy Genzel , Donny Greenberg , Edward Z. Yang , Jason Lian , Jay Mahadeokar , Jeff Hwang , Ji Chen , Peter Goldsborough , Prabhat Roy , Sean Narenthiran , Shinji Watanabe , Soumith Chintala , Vincent Quenneville-Bélair , Yangyang Shi

* Submitted to ICASSP 2022 

   Access Paper or Ask Questions

Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution



Yangyang Shi , Chunyang Wu , Dilin Wang , Alex Xiao , Jay Mahadeokar , Xiaohui Zhang , Chunxi Liu , Ke Li , Yuan Shangguan , Varun Nagaraja , Ozlem Kalinli , Mike Seltzer

* 5 pages, 3 figures, submit to ICASSP 2022 

   Access Paper or Ask Questions

Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios



Jay Mahadeokar , Yangyang Shi , Yuan Shangguan , Chunyang Wu , Alex Xiao , Hang Su , Duc Le , Ozlem Kalinli , Christian Fuegen , Michael L. Seltzer

* Submitted to Interspeech 2021 (under review) 

   Access Paper or Ask Questions

Dissecting User-Perceived Latency of On-Device E2E Speech Recognition



Yuan Shangguan , Rohit Prabhavalkar , Hang Su , Jay Mahadeokar , Yangyang Shi , Jiatong Zhou , Chunyang Wu , Duc Le , Ozlem Kalinli , Christian Fuegen , Michael L. Seltzer

* Submitted to Interspeech 2021 

   Access Paper or Ask Questions

Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion



Duc Le , Mahaveer Jain , Gil Keren , Suyoun Kim , Yangyang Shi , Jay Mahadeokar , Julian Chan , Yuan Shangguan , Christian Fuegen , Ozlem Kalinli , Yatharth Saraf , Michael L. Seltzer

* Submitted to INTERSPEECH 2021 

   Access Paper or Ask Questions

Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency



Yangyang Shi , Varun Nagaraja , Chunyang Wu , Jay Mahadeokar , Duc Le , Rohit Prabhavalkar , Alex Xiao , Ching-Feng Yeh , Julian Chan , Christian Fuegen , Ozlem Kalinli , Michael L. Seltzer

* 5 pages, 2 figures, submitted Interspeech 2021 

   Access Paper or Ask Questions

Memory-efficient Speech Recognition on Smart Devices



Ganesh Venkatesh , Alagappan Valliappan , Jay Mahadeokar , Yuan Shangguan , Christian Fuegen , Michael L. Seltzer , Vikas Chandra

* ICASSP 2021 

   Access Paper or Ask Questions

1
2
>>