Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

TorchAudio: Building Blocks for Audio and Speech Processing



Yao-Yuan Yang , Moto Hira , Zhaoheng Ni , Anjali Chourdia , Artyom Astafurov , Caroline Chen , Ching-Feng Yeh , Christian Puhrsch , David Pollack , Dmitriy Genzel , Donny Greenberg , Edward Z. Yang , Jason Lian , Jay Mahadeokar , Jeff Hwang , Ji Chen , Peter Goldsborough , Prabhat Roy , Sean Narenthiran , Shinji Watanabe , Soumith Chintala , Vincent Quenneville-Bélair , Yangyang Shi

* Submitted to ICASSP 2022 

   Access Paper or Ask Questions

Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency



Yangyang Shi , Varun Nagaraja , Chunyang Wu , Jay Mahadeokar , Duc Le , Rohit Prabhavalkar , Alex Xiao , Ching-Feng Yeh , Julian Chan , Christian Fuegen , Ozlem Kalinli , Michael L. Seltzer

* 5 pages, 2 figures, submitted Interspeech 2021 

   Access Paper or Ask Questions

Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding



Suyoun Kim , Abhinav Arora , Duc Le , Ching-Feng Yeh , Christian Fuegen , Ozlem Kalinli , Michael L. Seltzer

* submitted to Interspeech 2021 

   Access Paper or Ask Questions

Alignment Restricted Streaming Recurrent Neural Network Transducer



Jay Mahadeokar , Yuan Shangguan , Duc Le , Gil Keren , Hang Su , Thong Le , Ching-Feng Yeh , Christian Fuegen , Michael L. Seltzer

* Accepted for presentation at IEEE Spoken Language Technology Workshop (SLT) 2021 

   Access Paper or Ask Questions

Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition



Ching-Feng Yeh , Yongqiang Wang , Yangyang Shi , Chunyang Wu , Frank Zhang , Julian Chan , Michael L. Seltzer

* IEEE Spoken Language Technology Workshop 2021 

   Access Paper or Ask Questions

Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications



Yongqiang Wang , Yangyang Shi , Frank Zhang , Chunyang Wu , Julian Chan , Ching-Feng Yeh , Alex Xiao

* submitted to ICASSP2021 

   Access Paper or Ask Questions

Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition



Yangyang Shi , Yongqiang Wang , Chunyang Wu , Ching-Feng Yeh , Julian Chan , Frank Zhang , Duc Le , Mike Seltzer

* 5 pages, 2 figures, submitted to ICASSP 2021 

   Access Paper or Ask Questions

Weak-Attention Suppression For Transformer Based Speech Recognition



Yangyang Shi , Yongqiang Wang , Chunyang Wu , Christian Fuegen , Frank Zhang , Duc Le , Ching-Feng Yeh , Michael L. Seltzer

* submitted to interspeech 2020 

   Access Paper or Ask Questions

1
2
>>