Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser



Sonal Joshi , Saurabh Kataria , Yiwen Shao , Piotr Zelasko , Jesus Villalba , Sanjeev Khudanpur , Najim Dehak

* Submitted to Interspeech 2022 

   Access Paper or Ask Questions

PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification



Hexin Liu , Leibny Paola Garcia Perera , Andy W. H. Khong , Suzy J. Styles , Sanjeev Khudanpur

* Submitted to Interspeech 2022, updated to the submitted version 

   Access Paper or Ask Questions

Investigating self-supervised learning for speech enhancement and separation



Zili Huang , Shinji Watanabe , Shu-wen Yang , Paola Garcia , Sanjeev Khudanpur

* To appear in ICASSP 2022 

   Access Paper or Ask Questions

Enhance Language Identification using Dual-mode Model with Knowledge Distillation



Hexin Liu , Leibny Paola Garcia Perera , Andy W. H. Khong , Justin Dauwels , Suzy J. Styles , Sanjeev Khudanpur

* Submitted to Odyssey 2022 

   Access Paper or Ask Questions

Lhotse: a speech data representation library for the modern deep learning ecosystem



Piotr Żelasko , Daniel Povey , Jan "Yenda" Trmal , Sanjeev Khudanpur

* Accepted for presentation at NeurIPS 2021 Data-Centric AI (DCAI) Workshop 

   Access Paper or Ask Questions

Injecting Text and Cross-lingual Supervision in Few-shot Learning from Self-Supervised Models



Matthew Wiesner , Desh Raj , Sanjeev Khudanpur

* \c{opyright} 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works 

   Access Paper or Ask Questions

GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio



Guoguo Chen , Shuzhou Chai , Guanbo Wang , Jiayu Du , Wei-Qiang Zhang , Chao Weng , Dan Su , Daniel Povey , Jan Trmal , Junbo Zhang , Mingjie Jin , Sanjeev Khudanpur , Shinji Watanabe , Shuaijiang Zhao , Wei Zou , Xiangang Li , Xuchen Yao , Yongqing Wang , Yujun Wang , Zhao You , Zhiyong Yan


   Access Paper or Ask Questions

Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem



Desh Raj , Sanjeev Khudanpur

* 5 pages, 3 figures. Submitted to INTERSPEECH 2021 

   Access Paper or Ask Questions

Adversarial Attacks and Defenses for Speech Recognition Systems



Piotr Żelasko , Sonal Joshi , Yiwen Shao , Jesus Villalba , Jan Trmal , Najim Dehak , Sanjeev Khudanpur

* This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible 

   Access Paper or Ask Questions

An Asynchronous WFST-Based Decoder For Automatic Speech Recognition



Hang Lv , Zhehuai Chen , Hainan Xu , Daniel Povey , Lei Xie , Sanjeev Khudanpur

* 5 pages, 5 figures, icassp 

   Access Paper or Ask Questions

1
2
3
4
>>