Sanjeev Khudanpur

Injecting Text and Cross-lingual Supervision in Few-shot Learning from Self-Supervised Models


Oct 10, 2021
Matthew Wiesner, Desh Raj, Sanjeev Khudanpur

* © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.


GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio


Jun 13, 2021
Guoguo Chen, Shuzhou Chai, Guanbo Wang, Jiayu Du, Wei-Qiang Zhang, Chao Weng, Dan Su, Daniel Povey, Jan Trmal, Junbo Zhang, Mingjie Jin, Sanjeev Khudanpur, Shinji Watanabe, Shuaijiang Zhao, Wei Zou, Xiangang Li, Xuchen Yao, Yongqing Wang, Yujun Wang, Zhao You, Zhiyong Yan



Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem


Apr 05, 2021
Desh Raj, Sanjeev Khudanpur

* 5 pages, 3 figures. Submitted to INTERSPEECH 2021 


Adversarial Attacks and Defenses for Speech Recognition Systems


Mar 31, 2021
Piotr Żelasko, Sonal Joshi, Yiwen Shao, Jesus Villalba, Jan Trmal, Najim Dehak, Sanjeev Khudanpur

* This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible 


An Asynchronous WFST-Based Decoder For Automatic Speech Recognition


Mar 16, 2021
Hang Lv, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie, Sanjeev Khudanpur

* 5 pages, 5 figures, ICASSP


Learning Feature Weights using Reward Modeling for Denoising Parallel Corpora


Mar 11, 2021
Gaurav Kumar, Philipp Koehn, Sanjeev Khudanpur

* 10 pages, 2 figures 


Learning Policies for Multilingual Training of Neural Machine Translation Systems


Mar 11, 2021
Gaurav Kumar, Philipp Koehn, Sanjeev Khudanpur

* 7 pages, 2 figures 


A Parallelizable Lattice Rescoring Strategy with Neural Language Models


Mar 08, 2021
Ke Li, Daniel Povey, Sanjeev Khudanpur

* To appear at ICASSP 2021. 5 pages, 1 figure 


Wake Word Detection with Streaming Transformers


Feb 08, 2021
Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur

* Accepted at IEEE ICASSP 2021. 5 pages, 3 figures 


The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap


Feb 02, 2021
Shota Horiguchi, Nelson Yalta, Paola Garcia, Yuki Takashima, Yawen Xue, Desh Raj, Zili Huang, Yusuke Fujita, Shinji Watanabe, Sanjeev Khudanpur



Fine-grained activity recognition for assembly videos


Dec 02, 2020
Jonathan D. Jones, Cathryn Cortesa, Amy Shelton, Barbara Landau, Sanjeev Khudanpur, Gregory D. Hager

* 8 pages, 6 figures. Submitted to RA-L/ICRA 2021 


Efficient MDI Adaptation for n-gram Language Models


Aug 05, 2020
Ruizhe Huang, Ke Li, Ashish Arora, Dan Povey, Sanjeev Khudanpur

* To appear in INTERSPEECH 2020. Appendix A of this full version will be filled soon 


Wake Word Detection with Alignment-Free Lattice-Free MMI


May 25, 2020
Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur

* Submitted to Interspeech 2020. 5 pages, 3 figures 


PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR


May 20, 2020
Yiwen Shao, Yiming Wang, Daniel Povey, Sanjeev Khudanpur

* Submitted to Interspeech 2020


CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings


May 02, 2020
Shinji Watanabe, Michael Mandel, Jon Barker, Emmanuel Vincent, Ashish Arora, Xuankai Chang, Sanjeev Khudanpur, Vimal Manohar, Daniel Povey, Desh Raj, David Snyder, Aswin Shanmugam Subramanian, Jan Trmal, Bar Ben Yair, Christoph Boeddeker, Zhaoheng Ni, Yusuke Fujita, Shota Horiguchi, Naoyuki Kanda, Takuya Yoshioka, Neville Ryant



Espresso: A Fast End-to-end Neural Speech Recognition Toolkit


Oct 15, 2019
Yiming Wang, Tongfei Chen, Hainan Xu, Shuoyang Ding, Hang Lv, Yiwen Shao, Nanyun Peng, Lei Xie, Shinji Watanabe, Sanjeev Khudanpur

* Accepted to ASRU 2019 


Probing the Information Encoded in X-vectors


Sep 30, 2019
Desh Raj, David Snyder, Daniel Povey, Sanjeev Khudanpur

* Accepted at IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) 2019 


Probing the Information Encoded in x-vectors


Sep 13, 2019
Desh Raj, David Snyder, Daniel Povey, Sanjeev Khudanpur

* Accepted at IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) 2019 


Low Resource Multi-modal Data Augmentation for End-to-end ASR


Dec 10, 2018
Matthew Wiesner, Adithya Renduchintala, Shinji Watanabe, Chunxi Liu, Najim Dehak, Sanjeev Khudanpur

* This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible 


Building Corpora for Single-Channel Speech Separation Across Multiple Domains


Nov 06, 2018
Matthew Maciejewski, Gregory Sell, Leibny Paola Garcia-Perera, Shinji Watanabe, Sanjeev Khudanpur

* This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible 


Low-Resource Contextual Topic Identification on Speech


Sep 28, 2018
Chunxi Liu, Matthew Wiesner, Shinji Watanabe, Craig Harman, Jan Trmal, Najim Dehak, Sanjeev Khudanpur

* Accepted for publication at 2018 IEEE Workshop on Spoken Language Technology (SLT) 


A GPU-based WFST Decoder with Exact Lattice Generation


Jul 27, 2018
Zhehuai Chen, Justin Luitjens, Hainan Xu, Yiming Wang, Daniel Povey, Sanjeev Khudanpur

* Accepted by INTERSPEECH 2018


Automatic Speech Recognition and Topic Identification for Almost-Zero-Resource Languages


Jun 18, 2018
Matthew Wiesner, Chunxi Liu, Lucas Ondel, Craig Harman, Vimal Manohar, Jan Trmal, Zhongqiang Huang, Najim Dehak, Sanjeev Khudanpur

* Accepted for publication at Interspeech 2018 


Bayesian Models for Unit Discovery on a Very Low Resource Language


Feb 20, 2018
Lucas Ondel, Pierre Godard, Laurent Besacier, Elin Larsen, Mark Hasegawa-Johnson, Odette Scharenborg, Emmanuel Dupoux, Lukas Burget, François Yvon, Sanjeev Khudanpur

* Accepted to ICASSP 2018 


Topic Identification for Speech without ASR


Jul 11, 2017
Chunxi Liu, Jan Trmal, Matthew Wiesner, Craig Harman, Sanjeev Khudanpur

* 5 pages, 2 figures; accepted for publication at Interspeech 2017 


Acoustic data-driven lexicon learning based on a greedy pronunciation selection framework


Jun 12, 2017
Xiaohui Zhang, Vimal Manohar, Daniel Povey, Sanjeev Khudanpur



Using of heterogeneous corpora for training of an ASR system


Jun 01, 2017
Jan Trmal, Gaurav Kumar, Vimal Manohar, Sanjeev Khudanpur, Matt Post, Paul McNamee

