Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Frank Zhang

Scaling ASR Improves Zero and Few Shot Learning


Nov 29, 2021
Alex Xiao, Weiyi Zheng, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed


  Access Paper or Ask Questions

Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings


Oct 08, 2021
Jialu Li, Vimal Manohar, Pooja Chitkara, Andros Tjandra, Michael Picheny, Frank Zhang, Xiaohui Zhang, Yatharth Saraf

* Submitted to ICASSP 2022 

  Access Paper or Ask Questions

Improved Language Identification Through Cross-Lingual Self-Supervised Learning


Aug 04, 2021
Andros Tjandra, Diptanu Gon Choudhury, Frank Zhang, Kritika Singh, Alexis Conneau, Alexei Baevski, Assaf Sela, Yatharth Saraf, Michael Auli

* Submitted to ASRU 2021 

  Access Paper or Ask Questions

On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models


Jul 09, 2021
Xiaohui Zhang, Vimal Manohar, David Zhang, Frank Zhang, Yangyang Shi, Nayan Singhal, Julian Chan, Fuchun Peng, Yatharth Saraf, Mike Seltzer

* submitted to ASRU 2021 

  Access Paper or Ask Questions

Improving RNN Transducer Based ASR with Auxiliary Tasks


Nov 09, 2020
Chunxi Liu, Frank Zhang, Duc Le, Suyoun Kim, Yatharth Saraf, Geoffrey Zweig

* Accepted for publication at IEEE Spoken Language Technology Workshop (SLT), 2021 

  Access Paper or Ask Questions

Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition


Nov 03, 2020
Ching-Feng Yeh, Yongqiang Wang, Yangyang Shi, Chunyang Wu, Frank Zhang, Julian Chan, Michael L. Seltzer

* IEEE Spoken Language Technology Workshop 2021 

  Access Paper or Ask Questions

Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications


Oct 29, 2020
Yongqiang Wang, Yangyang Shi, Frank Zhang, Chunyang Wu, Julian Chan, Ching-Feng Yeh, Alex Xiao

* submitted to ICASSP2021 

  Access Paper or Ask Questions

Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition


Oct 29, 2020
Yangyang Shi, Yongqiang Wang, Chunyang Wu, Ching-Feng Yeh, Julian Chan, Frank Zhang, Duc Le, Mike Seltzer

* 5 pages, 2 figures, submitted to ICASSP 2021 

  Access Paper or Ask Questions

Fast, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces


May 19, 2020
Frank Zhang, Yongqiang Wang, Xiaohui Zhang, Chunxi Liu, Yatharth Saraf, Geoffrey Zweig

* submitted to interspeech 2020 

  Access Paper or Ask Questions

Weak-Attention Suppression For Transformer Based Speech Recognition


May 18, 2020
Yangyang Shi, Yongqiang Wang, Chunyang Wu, Christian Fuegen, Frank Zhang, Duc Le, Ching-Feng Yeh, Michael L. Seltzer

* submitted to interspeech 2020 

  Access Paper or Ask Questions

Streaming Transformer-based Acoustic Models Using Self-attention with Augmented Memory


May 16, 2020
Chunyang Wu, Yongqiang Wang, Yangyang Shi, Ching-Feng Yeh, Frank Zhang

* submitted to Interspeech 2020 

  Access Paper or Ask Questions

Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model


May 15, 2020
Da-Rong Liu, Chunxi Liu, Frank Zhang, Gabriel Synnaeve, Yatharth Saraf, Geoffrey Zweig


  Access Paper or Ask Questions

Training ASR models by Generation of Contextual Information


Oct 27, 2019
Kritika Singh, Dmytro Okhonko, Jun Liu, Yongqiang Wang, Frank Zhang, Ross Girshick, Sergey Edunov, Fuchun Peng, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed


  Access Paper or Ask Questions

Deja-vu: Double Feature Presentation in Deep Transformer Networks


Oct 23, 2019
Andros Tjandra, Chunxi Liu, Frank Zhang, Xiaohui Zhang, Yongqiang Wang, Gabriel Synnaeve, Satoshi Nakamura, Geoffrey Zweig


  Access Paper or Ask Questions

Transformer-based Acoustic Modeling for Hybrid Speech Recognition


Oct 22, 2019
Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer


  Access Paper or Ask Questions