Contrastive Siamese Network for Semi-supervised Speech Recognition


May 27, 2022
Soheil Khorram, Jaeyoung Kim, Anshuman Tripathi, Han Lu, Qian Zhang, Hasim Sak


Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection


Oct 05, 2021
Wei Xia, Han Lu, Quan Wang, Anshuman Tripathi, Yiling Huang, Ignacio Lopez Moreno, Hasim Sak


Reducing Streaming ASR Model Delay with Self Alignment


May 06, 2021
Jaeyoung Kim, Han Lu, Anshuman Tripathi, Qian Zhang, Hasim Sak

* submitted to INTERSPEECH 2021 

Transformer Transducer: One Model Unifying Streaming and Non-streaming Speech Recognition


Oct 07, 2020
Anshuman Tripathi, Jaeyoung Kim, Qian Zhang, Han Lu, Hasim Sak


A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition


Feb 28, 2020
Erik McDermott, Hasim Sak, Ehsan Variani

* 8 pages, 4 figures, presented at 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2019) 

Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss


Feb 14, 2020
Qian Zhang, Han Lu, Hasim Sak, Anshuman Tripathi, Erik McDermott, Stephen Koo, Shankar Kumar

* This is the final version of the paper submitted to the ICASSP 2020 on Oct 21, 2019 

Adversarial Training for Multilingual Acoustic Modeling


Jun 17, 2019
Ke Hu, Hasim Sak, Hank Liao


Large-Scale Visual Speech Recognition


Oct 01, 2018
Brendan Shillingford, Yannis Assael, Matthew W. Hoffman, Thomas Paine, Cían Hughes, Utsav Prabhu, Hank Liao, Hasim Sak, Kanishka Rao, Lorrayne Bennett, Marie Mulville, Ben Coppin, Ben Laurie, Andrew Senior, Nando de Freitas

