Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Variable Attention Masking for Configurable Transformer Transducer Speech Recognition


Nov 02, 2022
Pawel Swietojanski, Stefan Braun, Dogan Can, Thiago Fraga da Silva, Arnab Ghoshal, Takaaki Hori, Roger Hsiao, Henry Mason, Erik McDermott, Honza Silovsky, Ruchir Travadi, Xiaodan Zhuang

Add code

* 5 pages, 4 figures, 2 Tables 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Extended Graph Temporal Classification for Multi-Speaker End-to-End ASR


Mar 01, 2022
Xuankai Chang, Niko Moritz, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux

Add code

* To appear in ICASSP2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Sequence Transduction with Graph-based Supervision


Nov 01, 2021
Niko Moritz, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux

Add code

* Submitted to IEEE ICASSP 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning


Oct 13, 2021
Ankit P. Shah, Shijie Geng, Peng Gao, Anoop Cherian, Takaaki Hori, Tim K. Marks, Jonathan Le Roux, Chiori Hori

Add code

* https://dstc10.dstc.community/home and https://github.com/dialogtekgeek/AVSD-DSTC10_Official/ 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy


Oct 11, 2021
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori

Add code

* Submitted to ICASSP2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Optimizing Latency for Online Video CaptioningUsing Audio-Visual Transformers


Aug 04, 2021
Chiori Hori, Takaaki Hori, Jonathan Le Roux

Add code

* Interspeech 2021 accepted 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Dual Causal/Non-Causal Self-Attention for Streaming End-to-End Speech Recognition


Jul 02, 2021
Niko Moritz, Takaaki Hori, Jonathan Le Roux

Add code

* Accepted to Interspeech 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition


Jun 16, 2021
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori

Add code

* Accepted to Interspeech 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Advanced Long-context End-to-end Speech Recognition Using Context-expanded Transformers


Apr 19, 2021
Takaaki Hori, Niko Moritz, Chiori Hori, Jonathan Le Roux

Add code

* Submitted to INTERSPEECH 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Capturing Multi-Resolution Context by Dilated Self-Attention


Apr 07, 2021
Niko Moritz, Takaaki Hori, Jonathan Le Roux

Add code

* In Proc. ICASSP 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
3
4
>>