Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Locate This, Not That: Class-Conditioned Sound Event DOA Estimation


Mar 08, 2022
Olga Slizovskaia, Gordon Wichern, Zhong-Qiu Wang, Jonathan Le Roux

Add code

* Accepted for publication at ICASSP 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Extended Graph Temporal Classification for Multi-Speaker End-to-End ASR


Mar 01, 2022
Xuankai Chang, Niko Moritz, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux

Add code

* To appear in ICASSP2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering


Feb 18, 2022
Anoop Cherian, Chiori Hori, Tim K. Marks, Jonathan Le Roux

Add code

* Accepted at AAAI 2022 (Oral) 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Sequence Transduction with Graph-based Supervision


Nov 01, 2021
Niko Moritz, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux

Add code

* Submitted to IEEE ICASSP 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks


Oct 19, 2021
Darius Petermann, Gordon Wichern, Zhong-Qiu Wang, Jonathan Le Roux

Add code

* Submitted to ICASSP2022. For resources and examples, see https://cocktail-fork.github.io 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning


Oct 13, 2021
Ankit P. Shah, Shijie Geng, Peng Gao, Anoop Cherian, Takaaki Hori, Tim K. Marks, Jonathan Le Roux, Chiori Hori

Add code

* https://dstc10.dstc.community/home and https://github.com/dialogtekgeek/AVSD-DSTC10_Official/ 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy


Oct 11, 2021
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori

Add code

* Submitted to ICASSP2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Leveraging Low-Distortion Target Estimates for Improved Speech Enhancement


Oct 01, 2021
Zhong-Qiu Wang, Gordon Wichern, Jonathan Le Roux

Add code

* in submission 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Visual Scene Graphs for Audio Source Separation


Sep 24, 2021
Moitreya Chatterjee, Jonathan Le Roux, Narendra Ahuja, Anoop Cherian

Add code

* Accepted at ICCV 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation


Aug 16, 2021
Zhong-Qiu Wang, Gordon Wichern, Jonathan Le Roux

Add code

* 16 pages, 4 figures 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
<<
1
2
3
4
5
6
>>