Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
Semi-Supervised Speech Recognition via Graph-based Temporal Classification

Oct 29, 2020
Niko Moritz, Takaaki Hori, Jonathan Le Roux

* Submitted to ICASSP 2021 

  Access Paper or Ask Questions

Transcription Is All You Need: Learning to Separate Musical Mixtures with Score as Supervision

Oct 22, 2020
Yun-Ning Hung, Gordon Wichern, Jonathan Le Roux


  Access Paper or Ask Questions

Multi-Pass Transformer for Machine Translation

Sep 23, 2020
Peng Gao, Chiori Hori, Shijie Geng, Takaaki Hori, Jonathan Le Roux

* 10 pages, 5 figures and 2 tables 

  Access Paper or Ask Questions

AutoClip: Adaptive Gradient Clipping for Source Separation Networks

Jul 25, 2020
Prem Seetharaman, Gordon Wichern, Bryan Pardo, Jonathan Le Roux

* Accepted at 2020 IEEE International Workshop on Machine Learning for Signal Processing, Sept.\ 21--24, 2020, Espoo, Finland 

  Access Paper or Ask Questions

Spatio-Temporal Scene Graphs for Video Dialog

Jul 08, 2020
Shijie Geng, Peng Gao, Chiori Hori, Jonathan Le Roux, Anoop Cherian


  Access Paper or Ask Questions

Detecting Audio Attacks on ASR Systems with Dropout Uncertainty

Jun 02, 2020
Tejas Jayashankar, Jonathan Le Roux, Pierre Moulin


  Access Paper or Ask Questions

Unsupervised Speaker Adaptation using Attention-based Speaker Memory for End-to-End ASR

Feb 14, 2020
Leda Sarı, Niko Moritz, Takaaki Hori, Jonathan Le Roux

* To appear in Proc. ICASSP 2020 

  Access Paper or Ask Questions

End-to-End Multi-speaker Speech Recognition with Transformer

Feb 13, 2020
Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux, Shinji Watanabe

* To appear in ICASSP 2020 

  Access Paper or Ask Questions

Streaming automatic speech recognition with the transformer model

Jan 09, 2020
Niko Moritz, Takaaki Hori, Jonathan Le Roux


  Access Paper or Ask Questions

Finding Strength in Weakness: Learning to Separate Sounds with Weak Supervision

Nov 06, 2019
Fatemeh Pishdadian, Gordon Wichern, Jonathan Le Roux


  Access Paper or Ask Questions

Bootstrapping deep music separation from primitive auditory grouping principles

Oct 23, 2019
Prem Seetharaman, Gordon Wichern, Jonathan Le Roux, Bryan Pardo


  Access Paper or Ask Questions

MIMO-SPEECH: End-to-End Multi-Channel Multi-Speaker Speech Recognition

Oct 16, 2019
Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux, Shinji Watanabe

* Accepted at ASRU 2019 

  Access Paper or Ask Questions

Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity

Sep 18, 2019
Ethan Manilow, Gordon Wichern, Prem Seetharaman, Jonathan Le Roux

* Accepted for publication at WASPAA 2019 

  Access Paper or Ask Questions

WHAM!: Extending Speech Separation to Noisy Environments

Jul 02, 2019
Gordon Wichern, Joe Antognini, Michael Flynn, Licheng Richard Zhu, Emmett McQuinn, Dwight Crow, Ethan Manilow, Jonathan Le Roux

* Accepted for publication at Interspeech 2019 

  Access Paper or Ask Questions

Universal Sound Separation

May 08, 2019
Ilya Kavalerov, Scott Wisdom, Hakan Erdogan, Brian Patton, Kevin Wilson, Jonathan Le Roux, John R. Hershey

* 5 pages, submitted to WASPAA 2019 

  Access Paper or Ask Questions

Class-conditional embeddings for music source separation

Nov 07, 2018
Prem Seetharaman, Gordon Wichern, Shrikant Venkataramani, Jonathan Le Roux

* 5 pages 

  Access Paper or Ask Questions

Bootstrapping single-channel source separation via unsupervised spatial clustering on stereo mixtures

Nov 06, 2018
Prem Seetharaman, Gordon Wichern, Jonathan Le Roux, Bryan Pardo

* 5 pages, 2 figures 

  Access Paper or Ask Questions

Cycle-consistency training for end-to-end speech recognition

Nov 02, 2018
Takaaki Hori, Ramon Astudillo, Tomoki Hayashi, Yu Zhang, Shinji Watanabe, Jonathan Le Roux

* Submitted to ICASSP'19 

  Access Paper or Ask Questions

Phasebook and Friends: Leveraging Discrete Representations for Source Separation

Oct 02, 2018
Jonathan Le Roux, Gordon Wichern, Shinji Watanabe, Andy Sarroff, John R. Hershey


  Access Paper or Ask Questions

A Purely End-to-end System for Multi-speaker Speech Recognition

May 15, 2018
Hiroshi Seki, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux, John R. Hershey

* ACL 2018 

  Access Paper or Ask Questions

End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction

Apr 26, 2018
Zhong-Qiu Wang, Jonathan Le Roux, DeLiang Wang, John R. Hershey

* Submitted to Interspeech 2018 

  Access Paper or Ask Questions

Deep Clustering and Conventional Networks for Music Separation: Stronger Together

Jun 15, 2017
Yi Luo, Zhuo Chen, John R. Hershey, Jonathan Le Roux, Nima Mesgarani

* Published in ICASSP 2017 

  Access Paper or Ask Questions

Full-Capacity Unitary Recurrent Neural Networks

Oct 31, 2016
Scott Wisdom, Thomas Powers, John R. Hershey, Jonathan Le Roux, Les Atlas

* 9 pages, to appear in NIPS 

  Access Paper or Ask Questions

Single-Channel Multi-Speaker Separation using Deep Clustering

Jul 07, 2016
Yusuf Isik, Jonathan Le Roux, Zhuo Chen, Shinji Watanabe, John R. Hershey


  Access Paper or Ask Questions

Deep clustering: Discriminative embeddings for segmentation and separation

Aug 18, 2015
John R. Hershey, Zhuo Chen, Jonathan Le Roux, Shinji Watanabe

* Originally submitted on June 5, 2015 

  Access Paper or Ask Questions

Deep Unfolding: Model-Based Inspiration of Novel Deep Architectures

Nov 20, 2014
John R. Hershey, Jonathan Le Roux, Felix Weninger

* Added sections on reducing belief propagation to network activation functions, and on conversion between conventional network parameters and BP potentials for binary MRFs. Some bugs and typos were also fixed, and notation made a bit clearer 

  Access Paper or Ask Questions

Block Coordinate Descent for Sparse NMF

Mar 18, 2013
Vamsi K. Potluru, Sergey M. Plis, Jonathan Le Roux, Barak A. Pearlmutter, Vince D. Calhoun, Thomas P. Hayes


  Access Paper or Ask Questions