Unsupervised Speaker Adaptation using Attention-based Speaker Memory for End-to-End ASR

Feb 14, 2020
Leda Sarı, Niko Moritz, Takaaki Hori, Jonathan Le Roux

* To appear in Proc. ICASSP 2020 

  Access Model/Code and Paper
End-to-End Multi-speaker Speech Recognition with Transformer

Feb 13, 2020
Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux, Shinji Watanabe

* To appear in ICASSP 2020 

  Access Model/Code and Paper
Streaming automatic speech recognition with the transformer model

Jan 09, 2020
Niko Moritz, Takaaki Hori, Jonathan Le Roux


  Access Model/Code and Paper
Finding Strength in Weakness: Learning to Separate Sounds with Weak Supervision

Nov 06, 2019
Fatemeh Pishdadian, Gordon Wichern, Jonathan Le Roux


  Access Model/Code and Paper
Bootstrapping deep music separation from primitive auditory grouping principles

Oct 23, 2019
Prem Seetharaman, Gordon Wichern, Jonathan Le Roux, Bryan Pardo


  Access Model/Code and Paper
MIMO-SPEECH: End-to-End Multi-Channel Multi-Speaker Speech Recognition

Oct 16, 2019
Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux, Shinji Watanabe

* Accepted at ASRU 2019 

  Access Model/Code and Paper
Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity

Sep 18, 2019
Ethan Manilow, Gordon Wichern, Prem Seetharaman, Jonathan Le Roux

* Accepted for publication at WASPAA 2019 

  Access Model/Code and Paper
WHAM!: Extending Speech Separation to Noisy Environments

Jul 02, 2019
Gordon Wichern, Joe Antognini, Michael Flynn, Licheng Richard Zhu, Emmett McQuinn, Dwight Crow, Ethan Manilow, Jonathan Le Roux

* Accepted for publication at Interspeech 2019 

  Access Model/Code and Paper
Universal Sound Separation

May 08, 2019
Ilya Kavalerov, Scott Wisdom, Hakan Erdogan, Brian Patton, Kevin Wilson, Jonathan Le Roux, John R. Hershey

* 5 pages, submitted to WASPAA 2019 

  Access Model/Code and Paper
Class-conditional embeddings for music source separation

Nov 07, 2018
Prem Seetharaman, Gordon Wichern, Shrikant Venkataramani, Jonathan Le Roux

* 5 pages 

  Access Model/Code and Paper
Bootstrapping single-channel source separation via unsupervised spatial clustering on stereo mixtures

Nov 06, 2018
Prem Seetharaman, Gordon Wichern, Jonathan Le Roux, Bryan Pardo

* 5 pages, 2 figures 

  Access Model/Code and Paper
Cycle-consistency training for end-to-end speech recognition

Nov 02, 2018
Takaaki Hori, Ramon Astudillo, Tomoki Hayashi, Yu Zhang, Shinji Watanabe, Jonathan Le Roux

* Submitted to ICASSP'19 

  Access Model/Code and Paper
Phasebook and Friends: Leveraging Discrete Representations for Source Separation

Oct 02, 2018
Jonathan Le Roux, Gordon Wichern, Shinji Watanabe, Andy Sarroff, John R. Hershey


  Access Model/Code and Paper
A Purely End-to-end System for Multi-speaker Speech Recognition

May 15, 2018
Hiroshi Seki, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux, John R. Hershey

* ACL 2018 

  Access Model/Code and Paper
End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction

Apr 26, 2018
Zhong-Qiu Wang, Jonathan Le Roux, DeLiang Wang, John R. Hershey

* Submitted to Interspeech 2018 

  Access Model/Code and Paper
Deep Clustering and Conventional Networks for Music Separation: Stronger Together

Jun 15, 2017
Yi Luo, Zhuo Chen, John R. Hershey, Jonathan Le Roux, Nima Mesgarani

* Published in ICASSP 2017 

  Access Model/Code and Paper
Full-Capacity Unitary Recurrent Neural Networks

Oct 31, 2016
Scott Wisdom, Thomas Powers, John R. Hershey, Jonathan Le Roux, Les Atlas

* 9 pages, to appear in NIPS 

  Access Model/Code and Paper
Single-Channel Multi-Speaker Separation using Deep Clustering

Jul 07, 2016
Yusuf Isik, Jonathan Le Roux, Zhuo Chen, Shinji Watanabe, John R. Hershey


  Access Model/Code and Paper
Deep clustering: Discriminative embeddings for segmentation and separation

Aug 18, 2015
John R. Hershey, Zhuo Chen, Jonathan Le Roux, Shinji Watanabe

* Originally submitted on June 5, 2015 

  Access Model/Code and Paper
Deep Unfolding: Model-Based Inspiration of Novel Deep Architectures

Nov 20, 2014
John R. Hershey, Jonathan Le Roux, Felix Weninger

* Added sections on reducing belief propagation to network activation functions, and on conversion between conventional network parameters and BP potentials for binary MRFs. Some bugs and typos were also fixed, and notation made a bit clearer 

  Access Model/Code and Paper
Block Coordinate Descent for Sparse NMF

Mar 18, 2013
Vamsi K. Potluru, Sergey M. Plis, Jonathan Le Roux, Barak A. Pearlmutter, Vince D. Calhoun, Thomas P. Hayes


  Access Model/Code and Paper