Alert button
Picture for John R. Hershey

John R. Hershey

Alert button

Phasebook and Friends: Leveraging Discrete Representations for Source Separation

Add code
Bookmark button
Alert button
Oct 02, 2018
Jonathan Le Roux, Gordon Wichern, Shinji Watanabe, Andy Sarroff, John R. Hershey

Figure 1 for Phasebook and Friends: Leveraging Discrete Representations for Source Separation
Figure 2 for Phasebook and Friends: Leveraging Discrete Representations for Source Separation
Figure 3 for Phasebook and Friends: Leveraging Discrete Representations for Source Separation
Figure 4 for Phasebook and Friends: Leveraging Discrete Representations for Source Separation
Viaarxiv icon

A Purely End-to-end System for Multi-speaker Speech Recognition

Add code
Bookmark button
Alert button
May 15, 2018
Hiroshi Seki, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux, John R. Hershey

Figure 1 for A Purely End-to-end System for Multi-speaker Speech Recognition
Figure 2 for A Purely End-to-end System for Multi-speaker Speech Recognition
Figure 3 for A Purely End-to-end System for Multi-speaker Speech Recognition
Figure 4 for A Purely End-to-end System for Multi-speaker Speech Recognition
Viaarxiv icon

End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction

Add code
Bookmark button
Alert button
Apr 26, 2018
Zhong-Qiu Wang, Jonathan Le Roux, DeLiang Wang, John R. Hershey

Figure 1 for End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction
Figure 2 for End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction
Figure 3 for End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction
Viaarxiv icon

Deep Long Short-Term Memory Adaptive Beamforming Networks For Multichannel Robust Speech Recognition

Add code
Bookmark button
Alert button
Nov 21, 2017
Zhong Meng, Shinji Watanabe, John R. Hershey, Hakan Erdogan

Figure 1 for Deep Long Short-Term Memory Adaptive Beamforming Networks For Multichannel Robust Speech Recognition
Figure 2 for Deep Long Short-Term Memory Adaptive Beamforming Networks For Multichannel Robust Speech Recognition
Figure 3 for Deep Long Short-Term Memory Adaptive Beamforming Networks For Multichannel Robust Speech Recognition
Viaarxiv icon

Deep Clustering and Conventional Networks for Music Separation: Stronger Together

Add code
Bookmark button
Alert button
Jun 15, 2017
Yi Luo, Zhuo Chen, John R. Hershey, Jonathan Le Roux, Nima Mesgarani

Figure 1 for Deep Clustering and Conventional Networks for Music Separation: Stronger Together
Figure 2 for Deep Clustering and Conventional Networks for Music Separation: Stronger Together
Figure 3 for Deep Clustering and Conventional Networks for Music Separation: Stronger Together
Viaarxiv icon

Multichannel End-to-end Speech Recognition

Add code
Bookmark button
Alert button
Mar 14, 2017
Tsubasa Ochiai, Shinji Watanabe, Takaaki Hori, John R. Hershey

Figure 1 for Multichannel End-to-end Speech Recognition
Figure 2 for Multichannel End-to-end Speech Recognition
Figure 3 for Multichannel End-to-end Speech Recognition
Figure 4 for Multichannel End-to-end Speech Recognition
Viaarxiv icon

Attention-Based Multimodal Fusion for Video Description

Add code
Bookmark button
Alert button
Mar 09, 2017
Chiori Hori, Takaaki Hori, Teng-Yok Lee, Kazuhiro Sumi, John R. Hershey, Tim K. Marks

Figure 1 for Attention-Based Multimodal Fusion for Video Description
Figure 2 for Attention-Based Multimodal Fusion for Video Description
Figure 3 for Attention-Based Multimodal Fusion for Video Description
Figure 4 for Attention-Based Multimodal Fusion for Video Description
Viaarxiv icon

Full-Capacity Unitary Recurrent Neural Networks

Add code
Bookmark button
Alert button
Oct 31, 2016
Scott Wisdom, Thomas Powers, John R. Hershey, Jonathan Le Roux, Les Atlas

Viaarxiv icon

Single-Channel Multi-Speaker Separation using Deep Clustering

Add code
Bookmark button
Alert button
Jul 07, 2016
Yusuf Isik, Jonathan Le Roux, Zhuo Chen, Shinji Watanabe, John R. Hershey

Figure 1 for Single-Channel Multi-Speaker Separation using Deep Clustering
Figure 2 for Single-Channel Multi-Speaker Separation using Deep Clustering
Figure 3 for Single-Channel Multi-Speaker Separation using Deep Clustering
Figure 4 for Single-Channel Multi-Speaker Separation using Deep Clustering
Viaarxiv icon

Global-Local Face Upsampling Network

Add code
Bookmark button
Alert button
Apr 27, 2016
Oncel Tuzel, Yuichi Taguchi, John R. Hershey

Figure 1 for Global-Local Face Upsampling Network
Figure 2 for Global-Local Face Upsampling Network
Figure 3 for Global-Local Face Upsampling Network
Figure 4 for Global-Local Face Upsampling Network
Viaarxiv icon