Jonathan Le Roux

IDS, S2A, LTCI

Sequence Transduction with Graph-based Supervision

Nov 01, 2021

The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks

Oct 19, 2021

Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning

Oct 13, 2021

Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy

Oct 11, 2021

Leveraging Low-Distortion Target Estimates for Improved Speech Enhancement

Oct 01, 2021

Visual Scene Graphs for Audio Source Separation

Sep 24, 2021

Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation

Aug 16, 2021

Convolutive Prediction for Reverberant Speech Separation

Aug 16, 2021

On The Compensation Between Magnitude and Phase in Speech Separation

Aug 11, 2021

Optimizing Latency for Online Video Captioning Using Audio-Visual Transformers

Aug 04, 2021