Alert button
Picture for Jonathan Le Roux

Jonathan Le Roux

Alert button

Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos

Add code
Bookmark button
Alert button
Jun 27, 2023
Chiori Hori, Puyuan Peng, David Harwath, Xinyu Liu, Kei Ota, Siddarth Jain, Radu Corcodel, Devesh Jha, Diego Romeres, Jonathan Le Roux

Figure 1 for Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos
Figure 2 for Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos
Figure 3 for Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos
Figure 4 for Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos
Viaarxiv icon

Pac-HuBERT: Self-Supervised Music Source Separation via Primitive Auditory Clustering and Hidden-Unit BERT

Add code
Bookmark button
Alert button
Apr 04, 2023
Ke Chen, Gordon Wichern, François G. Germain, Jonathan Le Roux

Figure 1 for Pac-HuBERT: Self-Supervised Music Source Separation via Primitive Auditory Clustering and Hidden-Unit BERT
Figure 2 for Pac-HuBERT: Self-Supervised Music Source Separation via Primitive Auditory Clustering and Hidden-Unit BERT
Figure 3 for Pac-HuBERT: Self-Supervised Music Source Separation via Primitive Auditory Clustering and Hidden-Unit BERT
Figure 4 for Pac-HuBERT: Self-Supervised Music Source Separation via Primitive Auditory Clustering and Hidden-Unit BERT
Viaarxiv icon

TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings

Add code
Bookmark button
Alert button
Mar 08, 2023
Christoph Boeddeker, Aswin Shanmugam Subramanian, Gordon Wichern, Reinhold Haeb-Umbach, Jonathan Le Roux

Figure 1 for TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings
Figure 2 for TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings
Figure 3 for TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings
Figure 4 for TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings
Viaarxiv icon

Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks

Add code
Bookmark button
Alert button
Dec 14, 2022
Darius Petermann, Gordon Wichern, Aswin Shanmugam Subramanian, Zhong-Qiu Wang, Jonathan Le Roux

Figure 1 for Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks
Figure 2 for Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks
Figure 3 for Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks
Figure 4 for Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks
Viaarxiv icon

Hyperbolic Audio Source Separation

Add code
Bookmark button
Alert button
Dec 09, 2022
Darius Petermann, Gordon Wichern, Aswin Subramanian, Jonathan Le Roux

Figure 1 for Hyperbolic Audio Source Separation
Figure 2 for Hyperbolic Audio Source Separation
Figure 3 for Hyperbolic Audio Source Separation
Figure 4 for Hyperbolic Audio Source Separation
Viaarxiv icon

Latent Iterative Refinement for Modular Source Separation

Add code
Bookmark button
Alert button
Nov 22, 2022
Dimitrios Bralios, Efthymios Tzinis, Gordon Wichern, Paris Smaragdis, Jonathan Le Roux

Figure 1 for Latent Iterative Refinement for Modular Source Separation
Figure 2 for Latent Iterative Refinement for Modular Source Separation
Figure 3 for Latent Iterative Refinement for Modular Source Separation
Figure 4 for Latent Iterative Refinement for Modular Source Separation
Viaarxiv icon

Reverberation as Supervision for Speech Separation

Add code
Bookmark button
Alert button
Nov 15, 2022
Rohith Aralikatti, Christoph Boeddeker, Gordon Wichern, Aswin Shanmugam Subramanian, Jonathan Le Roux

Figure 1 for Reverberation as Supervision for Speech Separation
Figure 2 for Reverberation as Supervision for Speech Separation
Figure 3 for Reverberation as Supervision for Speech Separation
Figure 4 for Reverberation as Supervision for Speech Separation
Viaarxiv icon

Optimal Condition Training for Target Source Separation

Add code
Bookmark button
Alert button
Nov 11, 2022
Efthymios Tzinis, Gordon Wichern, Paris Smaragdis, Jonathan Le Roux

Figure 1 for Optimal Condition Training for Target Source Separation
Figure 2 for Optimal Condition Training for Target Source Separation
Figure 3 for Optimal Condition Training for Target Source Separation
Viaarxiv icon

Cold Diffusion for Speech Enhancement

Add code
Bookmark button
Alert button
Nov 04, 2022
Hao Yen, François G. Germain, Gordon Wichern, Jonathan Le Roux

Figure 1 for Cold Diffusion for Speech Enhancement
Figure 2 for Cold Diffusion for Speech Enhancement
Viaarxiv icon

Towards End-to-end Speaker Diarization in the Wild

Add code
Bookmark button
Alert button
Nov 02, 2022
Zexu Pan, Gordon Wichern, François G. Germain, Aswin Subramanian, Jonathan Le Roux

Figure 1 for Towards End-to-end Speaker Diarization in the Wild
Figure 2 for Towards End-to-end Speaker Diarization in the Wild
Figure 3 for Towards End-to-end Speaker Diarization in the Wild
Figure 4 for Towards End-to-end Speaker Diarization in the Wild
Viaarxiv icon