Picture for Jonathan Le Roux

Jonathan Le Roux

MERL

Disentangled Acoustic Fields For Multimodal Physical Scene Understanding

Add code
Jul 16, 2024
Figure 1 for Disentangled Acoustic Fields For Multimodal Physical Scene Understanding
Figure 2 for Disentangled Acoustic Fields For Multimodal Physical Scene Understanding
Figure 3 for Disentangled Acoustic Fields For Multimodal Physical Scene Understanding
Figure 4 for Disentangled Acoustic Fields For Multimodal Physical Scene Understanding
Viaarxiv icon

Speech dereverberation constrained on room impulse response characteristics

Add code
Jul 10, 2024
Figure 1 for Speech dereverberation constrained on room impulse response characteristics
Figure 2 for Speech dereverberation constrained on room impulse response characteristics
Figure 3 for Speech dereverberation constrained on room impulse response characteristics
Viaarxiv icon

Sound Event Bounding Boxes

Add code
Jun 06, 2024
Figure 1 for Sound Event Bounding Boxes
Figure 2 for Sound Event Bounding Boxes
Figure 3 for Sound Event Bounding Boxes
Figure 4 for Sound Event Bounding Boxes
Viaarxiv icon

SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers

Add code
Apr 02, 2024
Figure 1 for SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers
Figure 2 for SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers
Figure 3 for SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers
Figure 4 for SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers
Viaarxiv icon

Why does music source separation benefit from cacophony?

Add code
Feb 28, 2024
Figure 1 for Why does music source separation benefit from cacophony?
Figure 2 for Why does music source separation benefit from cacophony?
Figure 3 for Why does music source separation benefit from cacophony?
Figure 4 for Why does music source separation benefit from cacophony?
Viaarxiv icon

NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization

Add code
Feb 27, 2024
Viaarxiv icon

GLA-Grad: A Griffin-Lim Extended Waveform Generation Diffusion Model

Add code
Feb 09, 2024
Viaarxiv icon

SpecDiff-GAN: A Spectrally-Shaped Noise Diffusion GAN for Speech and Music Synthesis

Add code
Jan 30, 2024
Viaarxiv icon

NeuroHeed+: Improving Neuro-steered Speaker Extraction with Joint Auditory Attention Detection

Add code
Dec 12, 2023
Figure 1 for NeuroHeed+: Improving Neuro-steered Speaker Extraction with Joint Auditory Attention Detection
Figure 2 for NeuroHeed+: Improving Neuro-steered Speaker Extraction with Joint Auditory Attention Detection
Figure 3 for NeuroHeed+: Improving Neuro-steered Speaker Extraction with Joint Auditory Attention Detection
Figure 4 for NeuroHeed+: Improving Neuro-steered Speaker Extraction with Joint Auditory Attention Detection
Viaarxiv icon

Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction

Add code
Oct 30, 2023
Figure 1 for Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction
Figure 2 for Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction
Figure 3 for Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction
Figure 4 for Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction
Viaarxiv icon