Picture for Jonathan Le Roux

Jonathan Le Roux

MERL

SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers

Add code
Apr 02, 2024
Figure 1 for SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers
Figure 2 for SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers
Figure 3 for SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers
Figure 4 for SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers
Viaarxiv icon

Why does music source separation benefit from cacophony?

Add code
Feb 28, 2024
Figure 1 for Why does music source separation benefit from cacophony?
Figure 2 for Why does music source separation benefit from cacophony?
Figure 3 for Why does music source separation benefit from cacophony?
Figure 4 for Why does music source separation benefit from cacophony?
Viaarxiv icon

NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization

Add code
Feb 27, 2024
Viaarxiv icon

GLA-Grad: A Griffin-Lim Extended Waveform Generation Diffusion Model

Add code
Feb 09, 2024
Viaarxiv icon

SpecDiff-GAN: A Spectrally-Shaped Noise Diffusion GAN for Speech and Music Synthesis

Add code
Jan 30, 2024
Viaarxiv icon

NeuroHeed+: Improving Neuro-steered Speaker Extraction with Joint Auditory Attention Detection

Add code
Dec 12, 2023
Viaarxiv icon

Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction

Add code
Oct 30, 2023
Figure 1 for Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction
Figure 2 for Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction
Figure 3 for Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction
Figure 4 for Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction
Viaarxiv icon

Generation or Replication: Auscultating Audio Latent Diffusion Models

Add code
Oct 16, 2023
Figure 1 for Generation or Replication: Auscultating Audio Latent Diffusion Models
Figure 2 for Generation or Replication: Auscultating Audio Latent Diffusion Models
Figure 3 for Generation or Replication: Auscultating Audio Latent Diffusion Models
Figure 4 for Generation or Replication: Auscultating Audio Latent Diffusion Models
Viaarxiv icon

Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation

Add code
Sep 29, 2023
Viaarxiv icon

The Sound Demixing Challenge 2023 $\unicode{x2013}$ Cinematic Demixing Track

Add code
Aug 14, 2023
Figure 1 for The Sound Demixing Challenge 2023 $\unicode{x2013}$ Cinematic Demixing Track
Figure 2 for The Sound Demixing Challenge 2023 $\unicode{x2013}$ Cinematic Demixing Track
Figure 3 for The Sound Demixing Challenge 2023 $\unicode{x2013}$ Cinematic Demixing Track
Figure 4 for The Sound Demixing Challenge 2023 $\unicode{x2013}$ Cinematic Demixing Track
Viaarxiv icon