Gael Le Lan

Exploring Audio Hallucination in Egocentric Video Understanding

Apr 26, 2026

Neural Computers

Apr 07, 2026

Free-Range Gaussians: Non-Grid-Aligned Generative 3D Gaussian Reconstruction

Apr 06, 2026

EgoAVU: Egocentric Audio-Visual Understanding

Feb 05, 2026

Conditional Flow Matching for Visually-Guided Acoustic Highlighting

Feb 03, 2026

SLAP: Scalable Language-Audio Pretraining with Variable-Duration Audio and Multi-Objective Training

Jan 18, 2026

High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching

Jul 04, 2024

Masked Audio Generation using a Single Non-Autoregressive Transformer

Jan 09, 2024

In-Context Prompt Editing For Conditional Audio Generation

Nov 01, 2023

Exploring Speech Enhancement for Low-resource Speech Synthesis

Sep 19, 2023