Xavier Alameda-Pineda

ROBOTLEARN

The Equalizer: Introducing Shape-Gain Decomposition in Neural Audio Codecs

Feb 17, 2026

Residual Tokens Enhance Masked Autoencoders for Speech Modeling

Jan 27, 2026

OpenSocInt: A Multi-modal Training Environment for Human-Aware Social Navigation

Jan 05, 2026

Modeling strategies for speech enhancement in the latent space of a neural audio codec

Oct 30, 2025

Posterior Transition Modeling for Unsupervised Diffusion-Based Speech Enhancement

Jul 03, 2025

AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder

Jan 09, 2025

Diffusion-based Unsupervised Audio-visual Speech Enhancement

Oct 04, 2024

Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking

Jul 16, 2024

MEGA: Masked Generative Autoencoder for Human Mesh Recovery

May 29, 2024

Socially Pertinent Robots in Gerontological Healthcare

Apr 11, 2024