Picture for Andrea Fanelli

Andrea Fanelli

Decomposing multimodal embedding spaces with group-sparse autoencoders

Add code
Jan 27, 2026
Viaarxiv icon

Towards Mitigating Hallucinations in Large Vision-Language Models by Refining Textual Embeddings

Add code
Nov 07, 2025
Viaarxiv icon

Audio-Visual Speech Separation via Bottleneck Iterative Network

Add code
Jul 09, 2025
Figure 1 for Audio-Visual Speech Separation via Bottleneck Iterative Network
Figure 2 for Audio-Visual Speech Separation via Bottleneck Iterative Network
Figure 3 for Audio-Visual Speech Separation via Bottleneck Iterative Network
Figure 4 for Audio-Visual Speech Separation via Bottleneck Iterative Network
Viaarxiv icon

Are Deep Speech Denoising Models Robust to Adversarial Noise?

Add code
Mar 14, 2025
Viaarxiv icon

XAttnMark: Learning Robust Audio Watermarking with Cross-Attention

Add code
Feb 07, 2025
Figure 1 for XAttnMark: Learning Robust Audio Watermarking with Cross-Attention
Figure 2 for XAttnMark: Learning Robust Audio Watermarking with Cross-Attention
Figure 3 for XAttnMark: Learning Robust Audio Watermarking with Cross-Attention
Figure 4 for XAttnMark: Learning Robust Audio Watermarking with Cross-Attention
Viaarxiv icon

AudioMiXR: Spatial Audio Object Manipulation with 6DoF for Sound Design in Augmented Reality

Add code
Feb 05, 2025
Figure 1 for AudioMiXR: Spatial Audio Object Manipulation with 6DoF for Sound Design in Augmented Reality
Figure 2 for AudioMiXR: Spatial Audio Object Manipulation with 6DoF for Sound Design in Augmented Reality
Figure 3 for AudioMiXR: Spatial Audio Object Manipulation with 6DoF for Sound Design in Augmented Reality
Figure 4 for AudioMiXR: Spatial Audio Object Manipulation with 6DoF for Sound Design in Augmented Reality
Viaarxiv icon

Accent Conversion with Articulatory Representations

Add code
Jun 10, 2024
Figure 1 for Accent Conversion with Articulatory Representations
Figure 2 for Accent Conversion with Articulatory Representations
Figure 3 for Accent Conversion with Articulatory Representations
Figure 4 for Accent Conversion with Articulatory Representations
Viaarxiv icon

Low latency transformers for speech processing

Add code
Feb 27, 2023
Figure 1 for Low latency transformers for speech processing
Figure 2 for Low latency transformers for speech processing
Figure 3 for Low latency transformers for speech processing
Figure 4 for Low latency transformers for speech processing
Viaarxiv icon