Hannes Gamper

Make Some Noise: Towards LLM audio reasoning and generation using sound tokens

Mar 28, 2025

Quality Over Quantity? LLM-Based Curation for a Data-Efficient Audio-Video Foundation Model

Mar 12, 2025

Distillation and Pruning for Scalable Self-Supervised Representation-Based Speech Quality Assessment

Feb 07, 2025

Audio Entailment: Assessing Deductive Reasoning for Audio Understanding

Jul 25, 2024

Multi-label audio classification with a noisy zero-shot teacher

Jul 20, 2024

Gaussian Flow Bridges for Audio Domain Transfer with Unpaired Data

May 29, 2024

PAM: Prompting Audio-Language Models for Audio Quality Assessment

Feb 01, 2024

CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional Modeling

Dec 08, 2023

Adapting Frechet Audio Distance for Generative Music Evaluation

Nov 02, 2023

ICASSP 2023 Acoustic Echo Cancellation Challenge

Sep 22, 2023