Picture for Gerhard Widmer

Gerhard Widmer

TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining

Add code
May 12, 2025
Viaarxiv icon

How to Infer Repeat Structures in MIDI Performances

Add code
May 08, 2025
Viaarxiv icon

Pairing Real-Time Piano Transcription with Symbol-level Tracking for Precise and Robust Score Following

Add code
May 08, 2025
Viaarxiv icon

Creating a Good Teacher for Knowledge Distillation in Acoustic Scene Classification

Add code
Mar 14, 2025
Viaarxiv icon

Exploring Performance-Complexity Trade-Offs in Sound Event Detection

Add code
Mar 14, 2025
Viaarxiv icon

Estimating Musical Surprisal in Audio

Add code
Jan 13, 2025
Viaarxiv icon

Language Models for Music Medicine Generation

Add code
Nov 13, 2024
Viaarxiv icon

Effective Pre-Training of Audio Transformers for Sound Event Detection

Add code
Sep 14, 2024
Figure 1 for Effective Pre-Training of Audio Transformers for Sound Event Detection
Figure 2 for Effective Pre-Training of Audio Transformers for Sound Event Detection
Figure 3 for Effective Pre-Training of Audio Transformers for Sound Event Detection
Viaarxiv icon

Estimated Audio-Caption Correspondences Improve Language-Based Audio Retrieval

Add code
Aug 21, 2024
Viaarxiv icon

Improving Query-by-Vocal Imitation with Contrastive Learning and Audio Pretraining

Add code
Aug 21, 2024
Viaarxiv icon