Picture for Heinrich Dinkel

Heinrich Dinkel

MiDashengLM: Efficient Audio Understanding with General Audio Captions

Add code
Aug 06, 2025
Viaarxiv icon

Efficient Speech Enhancement via Embeddings from Pre-trained Generative Audioencoders

Add code
Jun 13, 2025
Viaarxiv icon

GLAP: General contrastive audio-text pretraining across domains and languages

Add code
Jun 12, 2025
Viaarxiv icon

X-ARES: A Comprehensive Framework for Assessing Audio Encoder Performance

Add code
May 22, 2025
Viaarxiv icon

Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering

Add code
Mar 17, 2025
Figure 1 for Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering
Figure 2 for Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering
Figure 3 for Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering
Figure 4 for Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering
Viaarxiv icon

The ICME 2025 Audio Encoder Capability Challenge

Add code
Jan 25, 2025
Viaarxiv icon

Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding

Add code
Jun 19, 2024
Viaarxiv icon

Bridging Language Gaps in Audio-Text Retrieval

Add code
Jun 11, 2024
Figure 1 for Bridging Language Gaps in Audio-Text Retrieval
Figure 2 for Bridging Language Gaps in Audio-Text Retrieval
Figure 3 for Bridging Language Gaps in Audio-Text Retrieval
Figure 4 for Bridging Language Gaps in Audio-Text Retrieval
Viaarxiv icon

Scaling up masked audio encoder learning for general audio classification

Add code
Jun 11, 2024
Figure 1 for Scaling up masked audio encoder learning for general audio classification
Figure 2 for Scaling up masked audio encoder learning for general audio classification
Figure 3 for Scaling up masked audio encoder learning for general audio classification
Figure 4 for Scaling up masked audio encoder learning for general audio classification
Viaarxiv icon

CED: Consistent ensemble distillation for audio tagging

Add code
Sep 08, 2023
Figure 1 for CED: Consistent ensemble distillation for audio tagging
Figure 2 for CED: Consistent ensemble distillation for audio tagging
Figure 3 for CED: Consistent ensemble distillation for audio tagging
Figure 4 for CED: Consistent ensemble distillation for audio tagging
Viaarxiv icon