Picture for Alessio Brutti

Alessio Brutti

The Warmup Dilemma: How Learning Rate Strategies Impact Speech-to-Text Model Convergence

Add code
May 29, 2025
Viaarxiv icon

FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian

Add code
May 28, 2025
Viaarxiv icon

An Effective Training Framework for Light-Weight Automatic Speech Recognition Models

Add code
May 22, 2025
Viaarxiv icon

Scaling and Enhancing LLM-based AVSR: A Sparse Mixture of Projectors Approach

Add code
May 21, 2025
Viaarxiv icon

Granary: Speech Recognition and Translation Dataset in 25 European Languages

Add code
May 19, 2025
Viaarxiv icon

MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages

Add code
Oct 01, 2024
Figure 1 for MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages
Figure 2 for MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages
Figure 3 for MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages
Figure 4 for MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages
Viaarxiv icon

Federating Dynamic Models using Early-Exit Architectures for Automatic Speech Recognition on Heterogeneous Clients

Add code
May 27, 2024
Viaarxiv icon

Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture of Adapters

Add code
Feb 01, 2024
Viaarxiv icon

Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers

Add code
Dec 07, 2023
Viaarxiv icon

Continual Contrastive Spoken Language Understanding

Add code
Oct 04, 2023
Figure 1 for Continual Contrastive Spoken Language Understanding
Figure 2 for Continual Contrastive Spoken Language Understanding
Figure 3 for Continual Contrastive Spoken Language Understanding
Figure 4 for Continual Contrastive Spoken Language Understanding
Viaarxiv icon