Picture for Marco Matassoni

Marco Matassoni

The Warmup Dilemma: How Learning Rate Strategies Impact Speech-to-Text Model Convergence

Add code
May 29, 2025
Viaarxiv icon

FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian

Add code
May 28, 2025
Viaarxiv icon

MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages

Add code
Oct 01, 2024
Figure 1 for MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages
Figure 2 for MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages
Figure 3 for MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages
Figure 4 for MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages
Viaarxiv icon

L2 proficiency assessment using self-supervised speech representations

Add code
Nov 16, 2022
Viaarxiv icon

Proficiency assessment of L2 spoken English using wav2vec 2.0

Add code
Oct 24, 2022
Figure 1 for Proficiency assessment of L2 spoken English using wav2vec 2.0
Figure 2 for Proficiency assessment of L2 spoken English using wav2vec 2.0
Figure 3 for Proficiency assessment of L2 spoken English using wav2vec 2.0
Figure 4 for Proficiency assessment of L2 spoken English using wav2vec 2.0
Viaarxiv icon

Seed Words Based Data Selection for Language Model Adaptation

Add code
Jul 20, 2021
Figure 1 for Seed Words Based Data Selection for Language Model Adaptation
Figure 2 for Seed Words Based Data Selection for Language Model Adaptation
Figure 3 for Seed Words Based Data Selection for Language Model Adaptation
Figure 4 for Seed Words Based Data Selection for Language Model Adaptation
Viaarxiv icon

Mixtures of Deep Neural Experts for Automated Speech Scoring

Add code
Jun 23, 2021
Figure 1 for Mixtures of Deep Neural Experts for Automated Speech Scoring
Figure 2 for Mixtures of Deep Neural Experts for Automated Speech Scoring
Figure 3 for Mixtures of Deep Neural Experts for Automated Speech Scoring
Viaarxiv icon

Learning to Rank Microphones for Distant Speech Recognition

Add code
Apr 13, 2021
Figure 1 for Learning to Rank Microphones for Distant Speech Recognition
Figure 2 for Learning to Rank Microphones for Distant Speech Recognition
Figure 3 for Learning to Rank Microphones for Distant Speech Recognition
Figure 4 for Learning to Rank Microphones for Distant Speech Recognition
Viaarxiv icon

Experiments of ASR-based mispronunciation detection for children and adult English learners

Add code
Apr 13, 2021
Figure 1 for Experiments of ASR-based mispronunciation detection for children and adult English learners
Figure 2 for Experiments of ASR-based mispronunciation detection for children and adult English learners
Figure 3 for Experiments of ASR-based mispronunciation detection for children and adult English learners
Viaarxiv icon

TLT-school: a Corpus of Non Native Children Speech

Add code
Jan 22, 2020
Figure 1 for TLT-school: a Corpus of Non Native Children Speech
Figure 2 for TLT-school: a Corpus of Non Native Children Speech
Figure 3 for TLT-school: a Corpus of Non Native Children Speech
Figure 4 for TLT-school: a Corpus of Non Native Children Speech
Viaarxiv icon