Herman Kamper

Analyzing and Improving Speaker Similarity Assessment for Speech Synthesis

Jul 02, 2025

The mutual exclusivity bias of bilingual visually grounded speech models

Jun 04, 2025

Spoken Language Modeling with Duration-Penalized Self-Supervised Units

May 29, 2025

Speech Recognition for Automatically Assessing Afrikaans and isiXhosa Preschool Oral Narratives

Jan 11, 2025

MARS6: A Small and Robust Hierarchical-Codec Text-to-Speech Model

Jan 10, 2025

Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming

Sep 22, 2024

Improved Visually Prompted Keyword Localisation in Real Low-Resource Settings

Sep 09, 2024

Spoken-Term Discovery using Discrete Speech Units

Aug 26, 2024

Translating speech with just images

Jun 11, 2024

Visually Grounded Speech Models have a Mutual Exclusivity Bias

Mar 20, 2024