Picture for Herman Kamper

Herman Kamper

The mutual exclusivity bias of bilingual visually grounded speech models

Add code
Jun 04, 2025
Viaarxiv icon

Spoken Language Modeling with Duration-Penalized Self-Supervised Units

Add code
May 29, 2025
Viaarxiv icon

Speech Recognition for Automatically Assessing Afrikaans and isiXhosa Preschool Oral Narratives

Add code
Jan 11, 2025
Viaarxiv icon

MARS6: A Small and Robust Hierarchical-Codec Text-to-Speech Model

Add code
Jan 10, 2025
Viaarxiv icon

Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming

Add code
Sep 22, 2024
Figure 1 for Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming
Figure 2 for Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming
Figure 3 for Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming
Figure 4 for Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming
Viaarxiv icon

Improved Visually Prompted Keyword Localisation in Real Low-Resource Settings

Add code
Sep 09, 2024
Figure 1 for Improved Visually Prompted Keyword Localisation in Real Low-Resource Settings
Figure 2 for Improved Visually Prompted Keyword Localisation in Real Low-Resource Settings
Figure 3 for Improved Visually Prompted Keyword Localisation in Real Low-Resource Settings
Figure 4 for Improved Visually Prompted Keyword Localisation in Real Low-Resource Settings
Viaarxiv icon

Spoken-Term Discovery using Discrete Speech Units

Add code
Aug 26, 2024
Viaarxiv icon

Translating speech with just images

Add code
Jun 11, 2024
Viaarxiv icon

Visually Grounded Speech Models have a Mutual Exclusivity Bias

Add code
Mar 20, 2024
Viaarxiv icon

Revisiting speech segmentation and lexicon learning with better features

Add code
Jan 31, 2024
Viaarxiv icon