Picture for Herman Kamper

Herman Kamper

Translating speech with just images

Add code
Jun 11, 2024
Viaarxiv icon

Visually Grounded Speech Models have a Mutual Exclusivity Bias

Add code
Mar 20, 2024
Figure 1 for Visually Grounded Speech Models have a Mutual Exclusivity Bias
Figure 2 for Visually Grounded Speech Models have a Mutual Exclusivity Bias
Figure 3 for Visually Grounded Speech Models have a Mutual Exclusivity Bias
Figure 4 for Visually Grounded Speech Models have a Mutual Exclusivity Bias
Viaarxiv icon

Revisiting speech segmentation and lexicon learning with better features

Add code
Jan 31, 2024
Viaarxiv icon

Voice Conversion for Stuttered Speech, Instruments, Unseen Languages and Textually Described Voices

Add code
Oct 12, 2023
Figure 1 for Voice Conversion for Stuttered Speech, Instruments, Unseen Languages and Textually Described Voices
Figure 2 for Voice Conversion for Stuttered Speech, Instruments, Unseen Languages and Textually Described Voices
Figure 3 for Voice Conversion for Stuttered Speech, Instruments, Unseen Languages and Textually Described Voices
Figure 4 for Voice Conversion for Stuttered Speech, Instruments, Unseen Languages and Textually Described Voices
Viaarxiv icon

Rhythm Modeling for Voice Conversion

Add code
Jul 12, 2023
Figure 1 for Rhythm Modeling for Voice Conversion
Figure 2 for Rhythm Modeling for Voice Conversion
Figure 3 for Rhythm Modeling for Voice Conversion
Figure 4 for Rhythm Modeling for Voice Conversion
Viaarxiv icon

Leveraging multilingual transfer for unsupervised semantic acoustic word embeddings

Add code
Jul 05, 2023
Figure 1 for Leveraging multilingual transfer for unsupervised semantic acoustic word embeddings
Figure 2 for Leveraging multilingual transfer for unsupervised semantic acoustic word embeddings
Figure 3 for Leveraging multilingual transfer for unsupervised semantic acoustic word embeddings
Figure 4 for Leveraging multilingual transfer for unsupervised semantic acoustic word embeddings
Viaarxiv icon

Disentanglement in a GAN for Unconditional Speech Synthesis

Add code
Jul 04, 2023
Figure 1 for Disentanglement in a GAN for Unconditional Speech Synthesis
Figure 2 for Disentanglement in a GAN for Unconditional Speech Synthesis
Figure 3 for Disentanglement in a GAN for Unconditional Speech Synthesis
Figure 4 for Disentanglement in a GAN for Unconditional Speech Synthesis
Viaarxiv icon

Visually grounded few-shot word learning in low-resource settings

Add code
Jun 21, 2023
Figure 1 for Visually grounded few-shot word learning in low-resource settings
Figure 2 for Visually grounded few-shot word learning in low-resource settings
Figure 3 for Visually grounded few-shot word learning in low-resource settings
Figure 4 for Visually grounded few-shot word learning in low-resource settings
Viaarxiv icon

Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili

Add code
Jun 01, 2023
Figure 1 for Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili
Figure 2 for Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili
Figure 3 for Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili
Figure 4 for Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili
Viaarxiv icon

Voice Conversion With Just Nearest Neighbors

Add code
May 30, 2023
Figure 1 for Voice Conversion With Just Nearest Neighbors
Figure 2 for Voice Conversion With Just Nearest Neighbors
Figure 3 for Voice Conversion With Just Nearest Neighbors
Viaarxiv icon