Speaker Diarization


Speaker diarization is the process of segmenting an audio recording into speech regions and clustering those regions by speaker, answering the question of "who spoke when".
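
The clustering half of that definition can be illustrated with a minimal sketch. It assumes the audio has already been split into segments and that each segment comes with a non-zero speaker embedding vector; the function names, the greedy assignment strategy, and the 0.8 cosine threshold are illustrative assumptions, not the method of any paper listed here.

```python
import math


def cosine(a, b):
    """Cosine similarity between two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def diarize(segments, threshold=0.8):
    """Greedily assign each (start, end, embedding) segment to a speaker.

    A segment joins the most similar existing speaker if the cosine
    similarity to that speaker's accumulated embedding reaches the
    (hypothetical) threshold; otherwise it starts a new speaker.
    Returns a list of (start, end, speaker_index) tuples.
    """
    centroids = []  # running sum of embeddings per speaker
    labels = []
    for start, end, emb in segments:
        best, best_sim = None, threshold
        for i, centroid in enumerate(centroids):
            sim = cosine(emb, centroid)
            if sim >= best_sim:
                best, best_sim = i, sim
        if best is None:
            # No speaker is similar enough: open a new cluster.
            centroids.append(list(emb))
            best = len(centroids) - 1
        else:
            # Accumulate the embedding; the sum has the same direction as the mean.
            centroids[best] = [x + y for x, y in zip(centroids[best], emb)]
        labels.append((start, end, best))
    return labels


# Toy 2-D embeddings standing in for real speaker embeddings (made-up values).
segments = [
    (0.0, 1.5, [1.0, 0.1]),
    (1.5, 3.0, [0.1, 1.0]),
    (3.0, 4.0, [0.9, 0.2]),
    (4.0, 5.5, [0.0, 1.0]),
]
result = diarize(segments)
for start, end, speaker in result:
    print(f"{start:.1f}-{end:.1f}s: speaker {speaker}")
```

Real systems typically replace the toy vectors with learned speaker embeddings and use more robust clustering, and they also have to handle overlapping speech, which this sketch ignores.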

Speaker Diarization for Low-Resource Languages Through Wav2vec Fine-Tuning

Apr 23, 2025

HPP-Voice: A Large-Scale Evaluation of Speech Embeddings for Multi-Phenotypic Classification

May 22, 2025

BUT System for the MLC-SLM Challenge

Jun 16, 2025

AISHELL-5: The First Open-Source In-Car Multi-Channel Multi-Speaker Speech Dataset for Automatic Speech Diarization and Recognition

May 29, 2025

Mitigating Non-Target Speaker Bias in Guided Speaker Embedding

Jun 14, 2025

SeniorTalk: A Chinese Conversation Dataset with Rich Annotations for Super-Aged Seniors

Mar 20, 2025

Language Modelling for Speaker Diarization in Telephonic Interviews

Jan 28, 2025

SCDiar: a streaming diarization system based on speaker change detection and speech recognition

Jan 28, 2025

Afrispeech-Dialog: A Benchmark Dataset for Spontaneous English Conversations in Healthcare and Beyond

Feb 06, 2025

Playing with Voices: Tabletop Role-Playing Game Recordings as a Diarization Challenge

Feb 18, 2025