Dominik Klement

Factorized RVQ-GAN For Disentangled Speech Tokenization

Jun 18, 2025
Via arXiv

BUT System for the MLC-SLM Challenge

Jun 16, 2025
Via arXiv

DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition

Dec 30, 2024
Via arXiv

Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization

Nov 04, 2024
Via arXiv

Target Speaker ASR with Whisper

Sep 14, 2024
Via arXiv

Discriminative Training of VBx Diarization

Oct 04, 2023
Via arXiv