Picture for Lukas Burget

Lukas Burget

Challenging margin-based speaker embedding extractors by using the variational information bottleneck

Add code
Jun 18, 2024
Viaarxiv icon

DiaCorrect: Error Correction Back-end For Speaker Diarization

Add code
Sep 15, 2023
Figure 1 for DiaCorrect: Error Correction Back-end For Speaker Diarization
Figure 2 for DiaCorrect: Error Correction Back-end For Speaker Diarization
Figure 3 for DiaCorrect: Error Correction Back-end For Speaker Diarization
Figure 4 for DiaCorrect: Error Correction Back-end For Speaker Diarization
Viaarxiv icon

Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization

Add code
May 23, 2023
Figure 1 for Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization
Figure 2 for Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization
Figure 3 for Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization
Viaarxiv icon

Stabilized training of joint energy-based models and their practical applications

Add code
Mar 07, 2023
Figure 1 for Stabilized training of joint energy-based models and their practical applications
Figure 2 for Stabilized training of joint energy-based models and their practical applications
Figure 3 for Stabilized training of joint energy-based models and their practical applications
Figure 4 for Stabilized training of joint energy-based models and their practical applications
Viaarxiv icon

Speech-based emotion recognition with self-supervised models using attentive channel-wise correlations and label smoothing

Add code
Nov 03, 2022
Figure 1 for Speech-based emotion recognition with self-supervised models using attentive channel-wise correlations and label smoothing
Figure 2 for Speech-based emotion recognition with self-supervised models using attentive channel-wise correlations and label smoothing
Figure 3 for Speech-based emotion recognition with self-supervised models using attentive channel-wise correlations and label smoothing
Figure 4 for Speech-based emotion recognition with self-supervised models using attentive channel-wise correlations and label smoothing
Viaarxiv icon

Extracting speaker and emotion information from self-supervised speech models via channel-wise correlations

Add code
Oct 15, 2022
Figure 1 for Extracting speaker and emotion information from self-supervised speech models via channel-wise correlations
Figure 2 for Extracting speaker and emotion information from self-supervised speech models via channel-wise correlations
Figure 3 for Extracting speaker and emotion information from self-supervised speech models via channel-wise correlations
Figure 4 for Extracting speaker and emotion information from self-supervised speech models via channel-wise correlations
Viaarxiv icon

An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification

Add code
Oct 03, 2022
Figure 1 for An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification
Figure 2 for An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification
Figure 3 for An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification
Figure 4 for An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification
Viaarxiv icon

Analyzing speaker verification embedding extractors and back-ends under language and channel mismatch

Add code
Mar 19, 2022
Figure 1 for Analyzing speaker verification embedding extractors and back-ends under language and channel mismatch
Figure 2 for Analyzing speaker verification embedding extractors and back-ends under language and channel mismatch
Figure 3 for Analyzing speaker verification embedding extractors and back-ends under language and channel mismatch
Figure 4 for Analyzing speaker verification embedding extractors and back-ends under language and channel mismatch
Viaarxiv icon

DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction

Add code
Dec 27, 2021
Figure 1 for DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction
Figure 2 for DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction
Figure 3 for DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction
Figure 4 for DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction
Viaarxiv icon

Speaker embeddings by modeling channel-wise correlations

Add code
Apr 06, 2021
Figure 1 for Speaker embeddings by modeling channel-wise correlations
Viaarxiv icon