Picture for Enrico Zovato

Enrico Zovato

A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR

Add code
Sep 09, 2024
Figure 1 for A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR
Figure 2 for A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR
Viaarxiv icon

Exploring Spoken Language Identification Strategies for Automatic Transcription of Multilingual Broadcast and Institutional Speech

Add code
Jun 13, 2024
Figure 1 for Exploring Spoken Language Identification Strategies for Automatic Transcription of Multilingual Broadcast and Institutional Speech
Figure 2 for Exploring Spoken Language Identification Strategies for Automatic Transcription of Multilingual Broadcast and Institutional Speech
Figure 3 for Exploring Spoken Language Identification Strategies for Automatic Transcription of Multilingual Broadcast and Institutional Speech
Viaarxiv icon

An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings

Add code
May 29, 2023
Viaarxiv icon

End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations

Add code
Mar 21, 2023
Viaarxiv icon

Conversational Speech Separation: an Evaluation Study for Streaming Applications

Add code
May 31, 2022
Figure 1 for Conversational Speech Separation: an Evaluation Study for Streaming Applications
Figure 2 for Conversational Speech Separation: an Evaluation Study for Streaming Applications
Figure 3 for Conversational Speech Separation: an Evaluation Study for Streaming Applications
Figure 4 for Conversational Speech Separation: an Evaluation Study for Streaming Applications
Viaarxiv icon

Leveraging Speech Separation for Conversational Telephone Speaker Diarization

Add code
Apr 05, 2022
Figure 1 for Leveraging Speech Separation for Conversational Telephone Speaker Diarization
Figure 2 for Leveraging Speech Separation for Conversational Telephone Speaker Diarization
Figure 3 for Leveraging Speech Separation for Conversational Telephone Speaker Diarization
Figure 4 for Leveraging Speech Separation for Conversational Telephone Speaker Diarization
Viaarxiv icon

Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning

Add code
Feb 10, 2021
Figure 1 for Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning
Figure 2 for Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning
Figure 3 for Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning
Figure 4 for Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning
Viaarxiv icon