Picture for Giovanni Morrone

Giovanni Morrone

A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR

Add code
Sep 09, 2024
Figure 1 for A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR
Figure 2 for A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR
Viaarxiv icon

Exploring Spoken Language Identification Strategies for Automatic Transcription of Multilingual Broadcast and Institutional Speech

Add code
Jun 13, 2024
Figure 1 for Exploring Spoken Language Identification Strategies for Automatic Transcription of Multilingual Broadcast and Institutional Speech
Figure 2 for Exploring Spoken Language Identification Strategies for Automatic Transcription of Multilingual Broadcast and Institutional Speech
Figure 3 for Exploring Spoken Language Identification Strategies for Automatic Transcription of Multilingual Broadcast and Institutional Speech
Viaarxiv icon

An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings

Add code
May 29, 2023
Viaarxiv icon

End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations

Add code
Mar 21, 2023
Viaarxiv icon

Conversational Speech Separation: an Evaluation Study for Streaming Applications

Add code
May 31, 2022
Figure 1 for Conversational Speech Separation: an Evaluation Study for Streaming Applications
Figure 2 for Conversational Speech Separation: an Evaluation Study for Streaming Applications
Figure 3 for Conversational Speech Separation: an Evaluation Study for Streaming Applications
Figure 4 for Conversational Speech Separation: an Evaluation Study for Streaming Applications
Viaarxiv icon

Leveraging Speech Separation for Conversational Telephone Speaker Diarization

Add code
Apr 05, 2022
Figure 1 for Leveraging Speech Separation for Conversational Telephone Speaker Diarization
Figure 2 for Leveraging Speech Separation for Conversational Telephone Speaker Diarization
Figure 3 for Leveraging Speech Separation for Conversational Telephone Speaker Diarization
Figure 4 for Leveraging Speech Separation for Conversational Telephone Speaker Diarization
Viaarxiv icon

Audio-Visual Speech Inpainting with Deep Learning

Add code
Oct 09, 2020
Figure 1 for Audio-Visual Speech Inpainting with Deep Learning
Figure 2 for Audio-Visual Speech Inpainting with Deep Learning
Figure 3 for Audio-Visual Speech Inpainting with Deep Learning
Viaarxiv icon

Audio-Visual Target Speaker Extraction on Multi-Talker Environment using Event-Driven Cameras

Add code
Dec 05, 2019
Figure 1 for Audio-Visual Target Speaker Extraction on Multi-Talker Environment using Event-Driven Cameras
Figure 2 for Audio-Visual Target Speaker Extraction on Multi-Talker Environment using Event-Driven Cameras
Figure 3 for Audio-Visual Target Speaker Extraction on Multi-Talker Environment using Event-Driven Cameras
Viaarxiv icon

Joined Audio-Visual Speech Enhancement and Recognition in the Cocktail Party: The Tug Of War Between Enhancement and Recognition Losses

Add code
Apr 16, 2019
Figure 1 for Joined Audio-Visual Speech Enhancement and Recognition in the Cocktail Party: The Tug Of War Between Enhancement and Recognition Losses
Figure 2 for Joined Audio-Visual Speech Enhancement and Recognition in the Cocktail Party: The Tug Of War Between Enhancement and Recognition Losses
Figure 3 for Joined Audio-Visual Speech Enhancement and Recognition in the Cocktail Party: The Tug Of War Between Enhancement and Recognition Losses
Figure 4 for Joined Audio-Visual Speech Enhancement and Recognition in the Cocktail Party: The Tug Of War Between Enhancement and Recognition Losses
Viaarxiv icon

Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments

Add code
Nov 06, 2018
Figure 1 for Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
Figure 2 for Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
Figure 3 for Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
Figure 4 for Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
Viaarxiv icon