Stefano Squartini

Università Politecnica delle Marche

One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition

Oct 02, 2023

An enhanced system for the detection and active cancellation of snoring signals

Jul 31, 2023

A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment

Jul 28, 2023

The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios

Jul 14, 2023

An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings

May 29, 2023

End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations

Mar 21, 2023

Conversational Speech Separation: an Evaluation Study for Streaming Applications

May 31, 2022

Leveraging Speech Separation for Conversational Telephone Speaker Diarization

Apr 05, 2022

Learning Filterbanks for End-to-End Acoustic Beamforming

Nov 08, 2021

Deep Optimization of Parametric IIR Filters for Audio Equalization

Oct 05, 2021