Picture for Desh Raj

Desh Raj

Listening to Multi-talker Conversations: Modular and End-to-end Perspectives

Add code
Feb 14, 2024
Viaarxiv icon

On Speaker Attribution with SURT

Add code
Jan 28, 2024
Viaarxiv icon

Updated Corpora and Benchmarks for Long-Form Speech Recognition

Add code
Sep 26, 2023
Viaarxiv icon

Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition

Add code
Sep 26, 2023
Figure 1 for Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Figure 2 for Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Figure 3 for Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Figure 4 for Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Viaarxiv icon

Training dynamic models using early exits for automatic speech recognition on resource-constrained devices

Add code
Sep 18, 2023
Figure 1 for Training dynamic models using early exits for automatic speech recognition on resource-constrained devices
Figure 2 for Training dynamic models using early exits for automatic speech recognition on resource-constrained devices
Figure 3 for Training dynamic models using early exits for automatic speech recognition on resource-constrained devices
Figure 4 for Training dynamic models using early exits for automatic speech recognition on resource-constrained devices
Viaarxiv icon

The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios

Add code
Jul 14, 2023
Figure 1 for The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios
Figure 2 for The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios
Figure 3 for The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios
Figure 4 for The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios
Viaarxiv icon

SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition

Add code
Jun 18, 2023
Figure 1 for SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition
Figure 2 for SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition
Figure 3 for SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition
Figure 4 for SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition
Viaarxiv icon

GPU-accelerated Guided Source Separation for Meeting Transcription

Add code
Dec 10, 2022
Figure 1 for GPU-accelerated Guided Source Separation for Meeting Transcription
Figure 2 for GPU-accelerated Guided Source Separation for Meeting Transcription
Figure 3 for GPU-accelerated Guided Source Separation for Meeting Transcription
Figure 4 for GPU-accelerated Guided Source Separation for Meeting Transcription
Viaarxiv icon

Adapting self-supervised models to multi-talker speech recognition using speaker embeddings

Nov 01, 2022
Figure 1 for Adapting self-supervised models to multi-talker speech recognition using speaker embeddings
Figure 2 for Adapting self-supervised models to multi-talker speech recognition using speaker embeddings
Figure 3 for Adapting self-supervised models to multi-talker speech recognition using speaker embeddings
Figure 4 for Adapting self-supervised models to multi-talker speech recognition using speaker embeddings
Viaarxiv icon

Leveraging Speech Separation for Conversational Telephone Speaker Diarization

Add code
Apr 05, 2022
Figure 1 for Leveraging Speech Separation for Conversational Telephone Speaker Diarization
Figure 2 for Leveraging Speech Separation for Conversational Telephone Speaker Diarization
Figure 3 for Leveraging Speech Separation for Conversational Telephone Speaker Diarization
Figure 4 for Leveraging Speech Separation for Conversational Telephone Speaker Diarization
Viaarxiv icon