Picture for Sanjeev Khudanpur

Sanjeev Khudanpur

Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation

Add code
Jul 14, 2024
Viaarxiv icon

Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment

Add code
Jun 17, 2024
Figure 1 for Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment
Figure 2 for Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment
Figure 3 for Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment
Figure 4 for Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment
Viaarxiv icon

Acoustic modeling for Overlapping Speech Recognition: JHU Chime-5 Challenge System

Add code
May 17, 2024
Viaarxiv icon

Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages

Add code
May 08, 2024
Viaarxiv icon

On Speaker Attribution with SURT

Add code
Jan 28, 2024
Viaarxiv icon

Enhancing Code-switching Speech Recognition with Interactive Language Biases

Add code
Sep 29, 2023
Figure 1 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 2 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 3 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 4 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Viaarxiv icon

Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization

Add code
Sep 27, 2023
Figure 1 for Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization
Figure 2 for Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization
Figure 3 for Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization
Figure 4 for Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization
Viaarxiv icon

Speech collage: code-switched audio generation by collaging monolingual corpora

Add code
Sep 27, 2023
Figure 1 for Speech collage: code-switched audio generation by collaging monolingual corpora
Figure 2 for Speech collage: code-switched audio generation by collaging monolingual corpora
Figure 3 for Speech collage: code-switched audio generation by collaging monolingual corpora
Figure 4 for Speech collage: code-switched audio generation by collaging monolingual corpora
Viaarxiv icon

Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition

Add code
Sep 26, 2023
Figure 1 for Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Figure 2 for Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Figure 3 for Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Figure 4 for Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Viaarxiv icon

The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios

Add code
Jul 14, 2023
Figure 1 for The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios
Figure 2 for The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios
Figure 3 for The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios
Figure 4 for The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios
Viaarxiv icon