Picture for Yashesh Gaur

Yashesh Gaur

Jack

Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition

Add code
Oct 14, 2021
Figure 1 for Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition
Figure 2 for Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition
Viaarxiv icon

Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR

Add code
Oct 07, 2021
Figure 1 for Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR
Figure 2 for Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR
Figure 3 for Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR
Viaarxiv icon

Continuous Streaming Multi-Talker ASR with Dual-path Transducers

Add code
Sep 17, 2021
Figure 1 for Continuous Streaming Multi-Talker ASR with Dual-path Transducers
Figure 2 for Continuous Streaming Multi-Talker ASR with Dual-path Transducers
Figure 3 for Continuous Streaming Multi-Talker ASR with Dual-path Transducers
Figure 4 for Continuous Streaming Multi-Talker ASR with Dual-path Transducers
Viaarxiv icon

A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio

Add code
Jul 06, 2021
Figure 1 for A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio
Figure 2 for A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio
Figure 3 for A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio
Figure 4 for A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio
Viaarxiv icon

Dynamic Gradient Aggregation for Federated Domain Adaptation

Add code
Jun 14, 2021
Figure 1 for Dynamic Gradient Aggregation for Federated Domain Adaptation
Figure 2 for Dynamic Gradient Aggregation for Federated Domain Adaptation
Figure 3 for Dynamic Gradient Aggregation for Federated Domain Adaptation
Figure 4 for Dynamic Gradient Aggregation for Federated Domain Adaptation
Viaarxiv icon

Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone

Add code
Apr 12, 2021
Figure 1 for Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone
Figure 2 for Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone
Figure 3 for Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone
Viaarxiv icon

End-to-End Speaker-Attributed ASR with Transformer

Add code
Apr 05, 2021
Figure 1 for End-to-End Speaker-Attributed ASR with Transformer
Figure 2 for End-to-End Speaker-Attributed ASR with Transformer
Figure 3 for End-to-End Speaker-Attributed ASR with Transformer
Figure 4 for End-to-End Speaker-Attributed ASR with Transformer
Viaarxiv icon

Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition

Add code
Feb 02, 2021
Figure 1 for Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition
Figure 2 for Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition
Viaarxiv icon

Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings

Add code
Jan 06, 2021
Figure 1 for Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings
Figure 2 for Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings
Viaarxiv icon

Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR

Add code
Nov 03, 2020
Figure 1 for Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Figure 2 for Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Figure 3 for Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Viaarxiv icon