Jinyu Li

Fred

Continuous Streaming Multi-Talker ASR with Dual-path Transducers

Sep 17, 2021

A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems

Aug 17, 2021

A Configurable Multilingual Model is All You Need to Recognize All Languages

Jul 13, 2021

UniSpeech at scale: An Empirical Study of Pre-training Method on Large-Scale Speech Recognition Dataset

Jul 12, 2021

Investigation of Practical Aspects of Single Channel Speech Separation for ASR

Jul 05, 2021

Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition

Jun 04, 2021

On Addressing Practical Challenges for RNN-Transducer

May 04, 2021

Streaming Multi-talker Speech Recognition with Joint Speaker Identification

Apr 05, 2021

Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition

Feb 02, 2021

Streaming end-to-end multi-talker speech recognition

Nov 26, 2020