Alert button
Picture for Jinyu Li

Jinyu Li

Alert button

Factorized Neural Transducer for Efficient Language Model Adaptation

Add code
Bookmark button
Alert button
Oct 07, 2021
Xie Chen, Zhong Meng, Sarangarajan Parthasarathy, Jinyu Li

Figure 1 for Factorized Neural Transducer for Efficient Language Model Adaptation
Figure 2 for Factorized Neural Transducer for Efficient Language Model Adaptation
Figure 3 for Factorized Neural Transducer for Efficient Language Model Adaptation
Figure 4 for Factorized Neural Transducer for Efficient Language Model Adaptation
Viaarxiv icon

Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Oct 06, 2021
Zhong Meng, Yashesh Gaur, Naoyuki Kanda, Jinyu Li, Xie Chen, Yu Wu, Yifan Gong

Figure 1 for Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition
Figure 2 for Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition
Viaarxiv icon

Continuous Streaming Multi-Talker ASR with Dual-path Transducers

Add code
Bookmark button
Alert button
Sep 17, 2021
Desh Raj, Liang Lu, Zhuo Chen, Yashesh Gaur, Jinyu Li

Figure 1 for Continuous Streaming Multi-Talker ASR with Dual-path Transducers
Figure 2 for Continuous Streaming Multi-Talker ASR with Dual-path Transducers
Figure 3 for Continuous Streaming Multi-Talker ASR with Dual-path Transducers
Figure 4 for Continuous Streaming Multi-Talker ASR with Dual-path Transducers
Viaarxiv icon

A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems

Add code
Bookmark button
Alert button
Aug 17, 2021
Xiaoqiang Wang, Yanqing Liu, Sheng Zhao, Jinyu Li

Figure 1 for A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems
Figure 2 for A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems
Figure 3 for A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems
Figure 4 for A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems
Viaarxiv icon

A Configurable Multilingual Model is All You Need to Recognize All Languages

Add code
Bookmark button
Alert button
Jul 13, 2021
Long Zhou, Jinyu Li, Eric Sun, Shujie Liu

Figure 1 for A Configurable Multilingual Model is All You Need to Recognize All Languages
Figure 2 for A Configurable Multilingual Model is All You Need to Recognize All Languages
Figure 3 for A Configurable Multilingual Model is All You Need to Recognize All Languages
Figure 4 for A Configurable Multilingual Model is All You Need to Recognize All Languages
Viaarxiv icon

UniSpeech at scale: An Empirical Study of Pre-training Method on Large-Scale Speech Recognition Dataset

Add code
Bookmark button
Alert button
Jul 12, 2021
Chengyi Wang, Yu Wu, Shujie Liu, Jinyu Li, Yao Qian, Kenichi Kumatani, Furu Wei

Figure 1 for UniSpeech at scale: An Empirical Study of Pre-training Method on Large-Scale Speech Recognition Dataset
Figure 2 for UniSpeech at scale: An Empirical Study of Pre-training Method on Large-Scale Speech Recognition Dataset
Figure 3 for UniSpeech at scale: An Empirical Study of Pre-training Method on Large-Scale Speech Recognition Dataset
Figure 4 for UniSpeech at scale: An Empirical Study of Pre-training Method on Large-Scale Speech Recognition Dataset
Viaarxiv icon

Investigation of Practical Aspects of Single Channel Speech Separation for ASR

Add code
Bookmark button
Alert button
Jul 05, 2021
Jian Wu, Zhuo Chen, Sanyuan Chen, Yu Wu, Takuya Yoshioka, Naoyuki Kanda, Shujie Liu, Jinyu Li

Figure 1 for Investigation of Practical Aspects of Single Channel Speech Separation for ASR
Figure 2 for Investigation of Practical Aspects of Single Channel Speech Separation for ASR
Figure 3 for Investigation of Practical Aspects of Single Channel Speech Separation for ASR
Viaarxiv icon

Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Jun 04, 2021
Zhong Meng, Yu Wu, Naoyuki Kanda, Liang Lu, Xie Chen, Guoli Ye, Eric Sun, Jinyu Li, Yifan Gong

Figure 1 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Figure 2 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Figure 3 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Viaarxiv icon

On Addressing Practical Challenges for RNN-Transducer

Add code
Bookmark button
Alert button
May 04, 2021
Rui Zhao, Jian Xue, Jinyu Li, Wenning Wei, Lei He, Yifan Gong

Figure 1 for On Addressing Practical Challenges for RNN-Transducer
Figure 2 for On Addressing Practical Challenges for RNN-Transducer
Figure 3 for On Addressing Practical Challenges for RNN-Transducer
Figure 4 for On Addressing Practical Challenges for RNN-Transducer
Viaarxiv icon

Streaming Multi-talker Speech Recognition with Joint Speaker Identification

Add code
Bookmark button
Alert button
Apr 05, 2021
Liang Lu, Naoyuki Kanda, Jinyu Li, Yifan Gong

Figure 1 for Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Figure 2 for Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Figure 3 for Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Figure 4 for Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Viaarxiv icon