Alert button
Picture for Yifan Gong

Yifan Gong

Alert button

Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Nov 03, 2020
Zhong Meng, Sarangarajan Parthasarathy, Eric Sun, Yashesh Gaur, Naoyuki Kanda, Liang Lu, Xie Chen, Rui Zhao, Jinyu Li, Yifan Gong

Figure 1 for Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Figure 2 for Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Figure 3 for Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Figure 4 for Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Viaarxiv icon

On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer

Add code
Bookmark button
Alert button
Oct 23, 2020
Liang Lu, Zhong Meng, Naoyuki Kanda, Jinyu Li, Yifan Gong

Figure 1 for On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer
Figure 2 for On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer
Figure 3 for On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer
Figure 4 for On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer
Viaarxiv icon

Speaker Separation Using Speaker Inventories and Estimated Speech

Add code
Bookmark button
Alert button
Oct 20, 2020
Peidong Wang, Zhuo Chen, DeLiang Wang, Jinyu Li, Yifan Gong

Figure 1 for Speaker Separation Using Speaker Inventories and Estimated Speech
Figure 2 for Speaker Separation Using Speaker Inventories and Estimated Speech
Figure 3 for Speaker Separation Using Speaker Inventories and Estimated Speech
Figure 4 for Speaker Separation Using Speaker Inventories and Estimated Speech
Viaarxiv icon

Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability

Add code
Bookmark button
Alert button
Jul 30, 2020
Jinyu Li, Rui Zhao, Zhong Meng, Yanqing Liu, Wenning Wei, Sarangarajan Parthasarathy, Vadim Mazalov, Zhenghao Wang, Lei He, Sheng Zhao, Yifan Gong

Figure 1 for Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability
Figure 2 for Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability
Figure 3 for Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability
Figure 4 for Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability
Viaarxiv icon

Exploring Transformers for Large-Scale Speech Recognition

Add code
Bookmark button
Alert button
May 19, 2020
Liang Lu, Changliang Liu, Jinyu Li, Yifan Gong

Figure 1 for Exploring Transformers for Large-Scale Speech Recognition
Figure 2 for Exploring Transformers for Large-Scale Speech Recognition
Figure 3 for Exploring Transformers for Large-Scale Speech Recognition
Figure 4 for Exploring Transformers for Large-Scale Speech Recognition
Viaarxiv icon

Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR

Add code
Bookmark button
Alert button
May 15, 2020
Hirofumi Inaguma, Yashesh Gaur, Liang Lu, Jinyu Li, Yifan Gong

Figure 1 for Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR
Figure 2 for Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR
Figure 3 for Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR
Figure 4 for Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR
Viaarxiv icon

Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition

Add code
Bookmark button
Alert button
May 01, 2020
Hu Hu, Rui Zhao, Jinyu Li, Liang Lu, Yifan Gong

Figure 1 for Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition
Figure 2 for Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition
Figure 3 for Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition
Figure 4 for Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition
Viaarxiv icon

L-Vector: Neural Label Embedding for Domain Adaptation

Add code
Bookmark button
Alert button
Apr 25, 2020
Zhong Meng, Hu Hu, Jinyu Li, Changliang Liu, Yan Huang, Yifan Gong, Chin-Hui Lee

Figure 1 for L-Vector: Neural Label Embedding for Domain Adaptation
Figure 2 for L-Vector: Neural Label Embedding for Domain Adaptation
Figure 3 for L-Vector: Neural Label Embedding for Domain Adaptation
Viaarxiv icon

High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model

Add code
Bookmark button
Alert button
Mar 17, 2020
Jinyu Li, Rui Zhao, Eric Sun, Jeremy H. M. Wong, Amit Das, Zhong Meng, Yifan Gong

Figure 1 for High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model
Figure 2 for High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model
Figure 3 for High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model
Figure 4 for High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model
Viaarxiv icon