Alert button
Picture for Yifan Gong

Yifan Gong

Alert button

Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition

Add code
Bookmark button
Alert button
Oct 10, 2021
Guoli Ye, Vadim Mazalov, Jinyu Li, Yifan Gong

Figure 1 for Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition
Figure 2 for Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition
Figure 3 for Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition
Figure 4 for Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition
Viaarxiv icon

Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Oct 06, 2021
Zhong Meng, Yashesh Gaur, Naoyuki Kanda, Jinyu Li, Xie Chen, Yu Wu, Yifan Gong

Figure 1 for Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition
Figure 2 for Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition
Viaarxiv icon

Diarisation using location tracking with agglomerative clustering

Add code
Bookmark button
Alert button
Sep 24, 2021
Jeremy H. M. Wong, Igor Abramovski, Xiong Xiao, Yifan Gong

Figure 1 for Diarisation using location tracking with agglomerative clustering
Figure 2 for Diarisation using location tracking with agglomerative clustering
Figure 3 for Diarisation using location tracking with agglomerative clustering
Viaarxiv icon

Joint speaker diarisation and tracking in switching state-space model

Add code
Bookmark button
Alert button
Sep 23, 2021
Jeremy H. M. Wong, Yifan Gong

Figure 1 for Joint speaker diarisation and tracking in switching state-space model
Figure 2 for Joint speaker diarisation and tracking in switching state-space model
Figure 3 for Joint speaker diarisation and tracking in switching state-space model
Figure 4 for Joint speaker diarisation and tracking in switching state-space model
Viaarxiv icon

Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search

Add code
Bookmark button
Alert button
Aug 18, 2021
Zheng Zhan, Yifan Gong, Pu Zhao, Geng Yuan, Wei Niu, Yushu Wu, Tianyun Zhang, Malith Jayaweera, David Kaeli, Bin Ren, Xue Lin, Yanzhi Wang

Figure 1 for Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search
Figure 2 for Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search
Figure 3 for Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search
Figure 4 for Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search
Viaarxiv icon

Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Jun 04, 2021
Zhong Meng, Yu Wu, Naoyuki Kanda, Liang Lu, Xie Chen, Guoli Ye, Eric Sun, Jinyu Li, Yifan Gong

Figure 1 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Figure 2 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Figure 3 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Viaarxiv icon

On Addressing Practical Challenges for RNN-Transducer

Add code
Bookmark button
Alert button
May 04, 2021
Rui Zhao, Jian Xue, Jinyu Li, Wenning Wei, Lei He, Yifan Gong

Figure 1 for On Addressing Practical Challenges for RNN-Transducer
Figure 2 for On Addressing Practical Challenges for RNN-Transducer
Figure 3 for On Addressing Practical Challenges for RNN-Transducer
Figure 4 for On Addressing Practical Challenges for RNN-Transducer
Viaarxiv icon

Streaming Multi-talker Speech Recognition with Joint Speaker Identification

Add code
Bookmark button
Alert button
Apr 05, 2021
Liang Lu, Naoyuki Kanda, Jinyu Li, Yifan Gong

Figure 1 for Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Figure 2 for Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Figure 3 for Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Figure 4 for Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Viaarxiv icon

Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Feb 02, 2021
Zhong Meng, Naoyuki Kanda, Yashesh Gaur, Sarangarajan Parthasarathy, Eric Sun, Liang Lu, Xie Chen, Jinyu Li, Yifan Gong

Figure 1 for Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition
Figure 2 for Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition
Viaarxiv icon

Streaming end-to-end multi-talker speech recognition

Add code
Bookmark button
Alert button
Nov 26, 2020
Liang Lu, Naoyuki Kanda, Jinyu Li, Yifan Gong

Figure 1 for Streaming end-to-end multi-talker speech recognition
Figure 2 for Streaming end-to-end multi-talker speech recognition
Figure 3 for Streaming end-to-end multi-talker speech recognition
Figure 4 for Streaming end-to-end multi-talker speech recognition
Viaarxiv icon