Alert button
Picture for Guoli Ye

Guoli Ye

Alert button

Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation

Add code
Bookmark button
Alert button
Sep 14, 2023
Shaoshi Ling, Guoli Ye, Rui Zhao, Yifan Gong

Figure 1 for Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation
Figure 2 for Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation
Figure 3 for Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation
Figure 4 for Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation
Viaarxiv icon

Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Aug 03, 2023
Shaoshi Ling, Yuxuan Hu, Shuangbei Qian, Guoli Ye, Yao Qian, Yifan Gong, Ed Lin, Michael Zeng

Figure 1 for Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Figure 2 for Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Figure 3 for Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Figure 4 for Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Viaarxiv icon

Acoustic-aware Non-autoregressive Spell Correction with Mask Sample Decoding

Add code
Bookmark button
Alert button
Oct 16, 2022
Ruchao Fan, Guoli Ye, Yashesh Gaur, Jinyu Li

Figure 1 for Acoustic-aware Non-autoregressive Spell Correction with Mask Sample Decoding
Figure 2 for Acoustic-aware Non-autoregressive Spell Correction with Mask Sample Decoding
Figure 3 for Acoustic-aware Non-autoregressive Spell Correction with Mask Sample Decoding
Figure 4 for Acoustic-aware Non-autoregressive Spell Correction with Mask Sample Decoding
Viaarxiv icon

Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition

Add code
Bookmark button
Alert button
Oct 10, 2021
Guoli Ye, Vadim Mazalov, Jinyu Li, Yifan Gong

Figure 1 for Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition
Figure 2 for Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition
Figure 3 for Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition
Figure 4 for Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition
Viaarxiv icon

Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Jun 04, 2021
Zhong Meng, Yu Wu, Naoyuki Kanda, Liang Lu, Xie Chen, Guoli Ye, Eric Sun, Jinyu Li, Yifan Gong

Figure 1 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Figure 2 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Figure 3 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Viaarxiv icon

Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone

Add code
Bookmark button
Alert button
Apr 12, 2021
Naoyuki Kanda, Guoli Ye, Yu Wu, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

Figure 1 for Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone
Figure 2 for Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone
Figure 3 for Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone
Viaarxiv icon

End-to-End Speaker-Attributed ASR with Transformer

Add code
Bookmark button
Alert button
Apr 05, 2021
Naoyuki Kanda, Guoli Ye, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

Figure 1 for End-to-End Speaker-Attributed ASR with Transformer
Figure 2 for End-to-End Speaker-Attributed ASR with Transformer
Figure 3 for End-to-End Speaker-Attributed ASR with Transformer
Figure 4 for End-to-End Speaker-Attributed ASR with Transformer
Viaarxiv icon