Picture for Yifan Gong

Yifan Gong

Fred

On Addressing Practical Challenges for RNN-Transducer

Add code
May 04, 2021
Figure 1 for On Addressing Practical Challenges for RNN-Transducer
Figure 2 for On Addressing Practical Challenges for RNN-Transducer
Figure 3 for On Addressing Practical Challenges for RNN-Transducer
Figure 4 for On Addressing Practical Challenges for RNN-Transducer
Viaarxiv icon

Streaming Multi-talker Speech Recognition with Joint Speaker Identification

Add code
Apr 05, 2021
Figure 1 for Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Figure 2 for Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Figure 3 for Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Figure 4 for Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Viaarxiv icon

Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition

Add code
Feb 02, 2021
Figure 1 for Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition
Figure 2 for Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition
Viaarxiv icon

Streaming end-to-end multi-talker speech recognition

Add code
Nov 26, 2020
Figure 1 for Streaming end-to-end multi-talker speech recognition
Figure 2 for Streaming end-to-end multi-talker speech recognition
Figure 3 for Streaming end-to-end multi-talker speech recognition
Figure 4 for Streaming end-to-end multi-talker speech recognition
Viaarxiv icon

Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition

Add code
Nov 03, 2020
Figure 1 for Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Figure 2 for Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Figure 3 for Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Figure 4 for Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Viaarxiv icon

On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer

Add code
Oct 23, 2020
Figure 1 for On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer
Figure 2 for On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer
Figure 3 for On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer
Figure 4 for On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer
Viaarxiv icon

Speaker Separation Using Speaker Inventories and Estimated Speech

Add code
Oct 20, 2020
Figure 1 for Speaker Separation Using Speaker Inventories and Estimated Speech
Figure 2 for Speaker Separation Using Speaker Inventories and Estimated Speech
Figure 3 for Speaker Separation Using Speaker Inventories and Estimated Speech
Figure 4 for Speaker Separation Using Speaker Inventories and Estimated Speech
Viaarxiv icon

Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability

Add code
Jul 30, 2020
Figure 1 for Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability
Figure 2 for Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability
Figure 3 for Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability
Figure 4 for Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability
Viaarxiv icon

Exploring Transformers for Large-Scale Speech Recognition

Add code
May 19, 2020
Figure 1 for Exploring Transformers for Large-Scale Speech Recognition
Figure 2 for Exploring Transformers for Large-Scale Speech Recognition
Figure 3 for Exploring Transformers for Large-Scale Speech Recognition
Figure 4 for Exploring Transformers for Large-Scale Speech Recognition
Viaarxiv icon

Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR

Add code
May 15, 2020
Figure 1 for Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR
Figure 2 for Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR
Figure 3 for Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR
Figure 4 for Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR
Viaarxiv icon