Liang Lu

Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition

Feb 02, 2021
Via arXiv

Streaming end-to-end multi-talker speech recognition

Nov 26, 2020
Via arXiv

Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR

Nov 03, 2020
Via arXiv

Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition

Nov 03, 2020
Via arXiv

On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer

Oct 23, 2020
Via arXiv

Exploring Transformers for Large-Scale Speech Recognition

May 19, 2020
Via arXiv

Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR

May 15, 2020
Via arXiv

Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition

May 01, 2020
Via arXiv

Continuous speech separation: dataset and analysis

Jan 30, 2020
Via arXiv

Semantic Mask for Transformer based End-to-End Speech Recognition

Dec 06, 2019
Via arXiv