Ching-Feng Yeh

Alignment Restricted Streaming Recurrent Neural Network Transducer

Nov 05, 2020
Via arXiv

Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition

Nov 03, 2020
Via arXiv

Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications

Oct 29, 2020
Via arXiv

Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition

Oct 29, 2020
Via arXiv

Weak-Attention Suppression For Transformer Based Speech Recognition

May 18, 2020
Via arXiv

Streaming Transformer-based Acoustic Models Using Self-attention with Augmented Memory

May 16, 2020
Via arXiv

AIPNet: Generative Adversarial Pre-training of Accent-invariant Networks for End-to-end Speech Recognition

Nov 27, 2019
Via arXiv

RNN-T For Latency Controlled ASR With Improved Beam Search

Nov 05, 2019
Via arXiv

Transformer-Transducer: End-to-End Speech Recognition with Self-Attention

Oct 28, 2019
Via arXiv

Training Augmentation with Adversarial Examples for Robust Speech Recognition

Jun 17, 2018
Via arXiv