Yifan Gong

Fred

Speaker Separation Using Speaker Inventories and Estimated Speech

Oct 20, 2020

Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability

Jul 30, 2020

Exploring Transformers for Large-Scale Speech Recognition

May 19, 2020

Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR

May 15, 2020

Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition

May 01, 2020

L-Vector: Neural Label Embedding for Domain Adaptation

Apr 25, 2020

High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model

Mar 17, 2020

A Privacy-Preserving DNN Pruning and Mobile Acceleration Framework

Mar 13, 2020

BLK-REW: A Unified Block-based DNN Pruning Framework using Reweighted Regularization Method

Feb 22, 2020

RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition

Feb 19, 2020