Picture for Ruoming Pang

Ruoming Pang

Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus

Add code
Aug 25, 2020
Figure 1 for Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus
Figure 2 for Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus
Figure 3 for Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus
Figure 4 for Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus
Viaarxiv icon

RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions

Add code
May 17, 2020
Figure 1 for RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions
Figure 2 for RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions
Figure 3 for RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions
Figure 4 for RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions
Viaarxiv icon

Dynamic Sparsity Neural Networks for Automatic Speech Recognition

Add code
May 16, 2020
Figure 1 for Dynamic Sparsity Neural Networks for Automatic Speech Recognition
Figure 2 for Dynamic Sparsity Neural Networks for Automatic Speech Recognition
Figure 3 for Dynamic Sparsity Neural Networks for Automatic Speech Recognition
Viaarxiv icon

Conformer: Convolution-augmented Transformer for Speech Recognition

Add code
May 16, 2020
Figure 1 for Conformer: Convolution-augmented Transformer for Speech Recognition
Figure 2 for Conformer: Convolution-augmented Transformer for Speech Recognition
Figure 3 for Conformer: Convolution-augmented Transformer for Speech Recognition
Figure 4 for Conformer: Convolution-augmented Transformer for Speech Recognition
Viaarxiv icon

ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context

Add code
May 16, 2020
Figure 1 for ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
Figure 2 for ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
Figure 3 for ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
Figure 4 for ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
Viaarxiv icon

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency

Add code
Mar 28, 2020
Figure 1 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 2 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 3 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 4 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Viaarxiv icon

BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models

Add code
Mar 24, 2020
Figure 1 for BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models
Figure 2 for BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models
Figure 3 for BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models
Figure 4 for BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models
Viaarxiv icon

Deliberation Model Based Two-Pass End-to-End Speech Recognition

Add code
Mar 17, 2020
Figure 1 for Deliberation Model Based Two-Pass End-to-End Speech Recognition
Figure 2 for Deliberation Model Based Two-Pass End-to-End Speech Recognition
Figure 3 for Deliberation Model Based Two-Pass End-to-End Speech Recognition
Figure 4 for Deliberation Model Based Two-Pass End-to-End Speech Recognition
Viaarxiv icon

EfficientDet: Scalable and Efficient Object Detection

Add code
Nov 20, 2019
Figure 1 for EfficientDet: Scalable and Efficient Object Detection
Figure 2 for EfficientDet: Scalable and Efficient Object Detection
Figure 3 for EfficientDet: Scalable and Efficient Object Detection
Figure 4 for EfficientDet: Scalable and Efficient Object Detection
Viaarxiv icon

A comparison of end-to-end models for long-form speech recognition

Add code
Nov 06, 2019
Figure 1 for A comparison of end-to-end models for long-form speech recognition
Figure 2 for A comparison of end-to-end models for long-form speech recognition
Figure 3 for A comparison of end-to-end models for long-form speech recognition
Viaarxiv icon