Picture for Yonghui Wu

Yonghui Wu

Conformer: Convolution-augmented Transformer for Speech Recognition

Add code
May 16, 2020
Figure 1 for Conformer: Convolution-augmented Transformer for Speech Recognition
Figure 2 for Conformer: Convolution-augmented Transformer for Speech Recognition
Figure 3 for Conformer: Convolution-augmented Transformer for Speech Recognition
Figure 4 for Conformer: Convolution-augmented Transformer for Speech Recognition
Viaarxiv icon

ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context

Add code
May 16, 2020
Figure 1 for ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
Figure 2 for ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
Figure 3 for ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
Figure 4 for ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
Viaarxiv icon

Interpretable Learning-to-Rank with Generalized Additive Models

Add code
May 14, 2020
Figure 1 for Interpretable Learning-to-Rank with Generalized Additive Models
Figure 2 for Interpretable Learning-to-Rank with Generalized Additive Models
Figure 3 for Interpretable Learning-to-Rank with Generalized Additive Models
Figure 4 for Interpretable Learning-to-Rank with Generalized Additive Models
Viaarxiv icon

Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation

Add code
May 11, 2020
Figure 1 for Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation
Figure 2 for Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation
Figure 3 for Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation
Figure 4 for Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation
Viaarxiv icon

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency

Add code
Mar 28, 2020
Figure 1 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 2 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 3 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 4 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Viaarxiv icon

Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis

Add code
Feb 06, 2020
Figure 1 for Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis
Figure 2 for Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis
Figure 3 for Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis
Figure 4 for Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis
Viaarxiv icon

Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior

Add code
Feb 06, 2020
Figure 1 for Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior
Figure 2 for Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior
Figure 3 for Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior
Figure 4 for Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior
Viaarxiv icon

SpecAugment on Large Scale Datasets

Add code
Dec 11, 2019
Figure 1 for SpecAugment on Large Scale Datasets
Figure 2 for SpecAugment on Large Scale Datasets
Figure 3 for SpecAugment on Large Scale Datasets
Viaarxiv icon

A comparison of end-to-end models for long-form speech recognition

Add code
Nov 06, 2019
Figure 1 for A comparison of end-to-end models for long-form speech recognition
Figure 2 for A comparison of end-to-end models for long-form speech recognition
Figure 3 for A comparison of end-to-end models for long-form speech recognition
Viaarxiv icon

Identifying Cancer Patients at Risk for Heart Failure Using Machine Learning Methods

Add code
Oct 01, 2019
Figure 1 for Identifying Cancer Patients at Risk for Heart Failure Using Machine Learning Methods
Figure 2 for Identifying Cancer Patients at Risk for Heart Failure Using Machine Learning Methods
Figure 3 for Identifying Cancer Patients at Risk for Heart Failure Using Machine Learning Methods
Figure 4 for Identifying Cancer Patients at Risk for Heart Failure Using Machine Learning Methods
Viaarxiv icon