Picture for Alexei Baevski

Alexei Baevski

Jack

Unsupervised Speech Recognition

Add code
May 24, 2021
Figure 1 for Unsupervised Speech Recognition
Figure 2 for Unsupervised Speech Recognition
Figure 3 for Unsupervised Speech Recognition
Figure 4 for Unsupervised Speech Recognition
Viaarxiv icon

Large-Scale Self- and Semi-Supervised Learning for Speech Translation

Add code
Apr 14, 2021
Figure 1 for Large-Scale Self- and Semi-Supervised Learning for Speech Translation
Figure 2 for Large-Scale Self- and Semi-Supervised Learning for Speech Translation
Figure 3 for Large-Scale Self- and Semi-Supervised Learning for Speech Translation
Viaarxiv icon

Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training

Add code
Apr 02, 2021
Figure 1 for Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Figure 2 for Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Figure 3 for Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Figure 4 for Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Viaarxiv icon

Generative Spoken Language Modeling from Raw Audio

Add code
Feb 01, 2021
Figure 1 for Generative Spoken Language Modeling from Raw Audio
Figure 2 for Generative Spoken Language Modeling from Raw Audio
Figure 3 for Generative Spoken Language Modeling from Raw Audio
Figure 4 for Generative Spoken Language Modeling from Raw Audio
Viaarxiv icon

Reservoir Transformer

Add code
Dec 30, 2020
Figure 1 for Reservoir Transformer
Figure 2 for Reservoir Transformer
Figure 3 for Reservoir Transformer
Figure 4 for Reservoir Transformer
Viaarxiv icon

The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling

Add code
Dec 01, 2020
Figure 1 for The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling
Figure 2 for The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling
Figure 3 for The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling
Figure 4 for The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling
Viaarxiv icon

A Comparison of Discrete Latent Variable Models for Speech Representation Learning

Add code
Oct 24, 2020
Figure 1 for A Comparison of Discrete Latent Variable Models for Speech Representation Learning
Figure 2 for A Comparison of Discrete Latent Variable Models for Speech Representation Learning
Figure 3 for A Comparison of Discrete Latent Variable Models for Speech Representation Learning
Figure 4 for A Comparison of Discrete Latent Variable Models for Speech Representation Learning
Viaarxiv icon

Self-training and Pre-training are Complementary for Speech Recognition

Add code
Oct 22, 2020
Figure 1 for Self-training and Pre-training are Complementary for Speech Recognition
Figure 2 for Self-training and Pre-training are Complementary for Speech Recognition
Figure 3 for Self-training and Pre-training are Complementary for Speech Recognition
Figure 4 for Self-training and Pre-training are Complementary for Speech Recognition
Viaarxiv icon

Unsupervised Cross-lingual Representation Learning for Speech Recognition

Add code
Jun 24, 2020
Figure 1 for Unsupervised Cross-lingual Representation Learning for Speech Recognition
Figure 2 for Unsupervised Cross-lingual Representation Learning for Speech Recognition
Figure 3 for Unsupervised Cross-lingual Representation Learning for Speech Recognition
Figure 4 for Unsupervised Cross-lingual Representation Learning for Speech Recognition
Viaarxiv icon

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Add code
Jun 20, 2020
Figure 1 for wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Figure 2 for wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Figure 3 for wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Figure 4 for wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Viaarxiv icon