Picture for Ann Lee

Ann Lee

Direct simultaneous speech to speech translation

Add code
Oct 15, 2021
Figure 1 for Direct simultaneous speech to speech translation
Figure 2 for Direct simultaneous speech to speech translation
Viaarxiv icon

fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit

Add code
Sep 14, 2021
Figure 1 for fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit
Figure 2 for fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit
Figure 3 for fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit
Figure 4 for fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit
Viaarxiv icon

Text-Free Prosody-Aware Generative Spoken Language Modeling

Add code
Sep 07, 2021
Figure 1 for Text-Free Prosody-Aware Generative Spoken Language Modeling
Figure 2 for Text-Free Prosody-Aware Generative Spoken Language Modeling
Figure 3 for Text-Free Prosody-Aware Generative Spoken Language Modeling
Figure 4 for Text-Free Prosody-Aware Generative Spoken Language Modeling
Viaarxiv icon

Direct speech-to-speech translation with discrete units

Add code
Jul 12, 2021
Figure 1 for Direct speech-to-speech translation with discrete units
Figure 2 for Direct speech-to-speech translation with discrete units
Figure 3 for Direct speech-to-speech translation with discrete units
Figure 4 for Direct speech-to-speech translation with discrete units
Viaarxiv icon

Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training

Add code
Apr 02, 2021
Figure 1 for Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Figure 2 for Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Figure 3 for Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Figure 4 for Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Viaarxiv icon

VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation

Add code
Jan 02, 2021
Figure 1 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Figure 2 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Figure 3 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Figure 4 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Viaarxiv icon

Few-shot Sequence Learning with Transformers

Add code
Dec 17, 2020
Figure 1 for Few-shot Sequence Learning with Transformers
Figure 2 for Few-shot Sequence Learning with Transformers
Figure 3 for Few-shot Sequence Learning with Transformers
Figure 4 for Few-shot Sequence Learning with Transformers
Viaarxiv icon

Facebook AI's WMT20 News Translation Task Submission

Add code
Nov 16, 2020
Figure 1 for Facebook AI's WMT20 News Translation Task Submission
Figure 2 for Facebook AI's WMT20 News Translation Task Submission
Figure 3 for Facebook AI's WMT20 News Translation Task Submission
Figure 4 for Facebook AI's WMT20 News Translation Task Submission
Viaarxiv icon

Semi-Supervised Speech Recognition via Local Prior Matching

Add code
Feb 24, 2020
Figure 1 for Semi-Supervised Speech Recognition via Local Prior Matching
Figure 2 for Semi-Supervised Speech Recognition via Local Prior Matching
Figure 3 for Semi-Supervised Speech Recognition via Local Prior Matching
Figure 4 for Semi-Supervised Speech Recognition via Local Prior Matching
Viaarxiv icon

Self-Training for End-to-End Speech Recognition

Add code
Sep 19, 2019
Figure 1 for Self-Training for End-to-End Speech Recognition
Figure 2 for Self-Training for End-to-End Speech Recognition
Figure 3 for Self-Training for End-to-End Speech Recognition
Figure 4 for Self-Training for End-to-End Speech Recognition
Viaarxiv icon