Ann Lee

A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation

Jan 25, 2023

UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units

Dec 15, 2022

Speech-to-Speech Translation For A Real-world Unwritten Language

Nov 11, 2022

SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations

Nov 08, 2022

Bridging Speech and Textual Pre-trained Models with Unsupervised ASR

Nov 06, 2022

On The Robustness of Self-Supervised Representations for Spoken Language Modeling

Sep 30, 2022

Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation

Apr 06, 2022

textless-lib: a Library for Textless Spoken Language Processing

Feb 15, 2022

Flashlight: Enabling Innovation in Tools for Machine Learning

Jan 29, 2022

Textless Speech-to-Speech Translation on Real Data

Dec 15, 2021