Michelle Tadmor Ramanovich

SimulTron: On-Device Simultaneous Speech to Speech Translation

Jun 04, 2024
Via arXiv

AudioPaLM: A Large Language Model That Can Speak and Listen

Jun 22, 2023
Via arXiv

Translatotron 3: Speech to Speech Translation with Monolingual Data

Jun 01, 2023
Via arXiv

LMs with a Voice: Spoken Language Modeling beyond Speech Tokens

May 24, 2023
Via arXiv

CVSS Corpus and Massively Multilingual Speech-to-Speech Translation

Jan 16, 2022
Via arXiv

More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech

Nov 19, 2021
Via arXiv

Translatotron 2: Robust direct speech-to-speech translation

Jul 29, 2021
Via arXiv