Picture for Jason Riesa

Jason Riesa

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Multimodal Modeling For Spoken Language Identification

Add code
Sep 19, 2023
Figure 1 for Multimodal Modeling For Spoken Language Identification
Figure 2 for Multimodal Modeling For Spoken Language Identification
Figure 3 for Multimodal Modeling For Spoken Language Identification
Figure 4 for Multimodal Modeling For Spoken Language Identification
Viaarxiv icon

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

Add code
Mar 03, 2023
Figure 1 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 2 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 3 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 4 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Viaarxiv icon

SQuId: Measuring Speech Naturalness in Many Languages

Add code
Oct 12, 2022
Figure 1 for SQuId: Measuring Speech Naturalness in Many Languages
Figure 2 for SQuId: Measuring Speech Naturalness in Many Languages
Figure 3 for SQuId: Measuring Speech Naturalness in Many Languages
Figure 4 for SQuId: Measuring Speech Naturalness in Many Languages
Viaarxiv icon

FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation

Add code
Oct 01, 2022
Figure 1 for FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation
Figure 2 for FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation
Figure 3 for FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation
Figure 4 for FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation
Viaarxiv icon

FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech

Add code
May 25, 2022
Figure 1 for FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech
Figure 2 for FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech
Figure 3 for FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech
Figure 4 for FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech
Viaarxiv icon

Building Machine Translation Systems for the Next Thousand Languages

Add code
May 16, 2022
Figure 1 for Building Machine Translation Systems for the Next Thousand Languages
Figure 2 for Building Machine Translation Systems for the Next Thousand Languages
Figure 3 for Building Machine Translation Systems for the Next Thousand Languages
Figure 4 for Building Machine Translation Systems for the Next Thousand Languages
Viaarxiv icon

XTREME-S: Evaluating Cross-lingual Speech Representations

Add code
Apr 13, 2022
Figure 1 for XTREME-S: Evaluating Cross-lingual Speech Representations
Figure 2 for XTREME-S: Evaluating Cross-lingual Speech Representations
Figure 3 for XTREME-S: Evaluating Cross-lingual Speech Representations
Figure 4 for XTREME-S: Evaluating Cross-lingual Speech Representations
Viaarxiv icon

mSLAM: Massively multilingual joint pre-training for speech and text

Add code
Feb 03, 2022
Figure 1 for mSLAM: Massively multilingual joint pre-training for speech and text
Figure 2 for mSLAM: Massively multilingual joint pre-training for speech and text
Figure 3 for mSLAM: Massively multilingual joint pre-training for speech and text
Figure 4 for mSLAM: Massively multilingual joint pre-training for speech and text
Viaarxiv icon

SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training

Add code
Oct 20, 2021
Figure 1 for SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training
Figure 2 for SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training
Figure 3 for SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training
Figure 4 for SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training
Viaarxiv icon