Tara Sainath

Google

Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models

Mar 25, 2024

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023

Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm

Sep 29, 2023

Massive End-to-end Models for Short Search Queries

Sep 22, 2023

Improving Speech Recognition for African American English With Audio Classification

Sep 16, 2023

Augmenting conformers with structured state space models for online speech recognition

Sep 15, 2023

AudioPaLM: A Large Language Model That Can Speak and Listen

Jun 22, 2023

A Comparison of Semi-Supervised Learning Techniques for Streaming ASR at Scale

Apr 19, 2023

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

Mar 03, 2023