Picture for Trevor Strohman

Trevor Strohman

Efficient Adapter Finetuning for Tail Languages in Streaming Multilingual ASR

Add code
Jan 17, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Controlled Decoding from Language Models

Add code
Oct 25, 2023
Figure 1 for Controlled Decoding from Language Models
Figure 2 for Controlled Decoding from Language Models
Figure 3 for Controlled Decoding from Language Models
Figure 4 for Controlled Decoding from Language Models
Viaarxiv icon

Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR

Add code
Mar 31, 2023
Figure 1 for Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR
Figure 2 for Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR
Figure 3 for Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR
Figure 4 for Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR
Viaarxiv icon

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

Add code
Mar 03, 2023
Figure 1 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 2 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 3 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 4 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Viaarxiv icon

UML: A Universal Monolingual Output Layer for Multilingual ASR

Add code
Feb 22, 2023
Figure 1 for UML: A Universal Monolingual Output Layer for Multilingual ASR
Figure 2 for UML: A Universal Monolingual Output Layer for Multilingual ASR
Figure 3 for UML: A Universal Monolingual Output Layer for Multilingual ASR
Viaarxiv icon

Massively Multilingual Shallow Fusion with Large Language Models

Add code
Feb 17, 2023
Figure 1 for Massively Multilingual Shallow Fusion with Large Language Models
Figure 2 for Massively Multilingual Shallow Fusion with Large Language Models
Figure 3 for Massively Multilingual Shallow Fusion with Large Language Models
Figure 4 for Massively Multilingual Shallow Fusion with Large Language Models
Viaarxiv icon

Efficient Domain Adaptation for Speech Foundation Models

Add code
Feb 03, 2023
Figure 1 for Efficient Domain Adaptation for Speech Foundation Models
Figure 2 for Efficient Domain Adaptation for Speech Foundation Models
Figure 3 for Efficient Domain Adaptation for Speech Foundation Models
Figure 4 for Efficient Domain Adaptation for Speech Foundation Models
Viaarxiv icon

From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition

Add code
Jan 19, 2023
Figure 1 for From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition
Figure 2 for From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition
Figure 3 for From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition
Figure 4 for From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition
Viaarxiv icon

Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion

Add code
Nov 04, 2022
Figure 1 for Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion
Figure 2 for Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion
Figure 3 for Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion
Figure 4 for Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion
Viaarxiv icon