Ehsan Variani

Benchmarking LLMs on the Massive Sound Embedding Benchmark (MSEB)

May 06, 2026
Via arXiv

Massive Sound Embedding Benchmark (MSEB)

Feb 06, 2026
Via arXiv

LAST: Scalable Lattice-Based Speech Modelling in JAX

Apr 25, 2023
Via arXiv

JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition

Feb 16, 2023
Via arXiv

Alignment Entropy Regularization

Dec 22, 2022
Via arXiv

Modular Hybrid Autoregressive Transducer

Oct 31, 2022
Via arXiv

UserLibri: A Dataset for ASR Personalization Using Only Text

Jul 02, 2022
Via arXiv

Global Normalization for Streaming Speech Recognition in a Modular Framework

May 26, 2022
Via arXiv

Improving Rare Word Recognition with LM-aware MWER Training

Apr 15, 2022
Via arXiv

Cascaded encoders for unifying streaming and non-streaming ASR

Oct 27, 2020
Via arXiv