Gil Keren

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026

Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages

Nov 12, 2025

Efficient Streaming LLM for Speech Recognition

Oct 02, 2024

M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses

Sep 17, 2024

Faster Speech-LLaMA Inference with Multi-token Prediction

Sep 12, 2024

Token-Weighted RNN-T for Learning from Flawed Data

Jun 26, 2024

Towards Selection of Text-to-speech Data to Augment ASR Training

May 30, 2023

Text Generation with Speech Synthesis for ASR Data Augmentation

May 22, 2023

A Token-Wise Beam Search Algorithm for RNN-T

Feb 28, 2023

Improving Fast-slow Encoder based Transducer with Streaming Deliberation

Dec 15, 2022