Picture for Avner May

Avner May

Speculative Speculative Decoding

Add code
Mar 03, 2026
Viaarxiv icon

When RL Meets Adaptive Speculative Training: A Unified Training-Serving System

Add code
Feb 06, 2026
Viaarxiv icon

Minions: Cost-efficient Collaboration Between On-device and Cloud Language Models

Add code
Feb 21, 2025
Viaarxiv icon

The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Add code
Aug 27, 2024
Viaarxiv icon

SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices

Add code
Jun 04, 2024
Figure 1 for SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices
Figure 2 for SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices
Figure 3 for SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices
Figure 4 for SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices
Viaarxiv icon

Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding

Add code
Feb 29, 2024
Figure 1 for Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding
Figure 2 for Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding
Figure 3 for Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding
Figure 4 for Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding
Viaarxiv icon

Audio-visual fine-tuning of audio-only ASR models

Add code
Dec 14, 2023
Figure 1 for Audio-visual fine-tuning of audio-only ASR models
Figure 2 for Audio-visual fine-tuning of audio-only ASR models
Figure 3 for Audio-visual fine-tuning of audio-only ASR models
Viaarxiv icon

Contextual Embeddings: When Are They Worth It?

Add code
May 18, 2020
Figure 1 for Contextual Embeddings: When Are They Worth It?
Figure 2 for Contextual Embeddings: When Are They Worth It?
Figure 3 for Contextual Embeddings: When Are They Worth It?
Figure 4 for Contextual Embeddings: When Are They Worth It?
Viaarxiv icon

Understanding the Downstream Instability of Word Embeddings

Add code
Feb 29, 2020
Figure 1 for Understanding the Downstream Instability of Word Embeddings
Figure 2 for Understanding the Downstream Instability of Word Embeddings
Figure 3 for Understanding the Downstream Instability of Word Embeddings
Figure 4 for Understanding the Downstream Instability of Word Embeddings
Viaarxiv icon

On the Downstream Performance of Compressed Word Embeddings

Add code
Sep 03, 2019
Figure 1 for On the Downstream Performance of Compressed Word Embeddings
Figure 2 for On the Downstream Performance of Compressed Word Embeddings
Figure 3 for On the Downstream Performance of Compressed Word Embeddings
Figure 4 for On the Downstream Performance of Compressed Word Embeddings
Viaarxiv icon