Picture for Vahab Mirrokni

Vahab Mirrokni

Dima

TNT: Improving Chunkwise Training for Test-Time Memorization

Add code
Nov 10, 2025
Viaarxiv icon

Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories

Add code
Oct 01, 2025
Figure 1 for Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories
Figure 2 for Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories
Figure 3 for Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories
Figure 4 for Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories
Viaarxiv icon

ATLAS: Learning to Optimally Memorize the Context at Test Time

Add code
May 29, 2025
Figure 1 for ATLAS: Learning to Optimally Memorize the Context at Test Time
Figure 2 for ATLAS: Learning to Optimally Memorize the Context at Test Time
Figure 3 for ATLAS: Learning to Optimally Memorize the Context at Test Time
Figure 4 for ATLAS: Learning to Optimally Memorize the Context at Test Time
Viaarxiv icon

Efficient Data Selection at Scale via Influence Distillation

Add code
May 25, 2025
Viaarxiv icon

Self-Boost via Optimal Retraining: An Analysis via Approximate Message Passing

Add code
May 21, 2025
Viaarxiv icon

TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate

Add code
Apr 28, 2025
Figure 1 for TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate
Figure 2 for TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate
Figure 3 for TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate
Figure 4 for TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate
Viaarxiv icon

It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization

Add code
Apr 17, 2025
Figure 1 for It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization
Figure 2 for It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization
Figure 3 for It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization
Figure 4 for It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization
Viaarxiv icon

Lattice: Learning to Efficiently Compress the Memory

Add code
Apr 08, 2025
Viaarxiv icon

Gemma 3 Technical Report

Add code
Mar 25, 2025
Viaarxiv icon

Synthetic Text Generation for Training Large Language Models via Gradient Matching

Add code
Feb 24, 2025
Figure 1 for Synthetic Text Generation for Training Large Language Models via Gradient Matching
Figure 2 for Synthetic Text Generation for Training Large Language Models via Gradient Matching
Figure 3 for Synthetic Text Generation for Training Large Language Models via Gradient Matching
Figure 4 for Synthetic Text Generation for Training Large Language Models via Gradient Matching
Viaarxiv icon