Vahab Mirrokni

Dima

It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization

Apr 17, 2025

Lattice: Learning to Efficiently Compress the Memory

Apr 08, 2025

Gemma 3 Technical Report

Mar 25, 2025

Synthetic Text Generation for Training Large Language Models via Gradient Matching

Feb 24, 2025

PiKE: Adaptive Data Mixing for Multi-Task Learning Under Low Gradient Conflicts

Feb 10, 2025

DeepCrossAttention: Supercharging Transformer Residual Connections

Feb 10, 2025

PolarQuant: Quantizing KV Caches with Polar Transformation

Feb 04, 2025

Titans: Learning to Memorize at Test Time

Dec 31, 2024

Best of Both Worlds: Advantages of Hybrid Graph Sequence Models

Nov 23, 2024

Procurement Auctions via Approximately Optimal Submodular Optimization

Nov 20, 2024