Picture for Vahab Mirrokni

Vahab Mirrokni

Dima

Theoretical Perspectives on Data Quality and Synergistic Effects in Pre- and Post-Training Reasoning Models

Add code
Mar 01, 2026
Viaarxiv icon

Memory Caching: RNNs with Growing Memory

Add code
Feb 27, 2026
Viaarxiv icon

Aletheia tackles FirstProof autonomously

Add code
Feb 24, 2026
Viaarxiv icon

Less is More: Convergence Benefits of Fewer Data Weight Updates over Longer Horizon

Add code
Feb 23, 2026
Viaarxiv icon

Accelerating Scientific Research with Gemini: Case Studies and Common Techniques

Add code
Feb 03, 2026
Viaarxiv icon

ECO: Quantized Training without Full-Precision Master Weights

Add code
Jan 29, 2026
Viaarxiv icon

Nested Learning: The Illusion of Deep Learning Architectures

Add code
Dec 31, 2025
Viaarxiv icon

Trellis: Learning to Compress Key-Value Memory in Attention Models

Add code
Dec 29, 2025
Viaarxiv icon

MS-SSM: A Multi-Scale State Space Model for Efficient Sequence Modeling

Add code
Dec 29, 2025
Viaarxiv icon

Sampling and Loss Weights in Multi-Domain Training

Add code
Nov 10, 2025
Viaarxiv icon