Picture for Archi Mitra

Archi Mitra

Jack

Memorization Dynamics in Knowledge Distillation for Language Models

Add code
Jan 21, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon