Picture for Maxime Guigon

Maxime Guigon

A Study on Hidden Layer Distillation for Large Language Model Pre-Training

Add code
May 12, 2026
Viaarxiv icon