Howe Tissue

Domain2Vec: Vectorizing Datasets to Find the Optimal Data Mixture without Training

Jun 12, 2025
Via arXiv

Learning Dynamics in Continual Pre-Training for Large Language Models

May 12, 2025
Via arXiv

Scaling Law with Learning Rate Annealing

Aug 20, 2024
Via arXiv