Yiming Dong

Stepsize anything: A unified learning rate schedule for budgeted-iteration training

May 30, 2025
Via arXiv

From Macro to Micro: Probing Dataset Diversity in Language Model Fine-Tuning

May 30, 2025
Via arXiv

On the $O(\frac{\sqrt{d}}{K^{1/4}})$ Convergence Rate of AdamW Measured by $\ell_1$ Norm

May 17, 2025
Via arXiv

Convergence Rate Analysis of LION

Nov 12, 2024
Via arXiv