Picture for Chulhee Yun

Chulhee Yun

Understanding Sharpness Dynamics in NN Training with a Minimalist Example: The Effects of Dataset Difficulty, Depth, Stochasticity, and More

Add code
Jun 07, 2025
Viaarxiv icon

Incremental Gradient Descent with Small Epoch Counts is Surprisingly Slow on Ill-Conditioned Problems

Add code
Jun 04, 2025
Viaarxiv icon

Convergence and Implicit Bias of Gradient Descent on Continual Linear Classification

Add code
Apr 17, 2025
Viaarxiv icon

Lightweight Dataset Pruning without Full Training via Example Difficulty and Prediction Uncertainty

Add code
Feb 10, 2025
Viaarxiv icon

Stochastic Extragradient with Flip-Flop Shuffling & Anchoring: Provable Improvements

Add code
Dec 31, 2024
Viaarxiv icon

Provable Benefit of Cutout and CutMix for Feature Learning

Add code
Oct 31, 2024
Viaarxiv icon

DASH: Warm-Starting Neural Network Training in Stationary Settings without Loss of Plasticity

Add code
Oct 30, 2024
Viaarxiv icon

Arithmetic Transformers Can Length-Generalize in Both Operand Length and Count

Add code
Oct 21, 2024
Viaarxiv icon

Position Coupling: Leveraging Task Structure for Improved Length Generalization of Transformers

Add code
May 31, 2024
Viaarxiv icon

Does SGD really happen in tiny subspaces?

Add code
May 25, 2024
Viaarxiv icon