Picture for Jacob Mitchell Springer

Jacob Mitchell Springer

Early Data Exposure Improves Robustness to Subsequent Fine-Tuning

Add code
May 12, 2026
Viaarxiv icon

Annotations Mitigate Post-Training Mode Collapse

Add code
May 11, 2026
Viaarxiv icon

Sharpness-Aware Pretraining Mitigates Catastrophic Forgetting

Add code
May 04, 2026
Viaarxiv icon

Disentangling Geometry, Performance, and Training in Language Models

Add code
Feb 24, 2026
Viaarxiv icon

Overtrained Language Models Are Harder to Fine-Tune

Add code
Mar 24, 2025
Figure 1 for Overtrained Language Models Are Harder to Fine-Tune
Figure 2 for Overtrained Language Models Are Harder to Fine-Tune
Figure 3 for Overtrained Language Models Are Harder to Fine-Tune
Figure 4 for Overtrained Language Models Are Harder to Fine-Tune
Viaarxiv icon

Sharpness-Aware Minimization Enhances Feature Quality via Balanced Learning

Add code
May 30, 2024
Figure 1 for Sharpness-Aware Minimization Enhances Feature Quality via Balanced Learning
Figure 2 for Sharpness-Aware Minimization Enhances Feature Quality via Balanced Learning
Figure 3 for Sharpness-Aware Minimization Enhances Feature Quality via Balanced Learning
Figure 4 for Sharpness-Aware Minimization Enhances Feature Quality via Balanced Learning
Viaarxiv icon

Repetition Improves Language Model Embeddings

Add code
Feb 23, 2024
Figure 1 for Repetition Improves Language Model Embeddings
Figure 2 for Repetition Improves Language Model Embeddings
Figure 3 for Repetition Improves Language Model Embeddings
Figure 4 for Repetition Improves Language Model Embeddings
Viaarxiv icon

Understanding Catastrophic Forgetting in Language Models via Implicit Inference

Add code
Sep 18, 2023
Figure 1 for Understanding Catastrophic Forgetting in Language Models via Implicit Inference
Figure 2 for Understanding Catastrophic Forgetting in Language Models via Implicit Inference
Figure 3 for Understanding Catastrophic Forgetting in Language Models via Implicit Inference
Figure 4 for Understanding Catastrophic Forgetting in Language Models via Implicit Inference
Viaarxiv icon