Aditi Raghunathan

The Finetuner's Fallacy: When to Pretrain with Your Finetuning Data

Mar 17, 2026

One-step Language Modeling via Continuous Denoising

Feb 18, 2026

S2D: Selective Spectral Decay for Quantization-Friendly Conditioning of Neural Activations

Feb 16, 2026

Watch the Weights: Unsupervised monitoring and control of fine-tuned LLMs

Jul 31, 2025

Reasoning as an Adaptive Defense for Safety

Jul 01, 2025

Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction

Apr 21, 2025

Weight Ensembling Improves Reasoning in Language Models

Apr 15, 2025

Exact Unlearning of Finetuning Data via Model Merging at Scale

Apr 06, 2025

Overtrained Language Models Are Harder to Fine-Tune

Mar 24, 2025

Not-Just-Scaling Laws: Towards a Better Understanding of the Downstream Impact of Language Model Design Decisions

Mar 05, 2025