Picture for Baharan Mirzasoleiman

Baharan Mirzasoleiman

Reasoning Quality Emerges Early: Data Curation for Reasoning Models

Add code
Jun 25, 2026
Viaarxiv icon

Your Model Already Knows: Attention-Guided Safety Filter for Vision-Language-Action Models

Add code
Jun 08, 2026
Viaarxiv icon

ProbeAct: Probe-Guided Training-Free Failure Recovery in Vision-Language-Action Models

Add code
Jun 08, 2026
Viaarxiv icon

How Transformers Learn to Plan via Multi-Token Prediction

Add code
Apr 13, 2026
Viaarxiv icon

Theoretical Perspectives on Data Quality and Synergistic Effects in Pre- and Post-Training Reasoning Models

Add code
Mar 01, 2026
Viaarxiv icon

Data Distribution as a Lever for Guiding Optimizers Toward Superior Generalization in LLMs

Add code
Jan 31, 2026
Viaarxiv icon

Beyond What Seems Necessary: Hidden Gains from Scaling Training-Time Reasoning Length under Outcome Supervision

Add code
Jan 31, 2026
Viaarxiv icon

Tuning the Implicit Regularizer of Masked Diffusion Language Models: Enhancing Generalization via Insights from $k$-Parity

Add code
Jan 30, 2026
Viaarxiv icon

Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories

Add code
Oct 01, 2025
Figure 1 for Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories
Figure 2 for Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories
Figure 3 for Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories
Figure 4 for Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories
Viaarxiv icon

LoRA is All You Need for Safety Alignment of Reasoning LLMs

Add code
Jul 22, 2025
Figure 1 for LoRA is All You Need for Safety Alignment of Reasoning LLMs
Figure 2 for LoRA is All You Need for Safety Alignment of Reasoning LLMs
Figure 3 for LoRA is All You Need for Safety Alignment of Reasoning LLMs
Figure 4 for LoRA is All You Need for Safety Alignment of Reasoning LLMs
Viaarxiv icon