Picture for Shahar Mendel

Shahar Mendel

A Mechanistic Account of Attention Sinks in GPT-2: One Circuit, Broader Implications for Mitigation

Add code
Apr 16, 2026
Viaarxiv icon

Outcome-Based RL Provably Leads Transformers to Reason, but Only With the Right Data

Add code
Jan 21, 2026
Viaarxiv icon