Picture for Geonhee Kim

Geonhee Kim

Mitigating Content Effects on Reasoning in Language Models through Fine-Grained Activation Steering

Add code
May 18, 2025
Viaarxiv icon

A Mechanistic Interpretation of Syllogistic Reasoning in Auto-Regressive Language Models

Add code
Aug 16, 2024
Viaarxiv icon