Alireza Bayat Makou

Many Circuits, One Mechanism: Input Variation and Evaluation Granularity in Circuit Discovery

Jun 04, 2026
Via arXiv

LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback

Jun 05, 2024
Via arXiv