Picture for Abir Harrasse

Abir Harrasse

CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs

Add code
Mar 22, 2026
Viaarxiv icon

Curveball Steering: The Right Direction To Steer Isn't Always Linear

Add code
Mar 11, 2026
Viaarxiv icon

Tracing Multilingual Representations in LLMs with Cross-Layer Transcoders

Add code
Nov 13, 2025
Figure 1 for Tracing Multilingual Representations in LLMs with Cross-Layer Transcoders
Figure 2 for Tracing Multilingual Representations in LLMs with Cross-Layer Transcoders
Figure 3 for Tracing Multilingual Representations in LLMs with Cross-Layer Transcoders
Figure 4 for Tracing Multilingual Representations in LLMs with Cross-Layer Transcoders
Viaarxiv icon

TinySQL: A Progressive Text-to-SQL Dataset for Mechanistic Interpretability Research

Add code
Mar 17, 2025
Viaarxiv icon

Activation Space Interventions Can Be Transferred Between Large Language Models

Add code
Mar 06, 2025
Viaarxiv icon

Adversarial Multi-Agent Evaluation of Large Language Models through Iterative Debates

Add code
Oct 07, 2024
Figure 1 for Adversarial Multi-Agent Evaluation of Large Language Models through Iterative Debates
Figure 2 for Adversarial Multi-Agent Evaluation of Large Language Models through Iterative Debates
Figure 3 for Adversarial Multi-Agent Evaluation of Large Language Models through Iterative Debates
Viaarxiv icon