Picture for Bruno Puri

Bruno Puri

Atlas-Alignment: Making Interpretability Transferable Across Language Models

Add code
Oct 31, 2025
Viaarxiv icon

FADE: Why Bad Descriptions Happen to Good Features

Add code
Feb 24, 2025
Viaarxiv icon

A Close Look at Decomposition-based XAI-Methods for Transformer Language Models

Add code
Feb 21, 2025
Viaarxiv icon