Picture for Alessandro Stolfo

Alessandro Stolfo

Probing for Arithmetic Errors in Language Models

Add code
Jul 16, 2025
Viaarxiv icon

Dense SAE Latents Are Features, Not Bugs

Add code
Jun 18, 2025
Viaarxiv icon

Transferring Features Across Language Models With Model Stitching

Add code
Jun 07, 2025
Viaarxiv icon

MIB: A Mechanistic Interpretability Benchmark

Add code
Apr 17, 2025
Viaarxiv icon

Confidence Regulation Neurons in Language Models

Add code
Jun 24, 2024
Viaarxiv icon

Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study

Add code
Apr 10, 2024
Figure 1 for Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study
Figure 2 for Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study
Figure 3 for Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study
Figure 4 for Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study
Viaarxiv icon

Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?

Add code
Jan 31, 2024
Viaarxiv icon

Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models

Add code
Oct 23, 2023
Figure 1 for Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
Figure 2 for Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
Figure 3 for Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
Figure 4 for Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
Viaarxiv icon

Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis

Add code
May 24, 2023
Figure 1 for Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Figure 2 for Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Figure 3 for Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Figure 4 for Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Viaarxiv icon

Distilling Multi-Step Reasoning Capabilities of Large Language Models into Smaller Models via Semantic Decompositions

Add code
Dec 01, 2022
Viaarxiv icon