Picture for Louis Jaburi

Louis Jaburi

Evaluating Explanations: An Explanatory Virtues Framework for Mechanistic Interpretability -- The Strange Science Part I.ii

Add code
May 02, 2025
Viaarxiv icon

A Mathematical Philosophy of Explanations in Mechanistic Interpretability -- The Strange Science Part I.i

Add code
May 01, 2025
Viaarxiv icon

Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations

Add code
Oct 09, 2024
Figure 1 for Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations
Figure 2 for Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations
Figure 3 for Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations
Figure 4 for Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations
Viaarxiv icon