Picture for Thomas Icard

Thomas Icard

A Reply to Makelov et al. 's "Interpretability Illusion" Arguments

Add code
Jan 23, 2024
Viaarxiv icon

Comparing Causal Frameworks: Potential Outcomes, Structural Models, Graphs, and Abstractions

Add code
Jun 25, 2023
Figure 1 for Comparing Causal Frameworks: Potential Outcomes, Structural Models, Graphs, and Abstractions
Figure 2 for Comparing Causal Frameworks: Potential Outcomes, Structural Models, Graphs, and Abstractions
Viaarxiv icon

Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations

Add code
Mar 05, 2023
Figure 1 for Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations
Figure 2 for Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations
Figure 3 for Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations
Figure 4 for Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations
Viaarxiv icon

Causal Abstraction for Faithful Model Interpretation

Add code
Jan 11, 2023
Figure 1 for Causal Abstraction for Faithful Model Interpretation
Figure 2 for Causal Abstraction for Faithful Model Interpretation
Figure 3 for Causal Abstraction for Faithful Model Interpretation
Figure 4 for Causal Abstraction for Faithful Model Interpretation
Viaarxiv icon

Causal Abstraction with Soft Interventions

Add code
Nov 22, 2022
Figure 1 for Causal Abstraction with Soft Interventions
Figure 2 for Causal Abstraction with Soft Interventions
Figure 3 for Causal Abstraction with Soft Interventions
Figure 4 for Causal Abstraction with Soft Interventions
Viaarxiv icon

Holistic Evaluation of Language Models

Add code
Nov 16, 2022
Figure 1 for Holistic Evaluation of Language Models
Figure 2 for Holistic Evaluation of Language Models
Figure 3 for Holistic Evaluation of Language Models
Figure 4 for Holistic Evaluation of Language Models
Viaarxiv icon

Causal Distillation for Language Models

Add code
Dec 05, 2021
Figure 1 for Causal Distillation for Language Models
Figure 2 for Causal Distillation for Language Models
Figure 3 for Causal Distillation for Language Models
Figure 4 for Causal Distillation for Language Models
Viaarxiv icon

Inducing Causal Structure for Interpretable Neural Networks

Add code
Dec 01, 2021
Figure 1 for Inducing Causal Structure for Interpretable Neural Networks
Figure 2 for Inducing Causal Structure for Interpretable Neural Networks
Figure 3 for Inducing Causal Structure for Interpretable Neural Networks
Figure 4 for Inducing Causal Structure for Interpretable Neural Networks
Viaarxiv icon

On the Opportunities and Risks of Foundation Models

Add code
Aug 18, 2021
Figure 1 for On the Opportunities and Risks of Foundation Models
Figure 2 for On the Opportunities and Risks of Foundation Models
Figure 3 for On the Opportunities and Risks of Foundation Models
Figure 4 for On the Opportunities and Risks of Foundation Models
Viaarxiv icon

A Topological Perspective on Causal Inference

Add code
Jul 18, 2021
Figure 1 for A Topological Perspective on Causal Inference
Figure 2 for A Topological Perspective on Causal Inference
Viaarxiv icon