Picture for Soheil Feizi

Soheil Feizi

Localizing Knowledge in Diffusion Transformers

Add code
May 24, 2025
Viaarxiv icon

Gaming Tool Preferences in Agentic LLMs

Add code
May 23, 2025
Viaarxiv icon

Chain-of-Defensive-Thought: Structured Reasoning Elicits Robustness in Large Language Models against Reference Corruption

Add code
Apr 29, 2025
Viaarxiv icon

How Learnable Grids Recover Fine Detail in Low Dimensions: A Neural Tangent Kernel Analysis of Multigrid Parametric Encodings

Add code
Apr 18, 2025
Viaarxiv icon

RePanda: Pandas-powered Tabular Verification and Reasoning

Add code
Mar 14, 2025
Viaarxiv icon

Seeing What's Not There: Spurious Correlation in Multimodal LLMs

Add code
Mar 11, 2025
Viaarxiv icon

A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models

Add code
Feb 22, 2025
Figure 1 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Figure 2 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Figure 3 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Figure 4 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Viaarxiv icon

Almost AI, Almost Human: The Challenge of Detecting AI-Polished Writing

Add code
Feb 21, 2025
Viaarxiv icon

On Mechanistic Circuits for Extractive Question-Answering

Add code
Feb 12, 2025
Figure 1 for On Mechanistic Circuits for Extractive Question-Answering
Figure 2 for On Mechanistic Circuits for Extractive Question-Answering
Figure 3 for On Mechanistic Circuits for Extractive Question-Answering
Figure 4 for On Mechanistic Circuits for Extractive Question-Answering
Viaarxiv icon

RESTOR: Knowledge Recovery through Machine Unlearning

Add code
Oct 31, 2024
Figure 1 for RESTOR: Knowledge Recovery through Machine Unlearning
Figure 2 for RESTOR: Knowledge Recovery through Machine Unlearning
Figure 3 for RESTOR: Knowledge Recovery through Machine Unlearning
Figure 4 for RESTOR: Knowledge Recovery through Machine Unlearning
Viaarxiv icon