Soheil Feizi

Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text

Jun 08, 2025
Via arXiv

DyePack: Provably Flagging Test Set Contamination in LLMs Using Backdoors

May 29, 2025
Via arXiv

A Closer Look at Bias and Chain-of-Thought Faithfulness of Large (Vision) Language Models

May 29, 2025
Via arXiv

Localizing Knowledge in Diffusion Transformers

May 24, 2025
Via arXiv

Gaming Tool Preferences in Agentic LLMs

May 23, 2025
Via arXiv

Chain-of-Defensive-Thought: Structured Reasoning Elicits Robustness in Large Language Models against Reference Corruption

Apr 29, 2025
Via arXiv

How Learnable Grids Recover Fine Detail in Low Dimensions: A Neural Tangent Kernel Analysis of Multigrid Parametric Encodings

Apr 18, 2025
Via arXiv

RePanda: Pandas-powered Tabular Verification and Reasoning

Mar 14, 2025
Via arXiv

Seeing What's Not There: Spurious Correlation in Multimodal LLMs

Mar 11, 2025
Via arXiv

A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models

Feb 22, 2025
Via arXiv