Picture for Wilson Wu

Wilson Wu

Bayesian Influence Functions for Hessian-Free Data Attribution

Add code
Sep 30, 2025
Figure 1 for Bayesian Influence Functions for Hessian-Free Data Attribution
Figure 2 for Bayesian Influence Functions for Hessian-Free Data Attribution
Figure 3 for Bayesian Influence Functions for Hessian-Free Data Attribution
Figure 4 for Bayesian Influence Functions for Hessian-Free Data Attribution
Viaarxiv icon

Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations

Add code
Oct 09, 2024
Figure 1 for Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations
Figure 2 for Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations
Figure 3 for Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations
Figure 4 for Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations
Viaarxiv icon

Do language models plan ahead for future tokens?

Add code
Apr 01, 2024
Figure 1 for Do language models plan ahead for future tokens?
Figure 2 for Do language models plan ahead for future tokens?
Figure 3 for Do language models plan ahead for future tokens?
Figure 4 for Do language models plan ahead for future tokens?
Viaarxiv icon

Learning Deterministic Finite Automata from Confidence Oracles

Add code
Nov 18, 2023
Viaarxiv icon

Generating Semantic Adversarial Examples with Differentiable Rendering

Add code
Oct 02, 2019
Figure 1 for Generating Semantic Adversarial Examples with Differentiable Rendering
Figure 2 for Generating Semantic Adversarial Examples with Differentiable Rendering
Figure 3 for Generating Semantic Adversarial Examples with Differentiable Rendering
Figure 4 for Generating Semantic Adversarial Examples with Differentiable Rendering
Viaarxiv icon