Picture for Kevin Du

Kevin Du

Efficiently Computing Susceptibility to Context in Language Models

Add code
Oct 18, 2024
Viaarxiv icon

Activation Scaling for Steering and Interpreting Language Models

Add code
Oct 07, 2024
Viaarxiv icon

Context versus Prior Knowledge in Language Models

Add code
Apr 06, 2024
Viaarxiv icon

Grammatical Gender's Influence on Distributional Semantics: A Causal Perspective

Add code
Nov 30, 2023
Viaarxiv icon

Generalizing Backpropagation for Gradient-Based Interpretability

Add code
Jul 06, 2023
Viaarxiv icon

AlphaSnake: Policy Iteration on a Nondeterministic NP-hard Markov Decision Process

Add code
Nov 17, 2022
Viaarxiv icon