Picture for Roger Grosse

Roger Grosse

Measuring Stochastic Data Complexity with Boltzmann Influence Functions

Add code
Jun 04, 2024
Viaarxiv icon

What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions

Add code
May 22, 2024
Viaarxiv icon

Training Data Attribution via Approximate Unrolled Differentiation

Add code
May 21, 2024
Viaarxiv icon

Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo

Add code
Apr 26, 2024
Viaarxiv icon

REFACTOR: Learning to Extract Theorems from Proofs

Add code
Feb 26, 2024
Viaarxiv icon

Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

Add code
Jan 17, 2024
Viaarxiv icon

Studying Large Language Model Generalization with Influence Functions

Add code
Aug 07, 2023
Viaarxiv icon

Improving Mutual Information Estimation with Annealed and Energy-Based Bounds

Add code
Mar 13, 2023
Viaarxiv icon

Efficient Parametric Approximations of Neural Network Function Space Distance

Add code
Feb 07, 2023
Viaarxiv icon

On Implicit Bias in Overparameterized Bilevel Optimization

Add code
Dec 28, 2022
Viaarxiv icon