Aaron David Tucker

Interpreting learned search: finding a transition model and value function in an RNN that plays Sokoban

Jun 11, 2025
Via arXiv

Variance-Optimal Augmentation Logging for Counterfactual Evaluation in Contextual Bandits

Feb 03, 2022
Via arXiv