Picture for Aaron David Tucker

Aaron David Tucker

Interpreting learned search: finding a transition model and value function in an RNN that plays Sokoban

Add code
Jun 11, 2025
Figure 1 for Interpreting learned search: finding a transition model and value function in an RNN that plays Sokoban
Figure 2 for Interpreting learned search: finding a transition model and value function in an RNN that plays Sokoban
Figure 3 for Interpreting learned search: finding a transition model and value function in an RNN that plays Sokoban
Figure 4 for Interpreting learned search: finding a transition model and value function in an RNN that plays Sokoban
Viaarxiv icon

Variance-Optimal Augmentation Logging for Counterfactual Evaluation in Contextual Bandits

Add code
Feb 03, 2022
Viaarxiv icon