Picture for Josiah P. Hanna

Josiah P. Hanna

Abstract Sim2Real through Approximate Information States

Add code
Apr 16, 2026
Viaarxiv icon

Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation

Add code
May 28, 2025
Viaarxiv icon

Multi-Robot Collaboration through Reinforcement Learning and Abstract Simulation

Add code
Mar 07, 2025
Figure 1 for Multi-Robot Collaboration through Reinforcement Learning and Abstract Simulation
Figure 2 for Multi-Robot Collaboration through Reinforcement Learning and Abstract Simulation
Figure 3 for Multi-Robot Collaboration through Reinforcement Learning and Abstract Simulation
Figure 4 for Multi-Robot Collaboration through Reinforcement Learning and Abstract Simulation
Viaarxiv icon

Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer

Add code
Dec 12, 2024
Figure 1 for Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer
Figure 2 for Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer
Figure 3 for Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer
Figure 4 for Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer
Viaarxiv icon

Stable Offline Value Function Learning with Bisimulation-based Representations

Add code
Oct 02, 2024
Figure 1 for Stable Offline Value Function Learning with Bisimulation-based Representations
Figure 2 for Stable Offline Value Function Learning with Bisimulation-based Representations
Figure 3 for Stable Offline Value Function Learning with Bisimulation-based Representations
Figure 4 for Stable Offline Value Function Learning with Bisimulation-based Representations
Viaarxiv icon

Reinforcement Learning via Auxiliary Task Distillation

Add code
Jun 24, 2024
Figure 1 for Reinforcement Learning via Auxiliary Task Distillation
Figure 2 for Reinforcement Learning via Auxiliary Task Distillation
Figure 3 for Reinforcement Learning via Auxiliary Task Distillation
Figure 4 for Reinforcement Learning via Auxiliary Task Distillation
Viaarxiv icon

Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning

Add code
Jun 07, 2024
Figure 1 for Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Figure 2 for Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Figure 3 for Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Figure 4 for Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Viaarxiv icon

SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP

Add code
Jun 04, 2024
Figure 1 for SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Figure 2 for SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Figure 3 for SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Figure 4 for SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Viaarxiv icon

Adaptive Exploration for Data-Efficient General Value Function Evaluations

Add code
May 13, 2024
Viaarxiv icon

On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling

Add code
Nov 14, 2023
Viaarxiv icon