Josiah Hanna

Future Prediction Can be a Strong Evidence of Good History Representation in Partially Observable Environments

Feb 11, 2024
Jeongyeol Kwon, Liu Yang, Robert Nowak, Josiah Hanna

SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits

Jan 29, 2023
Subhojyoti Mukherjee, Qiaomin Xie, Josiah Hanna, Robert Nowak

Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning

Jul 12, 2022
Mhairi Dunion, Trevor McInroe, Kevin Luck, Josiah Hanna, Stefano V. Albrecht

Multi-agent Databases via Independent Learning

May 28, 2022
Chi Zhang, Olga Papaemmanouil, Josiah Hanna

Decoupling Exploration and Exploitation in Reinforcement Learning

Jul 22, 2021
Lukas Schäfer, Filippos Christianos, Josiah Hanna, Stefano V. Albrecht

Reducing Sampling Error in Batch Temporal Difference Learning

Aug 15, 2020
Brahma Pavse, Ishan Durugkar, Josiah Hanna, Peter Stone

An Imitation from Observation Approach to Sim-to-Real Transfer

Aug 04, 2020
Siddharth Desai, Ishan Durugkar, Haresh Karnan, Garrett Warnell, Josiah Hanna, Peter Stone

Learning an Interpretable Traffic Signal Control Policy

Dec 23, 2019
James Ault, Josiah Hanna, Guni Sharon

Importance Sampling Policy Evaluation with an Estimated Behavior Policy

Sep 24, 2018
Josiah Hanna, Scott Niekum, Peter Stone
