Picture for Joshua Romoff

Joshua Romoff

Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play

Add code
Nov 28, 2023
Figure 1 for Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play
Figure 2 for Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play
Figure 3 for Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play
Figure 4 for Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play
Viaarxiv icon

Improving Intrinsic Exploration by Creating Stationary Objectives

Add code
Nov 03, 2023
Viaarxiv icon

Learning Computational Efficient Bots with Costly Features

Add code
Aug 18, 2023
Figure 1 for Learning Computational Efficient Bots with Costly Features
Figure 2 for Learning Computational Efficient Bots with Costly Features
Figure 3 for Learning Computational Efficient Bots with Costly Features
Figure 4 for Learning Computational Efficient Bots with Costly Features
Viaarxiv icon

Direct Behavior Specification via Constrained Reinforcement Learning

Add code
Jan 19, 2022
Figure 1 for Direct Behavior Specification via Constrained Reinforcement Learning
Figure 2 for Direct Behavior Specification via Constrained Reinforcement Learning
Figure 3 for Direct Behavior Specification via Constrained Reinforcement Learning
Figure 4 for Direct Behavior Specification via Constrained Reinforcement Learning
Viaarxiv icon

Graph augmented Deep Reinforcement Learning in the GameRLand3D environment

Add code
Dec 22, 2021
Figure 1 for Graph augmented Deep Reinforcement Learning in the GameRLand3D environment
Figure 2 for Graph augmented Deep Reinforcement Learning in the GameRLand3D environment
Figure 3 for Graph augmented Deep Reinforcement Learning in the GameRLand3D environment
Figure 4 for Graph augmented Deep Reinforcement Learning in the GameRLand3D environment
Viaarxiv icon

Deep Reinforcement Learning for Navigation in AAA Video Games

Add code
Nov 09, 2020
Figure 1 for Deep Reinforcement Learning for Navigation in AAA Video Games
Figure 2 for Deep Reinforcement Learning for Navigation in AAA Video Games
Figure 3 for Deep Reinforcement Learning for Navigation in AAA Video Games
Figure 4 for Deep Reinforcement Learning for Navigation in AAA Video Games
Viaarxiv icon

TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?

Add code
Jul 06, 2020
Figure 1 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Figure 2 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Figure 3 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Figure 4 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Viaarxiv icon

Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning

Add code
Jan 31, 2020
Figure 1 for Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning
Figure 2 for Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning
Figure 3 for Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning
Figure 4 for Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning
Viaarxiv icon

Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning

Add code
Jun 09, 2019
Figure 1 for Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning
Figure 2 for Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning
Figure 3 for Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning
Figure 4 for Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning
Viaarxiv icon

Separating value functions across time-scales

Add code
Feb 08, 2019
Figure 1 for Separating value functions across time-scales
Figure 2 for Separating value functions across time-scales
Figure 3 for Separating value functions across time-scales
Figure 4 for Separating value functions across time-scales
Viaarxiv icon