Joshua Romoff

Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play

Nov 28, 2023
Daniel Bairamian, Philippe Marcotte, Joshua Romoff, Gabriel Robert, Derek Nowrouzezahrai

Improving Intrinsic Exploration by Creating Stationary Objectives

Nov 03, 2023
Roger Creus Castanyer, Joshua Romoff, Glen Berseth

Learning Computational Efficient Bots with Costly Features

Aug 18, 2023
Anthony Kobanda, Valliappan C. A., Joshua Romoff, Ludovic Denoyer

Direct Behavior Specification via Constrained Reinforcement Learning

Jan 19, 2022
Julien Roy, Roger Girgis, Joshua Romoff, Pierre-Luc Bacon, Christopher Pal

Graph augmented Deep Reinforcement Learning in the GameRLand3D environment

Dec 22, 2021
Edward Beeching, Maxim Peter, Philippe Marcotte, Jilles Debangoye, Olivier Simonin, Joshua Romoff, Christian Wolf

Deep Reinforcement Learning for Navigation in AAA Video Games

Nov 09, 2020
Eloi Alonso, Maxim Peter, David Goumard, Joshua Romoff

TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?

Jul 06, 2020
Joshua Romoff, Peter Henderson, David Kanaa, Emmanuel Bengio, Ahmed Touati, Pierre-Luc Bacon, Joelle Pineau

Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning

Jan 31, 2020
Peter Henderson, Jieru Hu, Joshua Romoff, Emma Brunskill, Dan Jurafsky, Joelle Pineau
