Stephen McAleer

Anytime PSRO for Two-Player Zero-Sum Games


Jan 28, 2022
Stephen McAleer, Kevin Wang, John Lanier, Marc Lanctot, Pierre Baldi, Tuomas Sandholm, Roy Fox

* Published in AAAI Reinforcement Learning in Games Workshop 


Anytime Optimal PSRO for Two-Player Zero-Sum Games


Jan 19, 2022
Stephen McAleer, Kevin Wang, Marc Lanctot, John Lanier, Pierre Baldi, Roy Fox

* Published in AAAI Reinforcement Learning in Games Workshop 


Target Entropy Annealing for Discrete Soft Actor-Critic


Dec 06, 2021
Yaosheng Xu, Dailin Hu, Litian Liang, Stephen McAleer, Pieter Abbeel, Roy Fox

* NeurIPS 2021 Deep RL Workshop


Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates


Oct 28, 2021
Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox

* Accepted to Deep Reinforcement Learning Workshop @ NeurIPS 2021 


Independent Natural Policy Gradient Always Converges in Markov Potential Games


Oct 20, 2021
Roy Fox, Stephen McAleer, Will Overman, Ioannis Panageas

* 24 pages 


Improving Social Welfare While Preserving Autonomy via a Pareto Mediator


Jun 07, 2021
Stephen McAleer, John Lanier, Michael Dennis, Pierre Baldi, Roy Fox



Discovering Multi-Agent Auto-Curricula in Two-Player Zero-Sum Games


Jun 04, 2021
Xidong Feng, Oliver Slumbers, Yaodong Yang, Ziyu Wan, Bo Liu, Stephen McAleer, Ying Wen, Jun Wang

* corresponding to  


XDO: A Double Oracle Algorithm for Extensive-Form Games


Mar 11, 2021
Stephen McAleer, John Lanier, Pierre Baldi, Roy Fox



A* Search Without Expansions: Learning Heuristic Functions with Deep Q-Networks


Feb 08, 2021
Forest Agostinelli, Alexander Shmakov, Stephen McAleer, Roy Fox, Pierre Baldi



Deep machine learning-assisted multiphoton microscopy to reduce light exposure and expedite imaging


Nov 10, 2020
Stephen McAleer, Alex Fast, Yuntian Xue, Magdalene Seiler, William Tang, Mihaela Balu, Pierre Baldi, Andrew W. Browne



Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games


Jun 15, 2020
Stephen McAleer, John Lanier, Roy Fox, Pierre Baldi

* SM and JL contributed equally 


Curiosity-Driven Multi-Criteria Hindsight Experience Replay


Jun 09, 2019
John B. Lanier, Stephen McAleer, Pierre Baldi

* 14 pages 


Solving the Rubik's Cube Without Human Knowledge


May 18, 2018
Stephen McAleer, Forest Agostinelli, Alexander Shmakov, Pierre Baldi

* First three authors contributed equally. Submitted to NIPS 2018 
