Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Pedro A. Ortega

Causal Analysis of Agent Behavior for AI Safety


Mar 05, 2021
Grégoire Déletang, Jordi Grau-Moya, Miljan Martic, Tim Genewein, Tom McGrath, Vladimir Mikulik, Markus Kunesch, Shane Legg, Pedro A. Ortega

* 16 pages, 16 figures, 6 tables 

  Access Paper or Ask Questions

Algorithms for Causal Reasoning in Probability Trees


Nov 12, 2020
Tim Genewein, Tom McGrath, Grégoire Déletang, Vladimir Mikulik, Miljan Martic, Shane Legg, Pedro A. Ortega

* (2nd version with correction to algorithm) 11 pages, 8 figures, 5 algorithms. A companion Colaboratory tutorial is available at https://github.com/deepmind/deepmind-research/tree/master/causal_reasoning 

  Access Paper or Ask Questions

Meta-trained agents implement Bayes-optimal agents


Oct 21, 2020
Vladimir Mikulik, Grégoire Delétang, Tom McGrath, Tim Genewein, Miljan Martic, Shane Legg, Pedro A. Ortega

* Published at 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada 

  Access Paper or Ask Questions

Action and Perception as Divergence Minimization


Oct 05, 2020
Danijar Hafner, Pedro A. Ortega, Jimmy Ba, Thomas Parr, Karl Friston, Nicolas Heess

* 14 pages, 10 figures, 2 tables 

  Access Paper or Ask Questions

Meta reinforcement learning as task inference


May 15, 2019
Jan Humplik, Alexandre Galashov, Leonard Hasenclever, Pedro A. Ortega, Yee Whye Teh, Nicolas Heess


  Access Paper or Ask Questions

Meta-learning of Sequential Strategies


May 08, 2019
Pedro A. Ortega, Jane X. Wang, Mark Rowland, Tim Genewein, Zeb Kurth-Nelson, Razvan Pascanu, Nicolas Heess, Joel Veness, Alex Pritzel, Pablo Sprechmann, Siddhant M. Jayakumar, Tom McGrath, Kevin Miller, Mohammad Azar, Ian Osband, Neil Rabinowitz, András György, Silvia Chiappa, Simon Osindero, Yee Whye Teh, Hado van Hasselt, Nando de Freitas, Matthew Botvinick, Shane Legg

* DeepMind Technical Report (15 pages, 6 figures) 

  Access Paper or Ask Questions

Understanding Agent Incentives using Causal Influence Diagrams. Part I: Single Action Settings


Mar 12, 2019
Tom Everitt, Pedro A. Ortega, Elizabeth Barnes, Shane Legg


  Access Paper or Ask Questions

Intrinsic Social Motivation via Causal Influence in Multi-Agent RL


Oct 19, 2018
Natasha Jaques, Angeliki Lazaridou, Edward Hughes, Caglar Gulcehre, Pedro A. Ortega, DJ Strouse, Joel Z. Leibo, Nando de Freitas


  Access Paper or Ask Questions

Modeling Friends and Foes


Jun 30, 2018
Pedro A. Ortega, Shane Legg

* 13 pages, 9 figures 

  Access Paper or Ask Questions

AI Safety Gridworlds


Nov 28, 2017
Jan Leike, Miljan Martic, Victoria Krakovna, Pedro A. Ortega, Tom Everitt, Andrew Lefrancq, Laurent Orseau, Shane Legg


  Access Paper or Ask Questions

Human Decision-Making under Limited Time


Oct 06, 2016
Pedro A. Ortega, Alan A. Stocker

* 9 pages, 4 figures, NIPS Advances in Neural Information Processing Systems 29, 2016 

  Access Paper or Ask Questions

Memory shapes time perception and intertemporal choices


May 29, 2016
Pedro A. Ortega, Naftali Tishby

* 24 pages, 4 figures, 2 tables. Submitted 

  Access Paper or Ask Questions

Information-Theoretic Bounded Rationality


Dec 21, 2015
Pedro A. Ortega, Daniel A. Braun, Justin Dyer, Kee-Eung Kim, Naftali Tishby

* 47 pages, 19 figures 

  Access Paper or Ask Questions

Belief Flows of Robust Online Learning


May 26, 2015
Pedro A. Ortega, Koby Crammer, Daniel D. Lee

* Appears in Workshop on Information Theory and Applications (ITA), February 2015 

  Access Paper or Ask Questions

Subjectivity, Bayesianism, and Causality


Apr 24, 2015
Pedro A. Ortega

* 21 pages, 21 figures. Submitted to Special Issue of Pattern Recognition Letters on "Philosophical aspects of pattern recognition" 

  Access Paper or Ask Questions

An Adversarial Interpretation of Information-Theoretic Bounded Rationality


Apr 22, 2014
Pedro A. Ortega, Daniel D. Lee

* 7 pages, 4 figures. Proceedings of AAAI-14 

  Access Paper or Ask Questions

Generalized Thompson Sampling for Sequential Decision-Making and Causal Inference


Mar 18, 2013
Pedro A. Ortega, Daniel A. Braun

* Complex Adaptive Systems Modeling 2014, 2:2 
* 28 pages, 5 figures 

  Access Paper or Ask Questions

A Nonparametric Conjugate Prior Distribution for the Maximizing Argument of a Noisy Function


Nov 10, 2012
Pedro A. Ortega, Jordi Grau-Moya, Tim Genewein, David Balduzzi, Daniel A. Braun

* Neural Information Processing Systems (NIPS) 2012 
* 9 pages, 5 figures 

  Access Paper or Ask Questions

Free Energy and the Generalized Optimality Equations for Sequential Decision Making


May 17, 2012
Pedro A. Ortega, Daniel A. Braun

* European Workshop on Reinforcement Learning 2012 
* 10 pages, 2 figures 

  Access Paper or Ask Questions

Bayesian Causal Induction


Nov 30, 2011
Pedro A. Ortega

* 4 pages, 4 figures; 2011 NIPS Workshop on Philosophy and Machine Learning 

  Access Paper or Ask Questions

Information, Utility & Bounded Rationality


Jul 28, 2011
Pedro A. Ortega, Daniel A. Braun

* The Fourth Conference on General Artificial Intelligence (AGI-11), 2011 
* 10 pages. The original publication is available at www.springerlink.com 

  Access Paper or Ask Questions

An axiomatic formalization of bounded rationality based on a utility-information equivalence


Jul 06, 2010
Pedro A. Ortega, Daniel A. Braun

* 22 pages, 4 figures, 1 table 

  Access Paper or Ask Questions

A Minimum Relative Entropy Principle for Learning and Acting


Apr 11, 2010
Pedro A. Ortega, Daniel A. Braun

* 36 pages, 11 figures 

  Access Paper or Ask Questions

Convergence of Bayesian Control Rule


Feb 16, 2010
Pedro A. Ortega, Daniel A. Braun

* 8 pages, 7 figures 

  Access Paper or Ask Questions

A Minimum Relative Entropy Controller for Undiscounted Markov Decision Processes


Feb 07, 2010
Pedro A. Ortega, Daniel A. Braun

* 8 pages, 3 figures, 3 tables 

  Access Paper or Ask Questions

A Bayesian Rule for Adaptive Control based on Causal Interventions


Dec 30, 2009
Pedro A. Ortega, Daniel A. Braun

* AGI-2010 
* AGI-2010. 6 pages, 2 figures 

  Access Paper or Ask Questions

A conversion between utility and information


Dec 30, 2009
Pedro A. Ortega, Daniel A. Braun

* AGI-2010 
* AGI-2010. 6 pages, 1 figure 

  Access Paper or Ask Questions