Algorithms for Causal Reasoning in Probability Trees

Nov 12, 2020
Tim Genewein, Tom McGrath, Grégoire Delétang, Vladimir Mikulik, Miljan Martic, Shane Legg, Pedro A. Ortega

* (2nd version with correction to algorithm) 11 pages, 8 figures, 5 algorithms. A companion Colaboratory tutorial is available at https://github.com/deepmind/deepmind-research/tree/master/causal_reasoning 

Meta-trained agents implement Bayes-optimal agents

Oct 21, 2020
Vladimir Mikulik, Grégoire Delétang, Tom McGrath, Tim Genewein, Miljan Martic, Shane Legg, Pedro A. Ortega

* Published at 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada 

Action and Perception as Divergence Minimization

Oct 05, 2020
Danijar Hafner, Pedro A. Ortega, Jimmy Ba, Thomas Parr, Karl Friston, Nicolas Heess

* 14 pages, 10 figures, 2 tables 

Meta reinforcement learning as task inference

May 15, 2019
Jan Humplik, Alexandre Galashov, Leonard Hasenclever, Pedro A. Ortega, Yee Whye Teh, Nicolas Heess


Meta-learning of Sequential Strategies

May 08, 2019
Pedro A. Ortega, Jane X. Wang, Mark Rowland, Tim Genewein, Zeb Kurth-Nelson, Razvan Pascanu, Nicolas Heess, Joel Veness, Alex Pritzel, Pablo Sprechmann, Siddhant M. Jayakumar, Tom McGrath, Kevin Miller, Mohammad Azar, Ian Osband, Neil Rabinowitz, András György, Silvia Chiappa, Simon Osindero, Yee Whye Teh, Hado van Hasselt, Nando de Freitas, Matthew Botvinick, Shane Legg

* DeepMind Technical Report (15 pages, 6 figures) 

Understanding Agent Incentives using Causal Influence Diagrams. Part I: Single Action Settings

Mar 12, 2019
Tom Everitt, Pedro A. Ortega, Elizabeth Barnes, Shane Legg


Intrinsic Social Motivation via Causal Influence in Multi-Agent RL

Oct 19, 2018
Natasha Jaques, Angeliki Lazaridou, Edward Hughes, Caglar Gulcehre, Pedro A. Ortega, DJ Strouse, Joel Z. Leibo, Nando de Freitas


Modeling Friends and Foes

Jun 30, 2018
Pedro A. Ortega, Shane Legg

* 13 pages, 9 figures 

AI Safety Gridworlds

Nov 28, 2017
Jan Leike, Miljan Martic, Victoria Krakovna, Pedro A. Ortega, Tom Everitt, Andrew Lefrancq, Laurent Orseau, Shane Legg


Human Decision-Making under Limited Time

Oct 06, 2016
Pedro A. Ortega, Alan A. Stocker

* 9 pages, 4 figures, NIPS Advances in Neural Information Processing Systems 29, 2016 

Memory shapes time perception and intertemporal choices

May 29, 2016
Pedro A. Ortega, Naftali Tishby

* 24 pages, 4 figures, 2 tables. Submitted 

Information-Theoretic Bounded Rationality

Dec 21, 2015
Pedro A. Ortega, Daniel A. Braun, Justin Dyer, Kee-Eung Kim, Naftali Tishby

* 47 pages, 19 figures 

Belief Flows of Robust Online Learning

May 26, 2015
Pedro A. Ortega, Koby Crammer, Daniel D. Lee

* Appears in Workshop on Information Theory and Applications (ITA), February 2015 

Subjectivity, Bayesianism, and Causality

Apr 24, 2015
Pedro A. Ortega

* 21 pages, 21 figures. Submitted to Special Issue of Pattern Recognition Letters on "Philosophical aspects of pattern recognition" 

An Adversarial Interpretation of Information-Theoretic Bounded Rationality

Apr 22, 2014
Pedro A. Ortega, Daniel D. Lee

* 7 pages, 4 figures. Proceedings of AAAI-14 

Generalized Thompson Sampling for Sequential Decision-Making and Causal Inference

Mar 18, 2013
Pedro A. Ortega, Daniel A. Braun

* Complex Adaptive Systems Modeling 2014, 2:2 
* 28 pages, 5 figures 

A Nonparametric Conjugate Prior Distribution for the Maximizing Argument of a Noisy Function

Nov 10, 2012
Pedro A. Ortega, Jordi Grau-Moya, Tim Genewein, David Balduzzi, Daniel A. Braun

* Neural Information Processing Systems (NIPS) 2012 
* 9 pages, 5 figures 

Free Energy and the Generalized Optimality Equations for Sequential Decision Making

May 17, 2012
Pedro A. Ortega, Daniel A. Braun

* European Workshop on Reinforcement Learning 2012 
* 10 pages, 2 figures 

Bayesian Causal Induction

Nov 30, 2011
Pedro A. Ortega

* 4 pages, 4 figures; 2011 NIPS Workshop on Philosophy and Machine Learning 

Information, Utility & Bounded Rationality

Jul 28, 2011
Pedro A. Ortega, Daniel A. Braun

* The Fourth Conference on General Artificial Intelligence (AGI-11), 2011 
* 10 pages. The original publication is available at www.springerlink.com 

An axiomatic formalization of bounded rationality based on a utility-information equivalence

Jul 06, 2010
Pedro A. Ortega, Daniel A. Braun

* 22 pages, 4 figures, 1 table 

A Minimum Relative Entropy Principle for Learning and Acting

Apr 11, 2010
Pedro A. Ortega, Daniel A. Braun

* 36 pages, 11 figures 

Convergence of Bayesian Control Rule

Feb 16, 2010
Pedro A. Ortega, Daniel A. Braun

* 8 pages, 7 figures 

A Minimum Relative Entropy Controller for Undiscounted Markov Decision Processes

Feb 07, 2010
Pedro A. Ortega, Daniel A. Braun

* 8 pages, 3 figures, 3 tables 

A Bayesian Rule for Adaptive Control based on Causal Interventions

Dec 30, 2009
Pedro A. Ortega, Daniel A. Braun

* AGI-2010. 6 pages, 2 figures 

A conversion between utility and information

Dec 30, 2009
Pedro A. Ortega, Daniel A. Braun

* AGI-2010. 6 pages, 1 figure 
