Alert button
Picture for Pedro A. Ortega

Pedro A. Ortega

Alert button

Neural Networks and the Chomsky Hierarchy

Add code
Bookmark button
Alert button
Jul 05, 2022
Grégoire Delétang, Anian Ruoss, Jordi Grau-Moya, Tim Genewein, Li Kevin Wenliang, Elliot Catt, Marcus Hutter, Shane Legg, Pedro A. Ortega

Figure 1 for Neural Networks and the Chomsky Hierarchy
Figure 2 for Neural Networks and the Chomsky Hierarchy
Figure 3 for Neural Networks and the Chomsky Hierarchy
Figure 4 for Neural Networks and the Chomsky Hierarchy
Viaarxiv icon

Model-Free Risk-Sensitive Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 04, 2021
Grégoire Delétang, Jordi Grau-Moya, Markus Kunesch, Tim Genewein, Rob Brekelmans, Shane Legg, Pedro A. Ortega

Figure 1 for Model-Free Risk-Sensitive Reinforcement Learning
Figure 2 for Model-Free Risk-Sensitive Reinforcement Learning
Figure 3 for Model-Free Risk-Sensitive Reinforcement Learning
Figure 4 for Model-Free Risk-Sensitive Reinforcement Learning
Viaarxiv icon

Shaking the foundations: delusions in sequence models for interaction and control

Add code
Bookmark button
Alert button
Oct 20, 2021
Pedro A. Ortega, Markus Kunesch, Grégoire Delétang, Tim Genewein, Jordi Grau-Moya, Joel Veness, Jonas Buchli, Jonas Degrave, Bilal Piot, Julien Perolat, Tom Everitt, Corentin Tallec, Emilio Parisotto, Tom Erez, Yutian Chen, Scott Reed, Marcus Hutter, Nando de Freitas, Shane Legg

Figure 1 for Shaking the foundations: delusions in sequence models for interaction and control
Figure 2 for Shaking the foundations: delusions in sequence models for interaction and control
Figure 3 for Shaking the foundations: delusions in sequence models for interaction and control
Figure 4 for Shaking the foundations: delusions in sequence models for interaction and control
Viaarxiv icon

Causal Analysis of Agent Behavior for AI Safety

Add code
Bookmark button
Alert button
Mar 05, 2021
Grégoire Déletang, Jordi Grau-Moya, Miljan Martic, Tim Genewein, Tom McGrath, Vladimir Mikulik, Markus Kunesch, Shane Legg, Pedro A. Ortega

Figure 1 for Causal Analysis of Agent Behavior for AI Safety
Figure 2 for Causal Analysis of Agent Behavior for AI Safety
Figure 3 for Causal Analysis of Agent Behavior for AI Safety
Figure 4 for Causal Analysis of Agent Behavior for AI Safety
Viaarxiv icon

Algorithms for Causal Reasoning in Probability Trees

Add code
Bookmark button
Alert button
Nov 12, 2020
Tim Genewein, Tom McGrath, Grégoire Déletang, Vladimir Mikulik, Miljan Martic, Shane Legg, Pedro A. Ortega

Figure 1 for Algorithms for Causal Reasoning in Probability Trees
Figure 2 for Algorithms for Causal Reasoning in Probability Trees
Figure 3 for Algorithms for Causal Reasoning in Probability Trees
Figure 4 for Algorithms for Causal Reasoning in Probability Trees
Viaarxiv icon

Meta-trained agents implement Bayes-optimal agents

Add code
Bookmark button
Alert button
Oct 21, 2020
Vladimir Mikulik, Grégoire Delétang, Tom McGrath, Tim Genewein, Miljan Martic, Shane Legg, Pedro A. Ortega

Figure 1 for Meta-trained agents implement Bayes-optimal agents
Figure 2 for Meta-trained agents implement Bayes-optimal agents
Figure 3 for Meta-trained agents implement Bayes-optimal agents
Figure 4 for Meta-trained agents implement Bayes-optimal agents
Viaarxiv icon

Action and Perception as Divergence Minimization

Add code
Bookmark button
Alert button
Oct 05, 2020
Danijar Hafner, Pedro A. Ortega, Jimmy Ba, Thomas Parr, Karl Friston, Nicolas Heess

Figure 1 for Action and Perception as Divergence Minimization
Figure 2 for Action and Perception as Divergence Minimization
Figure 3 for Action and Perception as Divergence Minimization
Figure 4 for Action and Perception as Divergence Minimization
Viaarxiv icon

Meta reinforcement learning as task inference

Add code
Bookmark button
Alert button
May 15, 2019
Jan Humplik, Alexandre Galashov, Leonard Hasenclever, Pedro A. Ortega, Yee Whye Teh, Nicolas Heess

Figure 1 for Meta reinforcement learning as task inference
Figure 2 for Meta reinforcement learning as task inference
Figure 3 for Meta reinforcement learning as task inference
Figure 4 for Meta reinforcement learning as task inference
Viaarxiv icon