Jordi Grau-Moya

Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow

Apr 13, 2021
John McLeod, Hrvoje Stojic, Vincent Adam, Dongho Kim, Jordi Grau-Moya, Peter Vrancx, Felix Leibfried

Causal Analysis of Agent Behavior for AI Safety

Mar 05, 2021
Grégoire Déletang, Jordi Grau-Moya, Miljan Martic, Tim Genewein, Tom McGrath, Vladimir Mikulik, Markus Kunesch, Shane Legg, Pedro A. Ortega

Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning

Sep 11, 2019
Felix Leibfried, Jordi Grau-Moya

A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment

Sep 09, 2019
Felix Leibfried, Sergio Pascual-Diaz, Jordi Grau-Moya

Disentangled Skill Embeddings for Reinforcement Learning

Jun 21, 2019
Janith C. Petangoda, Sergio Pascual-Diaz, Vincent Adam, Peter Vrancx, Jordi Grau-Moya

Regularised Deep Reinforcement Learning with Guaranteed Convergence

Sep 06, 2018
Felix Leibfried, Rasul Tutunov, Jordi Grau-Moya, Haitham Bou-Ammar

Balancing Two-Player Stochastic Games with Soft Q-Learning

Feb 09, 2018
Jordi Grau-Moya, Felix Leibfried, Haitham Bou-Ammar

Planning with Information-Processing Constraints and Model Uncertainty in Markov Decision Processes

Apr 07, 2016
Jordi Grau-Moya, Felix Leibfried, Tim Genewein, Daniel A. Braun
