Title: Deep reinforcement learning under signal temporal logic constraints using Lagrangian relaxation

Jan 29, 2022

Junya Ikemoto, Toshimitsu Ushio

Abstract: Deep reinforcement learning (DRL) has attracted much attention as an approach to solving sequential decision-making problems without mathematical models of systems or environments. In general, constraints may be imposed on the decision making. In this study, we consider optimal decision-making problems with constraints that require completing temporal high-level tasks in the continuous state-action domain. We describe the constraints using signal temporal logic (STL), which is useful for time-sensitive control tasks since it can specify properties of continuous signals within a bounded time interval. To deal with the STL constraints, we introduce an extended constrained Markov decision process (CMDP), called a $\tau$-CMDP. We formulate the STL-constrained optimal decision-making problem as a $\tau$-CMDP and propose a two-phase constrained DRL algorithm using the Lagrangian relaxation method. Through simulations, we also demonstrate the learning performance of the proposed algorithm.
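
As a rough illustration of the Lagrangian relaxation idea mentioned in the abstract, the sketch below solves a one-dimensional toy problem with a primal-dual gradient update. It is not the authors' two-phase algorithm and involves no neural networks, STL formulas, or $\tau$-CMDP; the objective `f`, the constraint `g`, the function name `solve_toy_lagrangian`, and the step size are all assumptions chosen for illustration. The decision variable climbs the Lagrangian while the multiplier is pushed up whenever the constraint is violated and projected back to be non-negative.

```python
def solve_toy_lagrangian(steps: int = 2000, lr: float = 0.05):
    """Primal-dual gradient updates on a toy constrained problem.

    Hypothetical problem (not from the paper):
        maximize   f(theta) = -(theta - 2)^2
        subject to g(theta) = 1 - theta >= 0

    Lagrangian: L(theta, lam) = f(theta) + lam * g(theta), with lam >= 0.
    We ascend L in theta and descend L in lam.
    """
    theta, lam = 0.0, 0.0
    for _ in range(steps):
        # Gradient of L with respect to theta: f'(theta) + lam * g'(theta).
        grad_theta = -2.0 * (theta - 2.0) - lam
        # Gradient of L with respect to lam is the constraint value g(theta).
        g = 1.0 - theta
        # Simultaneous update: ascent on theta, projected descent on lam.
        theta = theta + lr * grad_theta
        lam = max(0.0, lam - lr * g)
    return theta, lam


if __name__ == "__main__":
    theta, lam = solve_toy_lagrangian()
    print(f"theta = {theta:.4f}, lambda = {lam:.4f}")
```

In this toy setting the unconstrained optimum (theta = 2) violates the constraint, so the multiplier grows until the iterate settles on the constraint boundary. In a constrained DRL setting the same pattern would presumably apply to policy parameters and an expected constraint return, but that mapping is an assumption here, not something the abstract spells out.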

* 9 pages, 10 figures. arXiv admin note: text overlap with arXiv:2108.01317