Yoshihiro Okawa

Safe Exploration Method for Reinforcement Learning under Existence of Disturbance

Sep 30, 2022
Yoshihiro Okawa, Tomotake Sasaki, Hitoshi Yanami, Toru Namerikawa

Via arXiv

Model-free two-step design for improving transient learning performance in nonlinear optimal regulator problems

Mar 05, 2021
Yuka Masumoto, Yoshihiro Okawa, Tomotake Sasaki, Yutaka Hori

Via arXiv

Automatic Exploration Process Adjustment for Safe Reinforcement Learning with Joint Chance Constraint Satisfaction

Mar 05, 2021
Yoshihiro Okawa, Tomotake Sasaki, Hidenao Iwane

Via arXiv