Picture for Akifumi Wachi

Akifumi Wachi

Flipping-based Policy for Chance-Constrained Markov Decision Processes

Add code
Oct 09, 2024
Viaarxiv icon

Stepwise Alignment for Constrained Language Model Policy Optimization

Add code
Apr 17, 2024
Viaarxiv icon

A Survey of Constraint Formulations in Safe Reinforcement Learning

Add code
Feb 03, 2024
Viaarxiv icon

Long-term Safe Reinforcement Learning with Binary Feedback

Add code
Jan 11, 2024
Viaarxiv icon

Verbosity Bias in Preference Labeling by Large Language Models

Add code
Oct 16, 2023
Viaarxiv icon

Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms

Add code
Oct 05, 2023
Viaarxiv icon

Safe Policy Optimization with Local Generalized Linear Function Approximations

Add code
Nov 09, 2021
Figure 1 for Safe Policy Optimization with Local Generalized Linear Function Approximations
Figure 2 for Safe Policy Optimization with Local Generalized Linear Function Approximations
Figure 3 for Safe Policy Optimization with Local Generalized Linear Function Approximations
Figure 4 for Safe Policy Optimization with Local Generalized Linear Function Approximations
Viaarxiv icon

LOA: Logical Optimal Actions for Text-based Interaction Games

Add code
Oct 21, 2021
Figure 1 for LOA: Logical Optimal Actions for Text-based Interaction Games
Figure 2 for LOA: Logical Optimal Actions for Text-based Interaction Games
Figure 3 for LOA: Logical Optimal Actions for Text-based Interaction Games
Figure 4 for LOA: Logical Optimal Actions for Text-based Interaction Games
Viaarxiv icon

Neuro-Symbolic Reinforcement Learning with First-Order Logic

Add code
Oct 21, 2021
Figure 1 for Neuro-Symbolic Reinforcement Learning with First-Order Logic
Figure 2 for Neuro-Symbolic Reinforcement Learning with First-Order Logic
Figure 3 for Neuro-Symbolic Reinforcement Learning with First-Order Logic
Viaarxiv icon

Reinforcement Learning with External Knowledge by using Logical Neural Networks

Add code
Mar 03, 2021
Figure 1 for Reinforcement Learning with External Knowledge by using Logical Neural Networks
Figure 2 for Reinforcement Learning with External Knowledge by using Logical Neural Networks
Viaarxiv icon