Akifumi Wachi

Stepwise Alignment for Constrained Language Model Policy Optimization

Apr 17, 2024
Akifumi Wachi, Thien Q Tran, Rei Sato, Takumi Tanabe, Youhei Akimoto

A Survey of Constraint Formulations in Safe Reinforcement Learning

Feb 03, 2024
Akifumi Wachi, Xun Shen, Yanan Sui

Long-term Safe Reinforcement Learning with Binary Feedback

Jan 11, 2024
Akifumi Wachi, Wataru Hashimoto, Kazumune Hashimoto

Verbosity Bias in Preference Labeling by Large Language Models

Oct 16, 2023
Keita Saito, Akifumi Wachi, Koki Wataoka, Youhei Akimoto

Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms

Oct 05, 2023
Akifumi Wachi, Wataru Hashimoto, Xun Shen, Kazumune Hashimoto

Safe Policy Optimization with Local Generalized Linear Function Approximations

Nov 09, 2021
Akifumi Wachi, Yunyue Wei, Yanan Sui

LOA: Logical Optimal Actions for Text-based Interaction Games

Oct 21, 2021
Daiki Kimura, Subhajit Chaudhury, Masaki Ono, Michiaki Tatsubori, Don Joven Agravante, Asim Munawar, Akifumi Wachi, Ryosuke Kohita, Alexander Gray

Neuro-Symbolic Reinforcement Learning with First-Order Logic

Oct 21, 2021
Daiki Kimura, Masaki Ono, Subhajit Chaudhury, Ryosuke Kohita, Akifumi Wachi, Don Joven Agravante, Michiaki Tatsubori, Asim Munawar, Alexander Gray

Reinforcement Learning with External Knowledge by using Logical Neural Networks

Mar 03, 2021
Daiki Kimura, Subhajit Chaudhury, Akifumi Wachi, Ryosuke Kohita, Asim Munawar, Michiaki Tatsubori, Alexander Gray

Q-learning with Language Model for Edit-based Unsupervised Summarization

Oct 09, 2020
Ryosuke Kohita, Akifumi Wachi, Yang Zhao, Ryuki Tachibana
