
Takumi Tanabe

Stepwise Alignment for Constrained Language Model Policy Optimization

Apr 17, 2024
Akifumi Wachi, Thien Q Tran, Rei Sato, Takumi Tanabe, Youhei Akimoto

Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification

Nov 07, 2022
Takumi Tanabe, Rei Sato, Kazuto Fukuchi, Jun Sakuma, Youhei Akimoto

Level Generation for Angry Birds with Sequential VAE and Latent Variable Evolution

Apr 13, 2021
Takumi Tanabe, Kazuto Fukuchi, Jun Sakuma, Youhei Akimoto
