Alert button
Picture for Kenshi Abe

Kenshi Abe

Alert button

Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment

Add code
Bookmark button
Alert button
Apr 05, 2024
Yuu Jinnai, Tetsuro Morimura, Kaito Ariu, Kenshi Abe

Viaarxiv icon

Scalable and Provably Fair Exposure Control for Large-Scale Recommender Systems

Add code
Bookmark button
Alert button
Feb 22, 2024
Riku Togashi, Kenshi Abe, Yuta Saito

Viaarxiv icon

Return-Aligned Decision Transformer

Add code
Bookmark button
Alert button
Feb 06, 2024
Tsunehiko Tanaka, Kenshi Abe, Kaito Ariu, Tetsuro Morimura, Edgar Simo-Serra

Viaarxiv icon

Learning Fair Division from Bandit Feedback

Add code
Bookmark button
Alert button
Nov 15, 2023
Hakuei Yamada, Junpei Komiyama, Kenshi Abe, Atsushi Iwasaki

Viaarxiv icon

Model-Based Minimum Bayes Risk Decoding

Add code
Bookmark button
Alert button
Nov 09, 2023
Yuu Jinnai, Tetsuro Morimura, Ukyo Honda, Kaito Ariu, Kenshi Abe

Viaarxiv icon

Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative

Add code
Bookmark button
Alert button
Jul 13, 2023
Sho Shimoyama, Tetsuro Morimura, Kenshi Abe, Toda Takamichi, Yuta Tomomatsu, Masakazu Sugiyama, Asahi Hentona, Yuuki Azuma, Hirotaka Ninomiya

Figure 1 for Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative
Figure 2 for Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative
Figure 3 for Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative
Figure 4 for Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative
Viaarxiv icon

A Slingshot Approach to Learning in Monotone Games

Add code
Bookmark button
Alert button
May 26, 2023
Kenshi Abe, Kaito Ariu, Mitsuki Sakamoto, Atsushi Iwasaki

Figure 1 for A Slingshot Approach to Learning in Monotone Games
Figure 2 for A Slingshot Approach to Learning in Monotone Games
Figure 3 for A Slingshot Approach to Learning in Monotone Games
Figure 4 for A Slingshot Approach to Learning in Monotone Games
Viaarxiv icon

Exploration of Unranked Items in Safe Online Learning to Re-Rank

Add code
Bookmark button
Alert button
May 02, 2023
Hiroaki Shiino, Kaito Ariu, Kenshi Abe, Togashi Riku

Figure 1 for Exploration of Unranked Items in Safe Online Learning to Re-Rank
Figure 2 for Exploration of Unranked Items in Safe Online Learning to Re-Rank
Figure 3 for Exploration of Unranked Items in Safe Online Learning to Re-Rank
Viaarxiv icon

Fair Matrix Factorisation for Large-Scale Recommender Systems

Add code
Bookmark button
Alert button
Sep 09, 2022
Riku Togashi, Kenshi Abe

Figure 1 for Fair Matrix Factorisation for Large-Scale Recommender Systems
Figure 2 for Fair Matrix Factorisation for Large-Scale Recommender Systems
Figure 3 for Fair Matrix Factorisation for Large-Scale Recommender Systems
Figure 4 for Fair Matrix Factorisation for Large-Scale Recommender Systems
Viaarxiv icon

Last-Iterate Convergence with Full- and Noisy-Information Feedback in Two-Player Zero-Sum Games

Add code
Bookmark button
Alert button
Aug 21, 2022
Kenshi Abe, Kaito Ariu, Mitsuki Sakamoto, Kentaro Toyoshima, Atsushi Iwasaki

Figure 1 for Last-Iterate Convergence with Full- and Noisy-Information Feedback in Two-Player Zero-Sum Games
Figure 2 for Last-Iterate Convergence with Full- and Noisy-Information Feedback in Two-Player Zero-Sum Games
Figure 3 for Last-Iterate Convergence with Full- and Noisy-Information Feedback in Two-Player Zero-Sum Games
Figure 4 for Last-Iterate Convergence with Full- and Noisy-Information Feedback in Two-Player Zero-Sum Games
Viaarxiv icon