Yuta Saito

Scalable and Provably Fair Exposure Control for Large-Scale Recommender Systems

Feb 22, 2024
Riku Togashi, Kenshi Abe, Yuta Saito

Via arXiv

POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition

Feb 09, 2024
Yuta Saito, Jihan Yao, Thorsten Joachims

Via arXiv

Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction

Feb 03, 2024
Haruka Kiyohara, Masahiro Nomura, Yuta Saito

Via arXiv

SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation

Dec 04, 2023
Haruka Kiyohara, Ren Kishimoto, Kosuke Kawakami, Ken Kobayashi, Kazuhide Nakata, Yuta Saito

Via arXiv

Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation

Dec 04, 2023
Haruka Kiyohara, Ren Kishimoto, Kosuke Kawakami, Ken Kobayashi, Kazuhide Nakata, Yuta Saito

Via arXiv

Off-Policy Evaluation of Ranking Policies under Diverse User Behavior

Jun 26, 2023
Haruka Kiyohara, Masatoshi Uehara, Yusuke Narita, Nobuyuki Shimizu, Yasuo Yamamoto, Yuta Saito

Via arXiv

Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling

May 14, 2023
Yuta Saito, Qingyang Ren, Thorsten Joachims

Via arXiv

Policy-Adaptive Estimator Selection for Off-Policy Evaluation

Nov 25, 2022
Takuma Udagawa, Haruka Kiyohara, Yusuke Narita, Yuta Saito, Kei Tateno

Via arXiv