Picture for Yuta Saito

Yuta Saito

POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition

Add code
Feb 09, 2024
Figure 1 for POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition
Figure 2 for POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition
Figure 3 for POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition
Figure 4 for POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition
Viaarxiv icon

Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction

Add code
Feb 03, 2024
Figure 1 for Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction
Figure 2 for Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction
Figure 3 for Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction
Figure 4 for Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction
Viaarxiv icon

Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation

Add code
Dec 04, 2023
Figure 1 for Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation
Figure 2 for Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation
Figure 3 for Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation
Figure 4 for Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation
Viaarxiv icon

SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation

Add code
Dec 04, 2023
Figure 1 for SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation
Figure 2 for SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation
Figure 3 for SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation
Figure 4 for SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation
Viaarxiv icon

Off-Policy Evaluation of Ranking Policies under Diverse User Behavior

Add code
Jun 26, 2023
Viaarxiv icon

Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling

Add code
May 14, 2023
Figure 1 for Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling
Figure 2 for Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling
Figure 3 for Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling
Figure 4 for Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling
Viaarxiv icon

Policy-Adaptive Estimator Selection for Off-Policy Evaluation

Add code
Nov 25, 2022
Figure 1 for Policy-Adaptive Estimator Selection for Off-Policy Evaluation
Figure 2 for Policy-Adaptive Estimator Selection for Off-Policy Evaluation
Figure 3 for Policy-Adaptive Estimator Selection for Off-Policy Evaluation
Figure 4 for Policy-Adaptive Estimator Selection for Off-Policy Evaluation
Viaarxiv icon

Fair Ranking as Fair Division: Impact-Based Individual Fairness in Ranking

Add code
Jun 15, 2022
Figure 1 for Fair Ranking as Fair Division: Impact-Based Individual Fairness in Ranking
Figure 2 for Fair Ranking as Fair Division: Impact-Based Individual Fairness in Ranking
Figure 3 for Fair Ranking as Fair Division: Impact-Based Individual Fairness in Ranking
Figure 4 for Fair Ranking as Fair Division: Impact-Based Individual Fairness in Ranking
Viaarxiv icon

A Real-World Implementation of Unbiased Lift-based Bidding System

Add code
Feb 23, 2022
Figure 1 for A Real-World Implementation of Unbiased Lift-based Bidding System
Figure 2 for A Real-World Implementation of Unbiased Lift-based Bidding System
Figure 3 for A Real-World Implementation of Unbiased Lift-based Bidding System
Figure 4 for A Real-World Implementation of Unbiased Lift-based Bidding System
Viaarxiv icon

Off-Policy Evaluation for Large Action Spaces via Embeddings

Add code
Feb 13, 2022
Figure 1 for Off-Policy Evaluation for Large Action Spaces via Embeddings
Figure 2 for Off-Policy Evaluation for Large Action Spaces via Embeddings
Figure 3 for Off-Policy Evaluation for Large Action Spaces via Embeddings
Figure 4 for Off-Policy Evaluation for Large Action Spaces via Embeddings
Viaarxiv icon