Picture for Yuta Saito

Yuta Saito

Off-Policy Evaluation for Ranking Policies under Deterministic Logging Policies

Add code
Mar 23, 2026
Viaarxiv icon

Off-Policy Learning with Limited Supply

Add code
Mar 19, 2026
Viaarxiv icon

Beyond Match Maximization and Fairness: Retention-Optimized Two-Sided Matching

Add code
Feb 17, 2026
Viaarxiv icon

A General Framework for Off-Policy Learning with Partially-Observed Reward

Add code
Jun 17, 2025
Figure 1 for A General Framework for Off-Policy Learning with Partially-Observed Reward
Figure 2 for A General Framework for Off-Policy Learning with Partially-Observed Reward
Figure 3 for A General Framework for Off-Policy Learning with Partially-Observed Reward
Figure 4 for A General Framework for Off-Policy Learning with Partially-Observed Reward
Viaarxiv icon

Prompt Optimization with Logged Bandit Data

Add code
Apr 03, 2025
Viaarxiv icon

A Best-of-Both Approach to Improve Match Predictions and Reciprocal Recommendations for Job Search

Add code
Sep 18, 2024
Figure 1 for A Best-of-Both Approach to Improve Match Predictions and Reciprocal Recommendations for Job Search
Figure 2 for A Best-of-Both Approach to Improve Match Predictions and Reciprocal Recommendations for Job Search
Figure 3 for A Best-of-Both Approach to Improve Match Predictions and Reciprocal Recommendations for Job Search
Viaarxiv icon

Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits

Add code
Aug 20, 2024
Figure 1 for Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits
Figure 2 for Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits
Figure 3 for Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits
Figure 4 for Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits
Viaarxiv icon

Long-term Off-Policy Evaluation and Learning

Add code
Apr 24, 2024
Figure 1 for Long-term Off-Policy Evaluation and Learning
Figure 2 for Long-term Off-Policy Evaluation and Learning
Figure 3 for Long-term Off-Policy Evaluation and Learning
Figure 4 for Long-term Off-Policy Evaluation and Learning
Viaarxiv icon

Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It

Add code
Apr 23, 2024
Viaarxiv icon

Scalable and Provably Fair Exposure Control for Large-Scale Recommender Systems

Add code
Feb 22, 2024
Figure 1 for Scalable and Provably Fair Exposure Control for Large-Scale Recommender Systems
Figure 2 for Scalable and Provably Fair Exposure Control for Large-Scale Recommender Systems
Figure 3 for Scalable and Provably Fair Exposure Control for Large-Scale Recommender Systems
Figure 4 for Scalable and Provably Fair Exposure Control for Large-Scale Recommender Systems
Viaarxiv icon