Picture for Boxiang Lyu

Boxiang Lyu

An Instrumental Value for Data Production and its Application to Data Pricing

Add code
Dec 24, 2024
Viaarxiv icon

Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning

Add code
Jul 24, 2024
Figure 1 for Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning
Figure 2 for Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning
Figure 3 for Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning
Figure 4 for Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning
Viaarxiv icon

Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning

Add code
Jul 10, 2024
Figure 1 for Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
Viaarxiv icon

Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm

Add code
Jun 05, 2023
Figure 1 for Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm
Figure 2 for Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm
Figure 3 for Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm
Figure 4 for Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm
Viaarxiv icon

Pairwise Ranking Losses of Click-Through Rates Prediction for Welfare Maximization in Ad Auctions

Add code
Jun 01, 2023
Figure 1 for Pairwise Ranking Losses of Click-Through Rates Prediction for Welfare Maximization in Ad Auctions
Figure 2 for Pairwise Ranking Losses of Click-Through Rates Prediction for Welfare Maximization in Ad Auctions
Figure 3 for Pairwise Ranking Losses of Click-Through Rates Prediction for Welfare Maximization in Ad Auctions
Figure 4 for Pairwise Ranking Losses of Click-Through Rates Prediction for Welfare Maximization in Ad Auctions
Viaarxiv icon

A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design

Add code
Oct 19, 2022
Figure 1 for A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design
Viaarxiv icon

One Policy is Enough: Parallel Exploration with a Single Policy is Minimax Optimal for Reward-Free Reinforcement Learning

Add code
May 31, 2022
Viaarxiv icon

Pessimism meets VCG: Learning Dynamic Mechanism Design via Offline Reinforcement Learning

Add code
May 05, 2022
Viaarxiv icon

Personalized Federated Learning with Multiple Known Clusters

Add code
Apr 28, 2022
Figure 1 for Personalized Federated Learning with Multiple Known Clusters
Figure 2 for Personalized Federated Learning with Multiple Known Clusters
Figure 3 for Personalized Federated Learning with Multiple Known Clusters
Figure 4 for Personalized Federated Learning with Multiple Known Clusters
Viaarxiv icon

Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach

Add code
Feb 25, 2022
Figure 1 for Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach
Figure 2 for Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach
Viaarxiv icon