Zhuoran Yang

Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency

May 26, 2022
Via arXiv

Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation

May 24, 2022
Via arXiv

Pessimism meets VCG: Learning Dynamic Mechanism Design via Offline Reinforcement Learning

May 05, 2022
Via arXiv

Sample-Efficient Reinforcement Learning for POMDPs with Linear Function Approximations

Apr 20, 2022
Via arXiv

Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets

Mar 07, 2022
Via arXiv

The Best of Both Worlds: Reinforcement Learning with Logarithmic Regret and Policy Switches

Mar 03, 2022
Via arXiv

Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach

Feb 25, 2022
Via arXiv

Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning

Feb 23, 2022
Via arXiv

Sequential Information Design: Markov Persuasion Process and Its Efficient Reinforcement Learning

Feb 22, 2022
Via arXiv

Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets

Feb 15, 2022
Via arXiv