Zhuoran Yang

Empowering Autonomous Driving with Large Language Models: A Safety Perspective

Nov 28, 2023
Via arXiv

Provably Efficient High-Dimensional Bandit Learning with Batched Feedbacks

Nov 24, 2023
Via arXiv

Posterior Sampling for Competitive RL: Function Approximation and Partial Observation

Oct 30, 2023
Via arXiv

Learning Regularized Graphon Mean-Field Games with Unknown Graphons

Oct 26, 2023
Via arXiv

Learning Regularized Monotone Graphon Mean-Field Games

Oct 12, 2023
Via arXiv

Sample-Efficient Multi-Agent RL: An Optimization Perspective

Oct 10, 2023
Via arXiv

Actions Speak What You Want: Provably Sample-Efficient Reinforcement Learning of the Quantal Stackelberg Equilibrium from Strategic Feedbacks

Jul 26, 2023
Via arXiv

Contextual Dynamic Pricing with Strategic Buyers

Jul 08, 2023
Via arXiv

A General Framework for Sequential Decision-Making under Adaptivity Constraints

Jun 27, 2023
Via arXiv

Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP

Jun 21, 2023
Via arXiv