Picture for Zhuoran Yang

Zhuoran Yang

On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games

Add code
Mar 01, 2024
Viaarxiv icon

Training Dynamics of Multi-Head Softmax Attention for In-Context Learning: Emergence, Convergence, and Optimality

Add code
Feb 29, 2024
Viaarxiv icon

Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning

Add code
Feb 16, 2024
Viaarxiv icon

Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF

Add code
Feb 10, 2024
Viaarxiv icon

Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems

Add code
Dec 02, 2023
Viaarxiv icon

Empowering Autonomous Driving with Large Language Models: A Safety Perspective

Add code
Nov 28, 2023
Viaarxiv icon

Provably Efficient High-Dimensional Bandit Learning with Batched Feedbacks

Add code
Nov 24, 2023
Viaarxiv icon

Posterior Sampling for Competitive RL: Function Approximation and Partial Observation

Add code
Oct 30, 2023
Viaarxiv icon

Learning Regularized Graphon Mean-Field Games with Unknown Graphons

Add code
Oct 26, 2023
Viaarxiv icon

Learning Regularized Monotone Graphon Mean-Field Games

Add code
Oct 12, 2023
Viaarxiv icon