
Zhuoran Yang

Unveil Conditional Diffusion Models with Classifier-free Guidance: A Sharp Statistical Theory

Mar 18, 2024
Hengyu Fu, Zhuoran Yang, Mengdi Wang, Minshuo Chen

On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games

Mar 01, 2024
Awni Altabaa, Zhuoran Yang

Training Dynamics of Multi-Head Softmax Attention for In-Context Learning: Emergence, Convergence, and Optimality

Feb 29, 2024
Siyu Chen, Heejune Sheen, Tianhao Wang, Zhuoran Yang

Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning

Feb 16, 2024
Zihao Li, Boyi Liu, Zhuoran Yang, Zhaoran Wang, Mengdi Wang

Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF

Feb 10, 2024
Han Shen, Zhuoran Yang, Tianyi Chen

Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems

Dec 02, 2023
Juno Kim, Kakei Yamamoto, Kazusato Oko, Zhuoran Yang, Taiji Suzuki

Empowering Autonomous Driving with Large Language Models: A Safety Perspective

Nov 28, 2023
Yixuan Wang, Ruochen Jiao, Chengtian Lang, Sinong Simon Zhan, Chao Huang, Zhaoran Wang, Zhuoran Yang, Qi Zhu

Provably Efficient High-Dimensional Bandit Learning with Batched Feedbacks

Nov 24, 2023
Jianqing Fan, Zhaoran Wang, Zhuoran Yang, Chenlu Ye

Posterior Sampling for Competitive RL: Function Approximation and Partial Observation

Oct 30, 2023
Shuang Qiu, Ziyu Dai, Han Zhong, Zhaoran Wang, Zhuoran Yang, Tong Zhang

Learning Regularized Graphon Mean-Field Games with Unknown Graphons

Oct 26, 2023
Fengzhuo Zhang, Vincent Y. F. Tan, Zhaoran Wang, Zhuoran Yang
