Alert button
Picture for Zhuoran Yang

Zhuoran Yang

Alert button

Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation

Add code
Bookmark button
Alert button
Apr 19, 2024
Jianliang He, Han Zhong, Zhuoran Yang

Viaarxiv icon

A Mean-Field Analysis of Neural Gradient Descent-Ascent: Applications to Functional Conditional Moment Equations

Add code
Bookmark button
Alert button
Apr 18, 2024
Yuchen Zhu, Yufeng Zhang, Zhaoran Wang, Zhuoran Yang, Xiaohong Chen

Viaarxiv icon

Unveil Conditional Diffusion Models with Classifier-free Guidance: A Sharp Statistical Theory

Add code
Bookmark button
Alert button
Mar 18, 2024
Hengyu Fu, Zhuoran Yang, Mengdi Wang, Minshuo Chen

Figure 1 for Unveil Conditional Diffusion Models with Classifier-free Guidance: A Sharp Statistical Theory
Figure 2 for Unveil Conditional Diffusion Models with Classifier-free Guidance: A Sharp Statistical Theory
Figure 3 for Unveil Conditional Diffusion Models with Classifier-free Guidance: A Sharp Statistical Theory
Figure 4 for Unveil Conditional Diffusion Models with Classifier-free Guidance: A Sharp Statistical Theory
Viaarxiv icon

On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games

Add code
Bookmark button
Alert button
Mar 01, 2024
Awni Altabaa, Zhuoran Yang

Figure 1 for On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games
Figure 2 for On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games
Figure 3 for On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games
Figure 4 for On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games
Viaarxiv icon

Training Dynamics of Multi-Head Softmax Attention for In-Context Learning: Emergence, Convergence, and Optimality

Add code
Bookmark button
Alert button
Feb 29, 2024
Siyu Chen, Heejune Sheen, Tianhao Wang, Zhuoran Yang

Viaarxiv icon

Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 16, 2024
Zihao Li, Boyi Liu, Zhuoran Yang, Zhaoran Wang, Mengdi Wang

Viaarxiv icon

Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF

Add code
Bookmark button
Alert button
Feb 10, 2024
Han Shen, Zhuoran Yang, Tianyi Chen

Viaarxiv icon

Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems

Add code
Bookmark button
Alert button
Dec 02, 2023
Juno Kim, Kakei Yamamoto, Kazusato Oko, Zhuoran Yang, Taiji Suzuki

Viaarxiv icon

Empowering Autonomous Driving with Large Language Models: A Safety Perspective

Add code
Bookmark button
Alert button
Nov 28, 2023
Yixuan Wang, Ruochen Jiao, Chengtian Lang, Sinong Simon Zhan, Chao Huang, Zhaoran Wang, Zhuoran Yang, Qi Zhu

Figure 1 for Empowering Autonomous Driving with Large Language Models: A Safety Perspective
Figure 2 for Empowering Autonomous Driving with Large Language Models: A Safety Perspective
Figure 3 for Empowering Autonomous Driving with Large Language Models: A Safety Perspective
Figure 4 for Empowering Autonomous Driving with Large Language Models: A Safety Perspective
Viaarxiv icon

Provably Efficient High-Dimensional Bandit Learning with Batched Feedbacks

Add code
Bookmark button
Alert button
Nov 24, 2023
Jianqing Fan, Zhaoran Wang, Zhuoran Yang, Chenlu Ye

Figure 1 for Provably Efficient High-Dimensional Bandit Learning with Batched Feedbacks
Figure 2 for Provably Efficient High-Dimensional Bandit Learning with Batched Feedbacks
Figure 3 for Provably Efficient High-Dimensional Bandit Learning with Batched Feedbacks
Figure 4 for Provably Efficient High-Dimensional Bandit Learning with Batched Feedbacks
Viaarxiv icon