Nuoya Xiong

Sample-Efficient Multi-Agent RL: An Optimization Perspective

Oct 10, 2023
Nuoya Xiong, Zhihan Liu, Zhaoran Wang, Zhuoran Yang

How Over-Parameterization Slows Down Gradient Descent in Matrix Sensing: The Curses of Symmetry and Initialization

Oct 09, 2023
Nuoya Xiong, Lijun Ding, Simon S. Du

A General Framework for Sequential Decision-Making under Adaptivity Constraints

Jun 27, 2023
Nuoya Xiong, Zhaoran Wang, Zhuoran Yang

Provably Safe Reinforcement Learning with Step-wise Violation Constraints

Feb 13, 2023
Nuoya Xiong, Yihan Du, Longbo Huang

Combinatorial Causal Bandits without Graph Skeleton

Jan 31, 2023
Shi Feng, Nuoya Xiong, Wei Chen

Pure Exploration of Causal Bandits

Jun 16, 2022
Nuoya Xiong, Wei Chen
