Picture for Sihan Zeng

Sihan Zeng

Partially Observable Contextual Bandits with Linear Payoffs

Add code
Sep 17, 2024
Viaarxiv icon

Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning

Add code
May 15, 2024
Viaarxiv icon

Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning

Add code
May 03, 2024
Viaarxiv icon

QCQP-Net: Reliably Learning Feasible Alternating Current Optimal Power Flow Solutions Under Constraints

Add code
Jan 11, 2024
Figure 1 for QCQP-Net: Reliably Learning Feasible Alternating Current Optimal Power Flow Solutions Under Constraints
Figure 2 for QCQP-Net: Reliably Learning Feasible Alternating Current Optimal Power Flow Solutions Under Constraints
Figure 3 for QCQP-Net: Reliably Learning Feasible Alternating Current Optimal Power Flow Solutions Under Constraints
Viaarxiv icon

Near-Optimal Fair Resource Allocation for Strategic Agents without Money: A Data-Driven Approach

Add code
Nov 18, 2023
Viaarxiv icon

Connected Superlevel Set in (Deep) Reinforcement Learning and its Application to Minimax Theorems

Add code
Mar 23, 2023
Viaarxiv icon

Sequential Fair Resource Allocation under a Markov Decision Process Framework

Add code
Jan 10, 2023
Viaarxiv icon

Regularized Gradient Descent Ascent for Two-Player Zero-Sum Markov Games

Add code
May 27, 2022
Figure 1 for Regularized Gradient Descent Ascent for Two-Player Zero-Sum Markov Games
Figure 2 for Regularized Gradient Descent Ascent for Two-Player Zero-Sum Markov Games
Viaarxiv icon

A Reinforcement Learning Approach to Parameter Selection for Distributed Optimization in Power Systems

Add code
Oct 22, 2021
Figure 1 for A Reinforcement Learning Approach to Parameter Selection for Distributed Optimization in Power Systems
Figure 2 for A Reinforcement Learning Approach to Parameter Selection for Distributed Optimization in Power Systems
Figure 3 for A Reinforcement Learning Approach to Parameter Selection for Distributed Optimization in Power Systems
Figure 4 for A Reinforcement Learning Approach to Parameter Selection for Distributed Optimization in Power Systems
Viaarxiv icon

Finite-Time Complexity of Online Primal-Dual Natural Actor-Critic Algorithm for Constrained Markov Decision Processes

Add code
Oct 21, 2021
Figure 1 for Finite-Time Complexity of Online Primal-Dual Natural Actor-Critic Algorithm for Constrained Markov Decision Processes
Figure 2 for Finite-Time Complexity of Online Primal-Dual Natural Actor-Critic Algorithm for Constrained Markov Decision Processes
Viaarxiv icon