Alert button
Picture for P. R. Kumar

P. R. Kumar

Alert button

Provable Policy Gradient Methods for Average-Reward Markov Potential Games

Add code
Bookmark button
Alert button
Mar 09, 2024
Min Cheng, Ruida Zhou, P. R. Kumar, Chao Tian

Figure 1 for Provable Policy Gradient Methods for Average-Reward Markov Potential Games
Figure 2 for Provable Policy Gradient Methods for Average-Reward Markov Potential Games
Viaarxiv icon

Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games

Add code
Bookmark button
Alert button
Oct 27, 2023
Youbang Sun, Tao Liu, Ruida Zhou, P. R. Kumar, Shahin Shahrampour

Viaarxiv icon

Value-Biased Maximum Likelihood Estimation for Model-based Reinforcement Learning in Discounted Linear MDPs

Add code
Bookmark button
Alert button
Oct 17, 2023
Yu-Heng Hung, Ping-Chun Hsieh, Akshay Mete, P. R. Kumar

Viaarxiv icon

Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation

Add code
Bookmark button
Alert button
Jul 17, 2023
Ruida Zhou, Tao Liu, Min Cheng, Dileep Kalathil, P. R. Kumar, Chao Tian

Figure 1 for Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Figure 2 for Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Figure 3 for Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Figure 4 for Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Viaarxiv icon

Finite Time Regret Bounds for Minimum Variance Control of Autoregressive Systems with Exogenous Inputs

Add code
Bookmark button
Alert button
May 26, 2023
Rahul Singh, Akshay Mete, Avik Kar, P. R. Kumar

Figure 1 for Finite Time Regret Bounds for Minimum Variance Control of Autoregressive Systems with Exogenous Inputs
Figure 2 for Finite Time Regret Bounds for Minimum Variance Control of Autoregressive Systems with Exogenous Inputs
Figure 3 for Finite Time Regret Bounds for Minimum Variance Control of Autoregressive Systems with Exogenous Inputs
Figure 4 for Finite Time Regret Bounds for Minimum Variance Control of Autoregressive Systems with Exogenous Inputs
Viaarxiv icon

Recommender system as an exploration coordinator: a bounded O(1) regret algorithm for large platforms

Add code
Bookmark button
Alert button
Jan 29, 2023
Hyunwook Kang, P. R. Kumar

Figure 1 for Recommender system as an exploration coordinator: a bounded O(1) regret algorithm for large platforms
Viaarxiv icon

TERRA: Beam Management for Outdoor mm-Wave Networks

Add code
Bookmark button
Alert button
Jan 10, 2023
Santosh Ganji, Jaewon Kim, Romil Sonigra, P. R. Kumar

Figure 1 for TERRA: Beam Management for Outdoor mm-Wave Networks
Figure 2 for TERRA: Beam Management for Outdoor mm-Wave Networks
Figure 3 for TERRA: Beam Management for Outdoor mm-Wave Networks
Figure 4 for TERRA: Beam Management for Outdoor mm-Wave Networks
Viaarxiv icon

Energy System Digitization in the Era of AI: A Three-Layered Approach towards Carbon Neutrality

Add code
Bookmark button
Alert button
Nov 02, 2022
Le Xie, Tong Huang, Xiangtian Zheng, Yan Liu, Mengdi Wang, Vijay Vittal, P. R. Kumar, Srinivas Shakkottai, Yi Cui

Figure 1 for Energy System Digitization in the Era of AI: A Three-Layered Approach towards Carbon Neutrality
Figure 2 for Energy System Digitization in the Era of AI: A Three-Layered Approach towards Carbon Neutrality
Figure 3 for Energy System Digitization in the Era of AI: A Three-Layered Approach towards Carbon Neutrality
Figure 4 for Energy System Digitization in the Era of AI: A Three-Layered Approach towards Carbon Neutrality
Viaarxiv icon

Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 10, 2022
Ruida Zhou, Tao Liu, Dileep Kalathil, P. R. Kumar, Chao Tian

Figure 1 for Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning
Figure 2 for Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning
Figure 3 for Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning
Figure 4 for Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning
Viaarxiv icon