Alert button
Picture for P. R. Kumar

P. R. Kumar

Alert button

Provable Policy Gradient Methods for Average-Reward Markov Potential Games

Mar 09, 2024
Min Cheng, Ruida Zhou, P. R. Kumar, Chao Tian

Viaarxiv icon

Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games

Oct 27, 2023
Youbang Sun, Tao Liu, Ruida Zhou, P. R. Kumar, Shahin Shahrampour

Viaarxiv icon

Value-Biased Maximum Likelihood Estimation for Model-based Reinforcement Learning in Discounted Linear MDPs

Oct 17, 2023
Yu-Heng Hung, Ping-Chun Hsieh, Akshay Mete, P. R. Kumar

Viaarxiv icon

Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation

Jul 17, 2023
Ruida Zhou, Tao Liu, Min Cheng, Dileep Kalathil, P. R. Kumar, Chao Tian

Figure 1 for Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Figure 2 for Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Figure 3 for Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Figure 4 for Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Viaarxiv icon

Finite Time Regret Bounds for Minimum Variance Control of Autoregressive Systems with Exogenous Inputs

May 26, 2023
Rahul Singh, Akshay Mete, Avik Kar, P. R. Kumar

Figure 1 for Finite Time Regret Bounds for Minimum Variance Control of Autoregressive Systems with Exogenous Inputs
Figure 2 for Finite Time Regret Bounds for Minimum Variance Control of Autoregressive Systems with Exogenous Inputs
Figure 3 for Finite Time Regret Bounds for Minimum Variance Control of Autoregressive Systems with Exogenous Inputs
Figure 4 for Finite Time Regret Bounds for Minimum Variance Control of Autoregressive Systems with Exogenous Inputs
Viaarxiv icon

Recommender system as an exploration coordinator: a bounded O(1) regret algorithm for large platforms

Jan 29, 2023
Hyunwook Kang, P. R. Kumar

Figure 1 for Recommender system as an exploration coordinator: a bounded O(1) regret algorithm for large platforms
Viaarxiv icon

TERRA: Beam Management for Outdoor mm-Wave Networks

Jan 10, 2023
Santosh Ganji, Jaewon Kim, Romil Sonigra, P. R. Kumar

Figure 1 for TERRA: Beam Management for Outdoor mm-Wave Networks
Figure 2 for TERRA: Beam Management for Outdoor mm-Wave Networks
Figure 3 for TERRA: Beam Management for Outdoor mm-Wave Networks
Figure 4 for TERRA: Beam Management for Outdoor mm-Wave Networks
Viaarxiv icon

Energy System Digitization in the Era of AI: A Three-Layered Approach towards Carbon Neutrality

Nov 02, 2022
Le Xie, Tong Huang, Xiangtian Zheng, Yan Liu, Mengdi Wang, Vijay Vittal, P. R. Kumar, Srinivas Shakkottai, Yi Cui

Figure 1 for Energy System Digitization in the Era of AI: A Three-Layered Approach towards Carbon Neutrality
Figure 2 for Energy System Digitization in the Era of AI: A Three-Layered Approach towards Carbon Neutrality
Figure 3 for Energy System Digitization in the Era of AI: A Three-Layered Approach towards Carbon Neutrality
Figure 4 for Energy System Digitization in the Era of AI: A Three-Layered Approach towards Carbon Neutrality
Viaarxiv icon

Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning

Jun 10, 2022
Ruida Zhou, Tao Liu, Dileep Kalathil, P. R. Kumar, Chao Tian

Figure 1 for Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning
Figure 2 for Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning
Figure 3 for Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning
Figure 4 for Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning
Viaarxiv icon