Picture for Navdeep Kumar

Navdeep Kumar

Policy Gradient with Tree Search: Avoiding Local Optimas through Lookahead

Add code
Jun 08, 2025
Viaarxiv icon

Dual Formulation for Non-Rectangular Lp Robust Markov Decision Processes

Add code
Feb 13, 2025
Viaarxiv icon

Improved Sample Complexity for Global Convergence of Actor-Critic Algorithms

Add code
Oct 11, 2024
Figure 1 for Improved Sample Complexity for Global Convergence of Actor-Critic Algorithms
Figure 2 for Improved Sample Complexity for Global Convergence of Actor-Critic Algorithms
Viaarxiv icon

On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes

Add code
Mar 11, 2024
Viaarxiv icon

Solving Non-Rectangular Reward-Robust MDPs via Frequency Regularization

Add code
Sep 03, 2023
Viaarxiv icon

Robust Reinforcement Learning via Adversarial Kernel Approximation

Add code
Jun 09, 2023
Viaarxiv icon

An Efficient Solution to s-Rectangular Robust Markov Decision Processes

Add code
Jan 31, 2023
Viaarxiv icon

Policy Gradient for s-Rectangular Robust Markov Decision Processes

Add code
Jan 31, 2023
Viaarxiv icon

Policy Gradient for Reinforcement Learning with General Utilities

Add code
Oct 03, 2022
Viaarxiv icon

Efficient Policy Iteration for Robust Markov Decision Processes via Regularization

Add code
May 28, 2022
Figure 1 for Efficient Policy Iteration for Robust Markov Decision Processes via Regularization
Figure 2 for Efficient Policy Iteration for Robust Markov Decision Processes via Regularization
Viaarxiv icon