Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach


Sep 13, 2021
Qinbo Bai, Amrit Singh Bedi, Mridul Agarwal, Alec Koppel, Vaneet Aggarwal



Concave Utility Reinforcement Learning with Zero-Constraint Violations


Sep 12, 2021
Mridul Agarwal, Qinbo Bai, Vaneet Aggarwal



On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC)


Sep 09, 2021
Washim Uddin Mondal, Mridul Agarwal, Vaneet Aggarwal, Satish V. Ukkusuri

* 47 pages 


Markov Decision Processes with Long-Term Average Constraints


Jun 12, 2021
Mridul Agarwal, Qinbo Bai, Vaneet Aggarwal



Joint Optimization of Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm


May 28, 2021
Qinbo Bai, Mridul Agarwal, Vaneet Aggarwal



Communication Efficient Parallel Reinforcement Learning


Feb 22, 2021
Mridul Agarwal, Bhargav Ganguly, Vaneet Aggarwal



Multi-Agent Multi-Armed Bandits with Limited Communication


Feb 10, 2021
Mridul Agarwal, Vaneet Aggarwal, Kamyar Azizzadenesheli



Blind Decision Making: Reinforcement Learning with Delayed Observations


Nov 16, 2020
Mridul Agarwal, Vaneet Aggarwal



DART: aDaptive Accept RejecT for non-linear top-K subset identification


Nov 16, 2020
Mridul Agarwal, Vaneet Aggarwal, Christopher J. Quinn, Abhishek Umrawal



Escaping Saddle Points for Zeroth-order Nonconvex Optimization using Estimated Gradient Descent


Oct 03, 2019
Qinbo Bai, Mridul Agarwal, Vaneet Aggarwal

* arXiv admin note: text overlap with arXiv:1703.00887 by other authors 
