Vivek S. Borkar

A Concentration Bound for TD(0) with Function Approximation

Dec 16, 2023
Siddharth Chandak, Vivek S. Borkar

Approximation of Convex Envelope Using Reinforcement Learning

Nov 24, 2023
Vivek S. Borkar, Adit Akarsh

Decentralised Q-Learning for Multi-Agent Markov Decision Processes with a Satisfiability Criterion

Nov 21, 2023
Keshav P. Keval, Vivek S. Borkar

Actor-Critic or Critic-Actor? A Tale of Two Time Scales

Oct 10, 2022
Shalabh Bhatnagar, Vivek S. Borkar, Soumyajit Guin

A Concentration Bound for LSPE($λ$)

Nov 04, 2021
Vivek S. Borkar, Siddharth Chandak, Harsh Dolhare

Concentration of Contractive Stochastic Approximation and Reinforcement Learning

Jun 27, 2021
Siddharth Chandak, Vivek S. Borkar

Dynamic social learning under graph constraints

Jul 08, 2020
Konstantin Avrachenkov, Vivek S. Borkar, Sharayu Moharir, Suhail M. Shah

Whittle index based Q-learning for restless bandits with average reward

Apr 29, 2020
Konstantin Avrachenkov, Vivek S. Borkar

Vector Field Guidance for Convoy Monitoring Using Elliptical Orbits

Sep 13, 2017
Aseem V. Borkar, Vivek S. Borkar, Arpita Sinha

Gradient Estimation with Simultaneous Perturbation and Compressive Sensing

Jul 26, 2016
Vivek S. Borkar, Vikranth R. Dwaracherla, Neeraja Sahasrabudhe
