Alert button
Picture for Dheeraj Nagaraj

Dheeraj Nagaraj

Alert button

A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health

Add code
Bookmark button
Alert button
Feb 23, 2024
Nikhil Behari, Edwin Zhang, Yunfan Zhao, Aparna Taneja, Dheeraj Nagaraj, Milind Tambe

Viaarxiv icon

Towards Zero Shot Learning in Restless Multi-armed Bandits

Add code
Bookmark button
Alert button
Oct 23, 2023
Yunfan Zhao, Nikhil Behari, Edward Hughes, Edwin Zhang, Dheeraj Nagaraj, Karl Tuyls, Aparna Taneja, Milind Tambe

Viaarxiv icon

Near Optimal Heteroscedastic Regression with Symbiotic Learning

Add code
Bookmark button
Alert button
Jul 01, 2023
Dheeraj Baby, Aniket Das, Dheeraj Nagaraj, Praneeth Netrapalli

Figure 1 for Near Optimal Heteroscedastic Regression with Symbiotic Learning
Figure 2 for Near Optimal Heteroscedastic Regression with Symbiotic Learning
Viaarxiv icon

Stochastic Re-weighted Gradient Descent via Distributionally Robust Optimization

Add code
Bookmark button
Alert button
Jun 15, 2023
Ramnath Kumar, Kushal Majmundar, Dheeraj Nagaraj, Arun Sai Suggala

Figure 1 for Stochastic Re-weighted Gradient Descent via Distributionally Robust Optimization
Figure 2 for Stochastic Re-weighted Gradient Descent via Distributionally Robust Optimization
Figure 3 for Stochastic Re-weighted Gradient Descent via Distributionally Robust Optimization
Figure 4 for Stochastic Re-weighted Gradient Descent via Distributionally Robust Optimization
Viaarxiv icon

Provably Fast Finite Particle Variants of SVGD via Virtual Particle Stochastic Approximation

Add code
Bookmark button
Alert button
May 27, 2023
Aniket Das, Dheeraj Nagaraj

Figure 1 for Provably Fast Finite Particle Variants of SVGD via Virtual Particle Stochastic Approximation
Figure 2 for Provably Fast Finite Particle Variants of SVGD via Virtual Particle Stochastic Approximation
Viaarxiv icon

Indexability is Not Enough for Whittle: Improved, Near-Optimal Algorithms for Restless Bandits

Add code
Bookmark button
Alert button
Oct 31, 2022
Abheek Ghosh, Dheeraj Nagaraj, Manish Jain, Milind Tambe

Figure 1 for Indexability is Not Enough for Whittle: Improved, Near-Optimal Algorithms for Restless Bandits
Figure 2 for Indexability is Not Enough for Whittle: Improved, Near-Optimal Algorithms for Restless Bandits
Figure 3 for Indexability is Not Enough for Whittle: Improved, Near-Optimal Algorithms for Restless Bandits
Figure 4 for Indexability is Not Enough for Whittle: Improved, Near-Optimal Algorithms for Restless Bandits
Viaarxiv icon

Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation

Add code
Bookmark button
Alert button
Oct 12, 2022
Gandharv Patil, Prashanth L. A., Dheeraj Nagaraj, Doina Precup

Figure 1 for Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation
Figure 2 for Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation
Figure 3 for Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation
Viaarxiv icon

Multi-User Reinforcement Learning with Low Rank Rewards

Add code
Bookmark button
Alert button
Oct 11, 2022
Naman Agarwal, Prateek Jain, Suhas Kowshik, Dheeraj Nagaraj, Praneeth Netrapalli

Viaarxiv icon

Entropic Convergence of Random Batch Methods for Interacting Particle Diffusion

Add code
Bookmark button
Alert button
Jun 08, 2022
Dheeraj Nagaraj

Figure 1 for Entropic Convergence of Random Batch Methods for Interacting Particle Diffusion
Figure 2 for Entropic Convergence of Random Batch Methods for Interacting Particle Diffusion
Viaarxiv icon