Alert button
Picture for Milind Tambe

Milind Tambe

Alert button

Robust Restless Bandits: Tackling Interval Uncertainty with Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 04, 2021
Jackson A. Killian, Lily Xu, Arpita Biswas, Milind Tambe

Figure 1 for Robust Restless Bandits: Tackling Interval Uncertainty with Deep Reinforcement Learning
Figure 2 for Robust Restless Bandits: Tackling Interval Uncertainty with Deep Reinforcement Learning
Figure 3 for Robust Restless Bandits: Tackling Interval Uncertainty with Deep Reinforcement Learning
Figure 4 for Robust Restless Bandits: Tackling Interval Uncertainty with Deep Reinforcement Learning
Viaarxiv icon

Q-Learning Lagrange Policies for Multi-Action Restless Bandits

Add code
Bookmark button
Alert button
Jun 22, 2021
Jackson A. Killian, Arpita Biswas, Sanket Shah, Milind Tambe

Figure 1 for Q-Learning Lagrange Policies for Multi-Action Restless Bandits
Figure 2 for Q-Learning Lagrange Policies for Multi-Action Restless Bandits
Figure 3 for Q-Learning Lagrange Policies for Multi-Action Restless Bandits
Viaarxiv icon

Robust Reinforcement Learning Under Minimax Regret for Green Security

Add code
Bookmark button
Alert button
Jun 15, 2021
Lily Xu, Andrew Perrault, Fei Fang, Haipeng Chen, Milind Tambe

Figure 1 for Robust Reinforcement Learning Under Minimax Regret for Green Security
Figure 2 for Robust Reinforcement Learning Under Minimax Regret for Green Security
Figure 3 for Robust Reinforcement Learning Under Minimax Regret for Green Security
Figure 4 for Robust Reinforcement Learning Under Minimax Regret for Green Security
Viaarxiv icon

Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Problems by Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 14, 2021
Kai Wang, Sanket Shah, Haipeng Chen, Andrew Perrault, Finale Doshi-Velez, Milind Tambe

Figure 1 for Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Problems by Reinforcement Learning
Figure 2 for Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Problems by Reinforcement Learning
Figure 3 for Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Problems by Reinforcement Learning
Figure 4 for Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Problems by Reinforcement Learning
Viaarxiv icon

Contingency-Aware Influence Maximization: A Reinforcement Learning Approach

Add code
Bookmark button
Alert button
Jun 13, 2021
Haipeng Chen, Wei Qiu, Han-Ching Ou, Bo An, Milind Tambe

Figure 1 for Contingency-Aware Influence Maximization: A Reinforcement Learning Approach
Figure 2 for Contingency-Aware Influence Maximization: A Reinforcement Learning Approach
Figure 3 for Contingency-Aware Influence Maximization: A Reinforcement Learning Approach
Figure 4 for Contingency-Aware Influence Maximization: A Reinforcement Learning Approach
Viaarxiv icon

Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare

Add code
Bookmark button
Alert button
May 17, 2021
Arpita Biswas, Gaurav Aggarwal, Pradeep Varakantham, Milind Tambe

Figure 1 for Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare
Figure 2 for Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare
Figure 3 for Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare
Viaarxiv icon

Selective Intervention Planning using Restless Multi-Armed Bandits to Improve Maternal and Child Health Outcomes

Add code
Bookmark button
Alert button
Apr 05, 2021
Siddharth Nishtala, Lovish Madaan, Aditya Mate, Harshavardhan Kamarthi, Anirudh Grama, Divy Thakkar, Dhyanesh Narayanan, Suresh Chaudhary, Neha Madhiwalla, Ramesh Padmanabhan, Aparna Hegde, Pradeep Varakantham, Balaraman Ravindran, Milind Tambe

Figure 1 for Selective Intervention Planning using Restless Multi-Armed Bandits to Improve Maternal and Child Health Outcomes
Figure 2 for Selective Intervention Planning using Restless Multi-Armed Bandits to Improve Maternal and Child Health Outcomes
Figure 3 for Selective Intervention Planning using Restless Multi-Armed Bandits to Improve Maternal and Child Health Outcomes
Figure 4 for Selective Intervention Planning using Restless Multi-Armed Bandits to Improve Maternal and Child Health Outcomes
Viaarxiv icon

Selective Intervention Planning using RMABs: Increasing Program Engagement to Improve Maternal and Child Health Outcomes

Add code
Bookmark button
Alert button
Mar 18, 2021
Siddharth Nishtala, Lovish Madaan, Harshavardhan Kamarthi, Anirudh Grama, Divy Thakkar, Dhyanesh Narayanan, Suresh Chaudhary, Neha Madhiwalla, Ramesh Padmanabhan, Aparna Hegde, Pradeep Varakantham, Balaraman Ravindran, Milind Tambe

Figure 1 for Selective Intervention Planning using RMABs: Increasing Program Engagement to Improve Maternal and Child Health Outcomes
Figure 2 for Selective Intervention Planning using RMABs: Increasing Program Engagement to Improve Maternal and Child Health Outcomes
Figure 3 for Selective Intervention Planning using RMABs: Increasing Program Engagement to Improve Maternal and Child Health Outcomes
Figure 4 for Selective Intervention Planning using RMABs: Increasing Program Engagement to Improve Maternal and Child Health Outcomes
Viaarxiv icon

Efficient Algorithms for Finite Horizon and Streaming Restless Multi-Armed Bandit Problems

Add code
Bookmark button
Alert button
Mar 08, 2021
Aditya Mate, Arpita Biswas, Christoph Siebenbrunner, Milind Tambe

Figure 1 for Efficient Algorithms for Finite Horizon and Streaming Restless Multi-Armed Bandit Problems
Figure 2 for Efficient Algorithms for Finite Horizon and Streaming Restless Multi-Armed Bandit Problems
Figure 3 for Efficient Algorithms for Finite Horizon and Streaming Restless Multi-Armed Bandit Problems
Figure 4 for Efficient Algorithms for Finite Horizon and Streaming Restless Multi-Armed Bandit Problems
Viaarxiv icon