
Aditya Mahajan

Model approximation in MDPs with unbounded per-step cost

Feb 13, 2024
Berk Bozkurt, Aditya Mahajan, Ashutosh Nayyar, Yi Ouyang

Bridging State and History Representations: Understanding Self-Predictive RL

Jan 17, 2024
Tianwei Ni, Benjamin Eysenbach, Erfan Seyedsalehi, Michel Ma, Clement Gehring, Aditya Mahajan, Pierre-Luc Bacon

Approximate information state based convergence analysis of recurrent Q-learning

Jun 09, 2023
Erfan Seyedsalehi, Nima Akbarzadeh, Amit Sinha, Aditya Mahajan

Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning

Feb 06, 2023
Hadi Nekoei, Akilesh Badrinaaraayanan, Amit Sinha, Mohammad Amini, Janarthanan Rajendran, Aditya Mahajan, Sarath Chandar

On learning history based policies for controlling Markov decision processes

Nov 06, 2022
Gandharv Patil, Aditya Mahajan, Doina Precup

On learning Whittle index policy for restless bandits with scalable regret

Feb 07, 2022
Nima Akbarzadeh, Aditya Mahajan

Consistency and Rate of Convergence of Switched Least Squares System Identification for Autonomous Switched Linear Systems

Dec 20, 2021
Borna Sayedana, Mohammad Afshari, Peter E. Caines, Aditya Mahajan

Scalable Operator Allocation for Multi-Robot Assistance: A Restless Bandit Approach

Nov 11, 2021
Abhinav Dahiya, Nima Akbarzadeh, Aditya Mahajan, Stephen L. Smith

A relaxed technical assumption for posterior sampling-based reinforcement learning for control of unknown linear systems

Aug 19, 2021
Mukul Gagrani, Sagar Sudhakara, Aditya Mahajan, Ashutosh Nayyar, Yi Ouyang

Scalable regret for learning to control network-coupled subsystems with unknown dynamics

Aug 18, 2021
Sagar Sudhakara, Aditya Mahajan, Ashutosh Nayyar, Yi Ouyang
