Ashutosh Nayyar

Model approximation in MDPs with unbounded per-step cost

Feb 13, 2024
Berk Bozkurt, Aditya Mahajan, Ashutosh Nayyar, Yi Ouyang

Regret Analysis of the Posterior Sampling-based Learning Algorithm for Episodic POMDPs

Oct 16, 2023
Dengwang Tang, Rahul Jain, Ashutosh Nayyar, Pierluigi Nuzzo

Conditional Kernel Imitation Learning for Continuous State Environments

Aug 24, 2023
Rishabh Agrawal, Nathan Dahlin, Rahul Jain, Ashutosh Nayyar

Optimal Control of Logically Constrained Partially Observable and Multi-Agent Markov Decision Processes

May 24, 2023
Krishna C. Kalagarla, Dhruva Kartik, Dongming Shen, Rahul Jain, Ashutosh Nayyar, Pierluigi Nuzzo

A Novel Point-based Algorithm for Multi-agent Control Using the Common Information Approach

Apr 10, 2023
Dengwang Tang, Ashutosh Nayyar, Rahul Jain

Learning Zero-sum Stochastic Games with Posterior Sampling

Sep 08, 2021
Mehdi Jafarnia-Jahromi, Rahul Jain, Ashutosh Nayyar

A relaxed technical assumption for posterior sampling-based reinforcement learning for control of unknown linear systems

Aug 19, 2021
Mukul Gagrani, Sagar Sudhakara, Aditya Mahajan, Ashutosh Nayyar, Yi Ouyang

Scalable regret for learning to control network-coupled subsystems with unknown dynamics

Aug 18, 2021
Sagar Sudhakara, Aditya Mahajan, Ashutosh Nayyar, Yi Ouyang

Online Learning for Unknown Partially Observable MDPs

Feb 25, 2021
Mehdi Jafarnia-Jahromi, Rahul Jain, Ashutosh Nayyar

Thompson sampling for linear quadratic mean-field teams

Nov 09, 2020
Mukul Gagrani, Sagar Sudhakara, Aditya Mahajan, Ashutosh Nayyar, Yi Ouyang
