Alert button
Picture for Debmalya Mandal

Debmalya Mandal

Alert button

Corruption-Robust Offline Two-Player Zero-Sum Markov Games

Add code
Bookmark button
Alert button
Mar 04, 2024
Andi Nika, Debmalya Mandal, Adish Singla, Goran Radanović

Figure 1 for Corruption-Robust Offline Two-Player Zero-Sum Markov Games
Figure 2 for Corruption-Robust Offline Two-Player Zero-Sum Markov Games
Viaarxiv icon

Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences

Add code
Bookmark button
Alert button
Mar 04, 2024
Andi Nika, Debmalya Mandal, Parameswaran Kamalaruban, Georgios Tzannetos, Goran Radanović, Adish Singla

Figure 1 for Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences
Viaarxiv icon

Performative Reinforcement Learning in Gradually Shifting Environments

Add code
Bookmark button
Alert button
Feb 15, 2024
Ben Rank, Stelios Triantafyllou, Debmalya Mandal, Goran Radanovic

Viaarxiv icon

Learning the Expected Core of Strictly Convex Stochastic Cooperative Games

Add code
Bookmark button
Alert button
Feb 10, 2024
Nam Phuong Tran, The Anh Ta, Shuqing Shi, Debmalya Mandal, Yali Du, Long Tran-Thanh

Viaarxiv icon

Corruption Robust Offline Reinforcement Learning with Human Feedback

Add code
Bookmark button
Alert button
Feb 09, 2024
Debmalya Mandal, Andi Nika, Parameswaran Kamalaruban, Adish Singla, Goran Radanović

Viaarxiv icon

Agent-Specific Effects

Add code
Bookmark button
Alert button
Oct 17, 2023
Stelios Triantafyllou, Aleksa Sukovic, Debmalya Mandal, Goran Radanovic

Viaarxiv icon

Markov Decision Processes with Time-Varying Geometric Discounting

Add code
Bookmark button
Alert button
Jul 19, 2023
Jiarui Gan, Annika Hennes, Rupak Majumdar, Debmalya Mandal, Goran Radanovic

Figure 1 for Markov Decision Processes with Time-Varying Geometric Discounting
Figure 2 for Markov Decision Processes with Time-Varying Geometric Discounting
Figure 3 for Markov Decision Processes with Time-Varying Geometric Discounting
Viaarxiv icon

Sequential Principal-Agent Problems with Communication: Efficient Computation and Learning

Add code
Bookmark button
Alert button
Jun 06, 2023
Jiarui Gan, Rupak Majumdar, Debmalya Mandal, Goran Radanovic

Viaarxiv icon

Implicit Poisoning Attacks in Two-Agent Reinforcement Learning: Adversarial Policies for Training-Time Attacks

Add code
Bookmark button
Alert button
Feb 27, 2023
Mohammad Mohammadi, Jonathan Nöther, Debmalya Mandal, Adish Singla, Goran Radanovic

Figure 1 for Implicit Poisoning Attacks in Two-Agent Reinforcement Learning: Adversarial Policies for Training-Time Attacks
Figure 2 for Implicit Poisoning Attacks in Two-Agent Reinforcement Learning: Adversarial Policies for Training-Time Attacks
Figure 3 for Implicit Poisoning Attacks in Two-Agent Reinforcement Learning: Adversarial Policies for Training-Time Attacks
Figure 4 for Implicit Poisoning Attacks in Two-Agent Reinforcement Learning: Adversarial Policies for Training-Time Attacks
Viaarxiv icon

Online Reinforcement Learning with Uncertain Episode Lengths

Add code
Bookmark button
Alert button
Feb 07, 2023
Debmalya Mandal, Goran Radanovic, Jiarui Gan, Adish Singla, Rupak Majumdar

Figure 1 for Online Reinforcement Learning with Uncertain Episode Lengths
Viaarxiv icon