Picture for Bikramjit Banerjee

Bikramjit Banerjee

Quasimetric Value Functions with Dense Rewards

Add code
Sep 13, 2024
Viaarxiv icon

Latent Interactive A2C for Improved RL in Open Many-Agent Systems

Add code
May 09, 2023
Viaarxiv icon

Many Agent Reinforcement Learning Under Partial Observability

Add code
Jun 17, 2021
Figure 1 for Many Agent Reinforcement Learning Under Partial Observability
Figure 2 for Many Agent Reinforcement Learning Under Partial Observability
Figure 3 for Many Agent Reinforcement Learning Under Partial Observability
Figure 4 for Many Agent Reinforcement Learning Under Partial Observability
Viaarxiv icon

Cooperative-Competitive Reinforcement Learning with History-Dependent Rewards

Add code
Oct 15, 2020
Figure 1 for Cooperative-Competitive Reinforcement Learning with History-Dependent Rewards
Figure 2 for Cooperative-Competitive Reinforcement Learning with History-Dependent Rewards
Figure 3 for Cooperative-Competitive Reinforcement Learning with History-Dependent Rewards
Figure 4 for Cooperative-Competitive Reinforcement Learning with History-Dependent Rewards
Viaarxiv icon

Maximum Entropy Multi-Task Inverse RL

Add code
Apr 27, 2020
Figure 1 for Maximum Entropy Multi-Task Inverse RL
Figure 2 for Maximum Entropy Multi-Task Inverse RL
Figure 3 for Maximum Entropy Multi-Task Inverse RL
Viaarxiv icon

A Framework and Method for Online Inverse Reinforcement Learning

Add code
May 21, 2018
Figure 1 for A Framework and Method for Online Inverse Reinforcement Learning
Figure 2 for A Framework and Method for Online Inverse Reinforcement Learning
Viaarxiv icon