Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:An Online Data-Driven Emergency-Response Method for Autonomous Agents in Unforeseen Situations

Dec 17, 2021

Glenn Maguire, Nicholas Ketz, Praveen Pilly, Jean-Baptiste Mouret

Figure 1 for An Online Data-Driven Emergency-Response Method for Autonomous Agents in Unforeseen Situations

Figure 2 for An Online Data-Driven Emergency-Response Method for Autonomous Agents in Unforeseen Situations

Figure 3 for An Online Data-Driven Emergency-Response Method for Autonomous Agents in Unforeseen Situations

Figure 4 for An Online Data-Driven Emergency-Response Method for Autonomous Agents in Unforeseen Situations

Share this with someone who'll enjoy it:

Abstract:Reinforcement learning agents perform well when presented with inputs within the distribution of those encountered during training. However, they are unable to respond effectively when faced with novel, out-of-distribution events, until they have undergone additional training. This paper presents an online, data-driven, emergency-response method that aims to provide autonomous agents the ability to react to unexpected situations that are very different from those it has been trained or designed to address. In such situations, learned policies cannot be expected to perform appropriately since the observations obtained in these novel situations would fall outside the distribution of inputs that the agent has been optimized to handle. The proposed approach devises a customized response to the unforeseen situation sequentially, by selecting actions that minimize the rate of increase of the reconstruction error from a variational auto-encoder. This optimization is achieved online in a data-efficient manner (on the order of 30 data-points) using a modified Bayesian optimization procedure. We demonstrate the potential of this approach in a simulated 3D car driving scenario, in which the agent devises a response in under 2 seconds to avoid collisions with objects it has not seen during training.

View paper on

Share this with someone who'll enjoy it:

Title:An Online Data-Driven Emergency-Response Method for Autonomous Agents in Unforeseen Situations

Paper and Code