Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Richard Everett

Identifying Sources and Sinks in the Presence of Multiple Agents with Gaussian Process Vector Calculus

Feb 22, 2018

Adam D. Cobb, Richard Everett, Andrew Markham, Stephen J. Roberts

Figure 1 for Identifying Sources and Sinks in the Presence of Multiple Agents with Gaussian Process Vector Calculus

Figure 2 for Identifying Sources and Sinks in the Presence of Multiple Agents with Gaussian Process Vector Calculus

Figure 3 for Identifying Sources and Sinks in the Presence of Multiple Agents with Gaussian Process Vector Calculus

Figure 4 for Identifying Sources and Sinks in the Presence of Multiple Agents with Gaussian Process Vector Calculus

Abstract:In systems of multiple agents, identifying the cause of observed agent dynamics is challenging. Often, these agents operate in diverse, non-stationary environments, where models rely on hand-crafted environment-specific features to infer influential regions in the system's surroundings. To overcome the limitations of these inflexible models, we present GP-LAPLACE, a technique for locating sources and sinks from trajectories in time-varying fields. Using Gaussian processes, we jointly infer a spatio-temporal vector field, as well as canonical vector calculus operations on that field. Notably, we do this from only agent trajectories without requiring knowledge of the environment, and also obtain a metric for denoting the significance of inferred causal features in the environment by exploiting our probabilistic method. To evaluate our approach, we apply it to both synthetic and real-world GPS data, demonstrating the applicability of our technique in the presence of multiple agents, as well as its superiority over existing methods.

* 9 pages, 5 figures, conference submission, University of Oxford

Via

Access Paper or Ask Questions

Inferring agent objectives at different scales of a complex adaptive system

Dec 04, 2017

Dieter Hendricks, Adam Cobb, Richard Everett, Jonathan Downing, Stephen J. Roberts

Figure 1 for Inferring agent objectives at different scales of a complex adaptive system

Figure 2 for Inferring agent objectives at different scales of a complex adaptive system

Abstract:We introduce a framework to study the effective objectives at different time scales of financial market microstructure. The financial market can be regarded as a complex adaptive system, where purposeful agents collectively and simultaneously create and perceive their environment as they interact with it. It has been suggested that multiple agent classes operate in this system, with a non-trivial hierarchy of top-down and bottom-up causation classes with different effective models governing each level. We conjecture that agent classes may in fact operate at different time scales and thus act differently in response to the same perceived market state. Given scale-specific temporal state trajectories and action sequences estimated from aggregate market behaviour, we use Inverse Reinforcement Learning to compute the effective reward function for the aggregate agent class at each scale, allowing us to assess the relative attractiveness of feature vectors across different scales. Differences in reward functions for feature vectors may indicate different objectives of market participants, which could assist in finding the scale boundary for agent classes. This has implications for learning algorithms operating in this domain.

* 6 pages, 3 figures, NIPS 2017 Workshop on Learning in the Presence of Strategic Behaviour (MLStrat)

Via

Access Paper or Ask Questions