Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Branko Ristic

Reinforcement Learning Trained Observer Control for Bearings-Only Tracking

May 04, 2026

Branko Ristic, Sanjeev Arulampalam

Abstract:This paper develops a deep reinforcement learning based observer control policy for autonomous bearings-only tracking of a moving target. The observer manoeuvre problem is formulated as a belief Markov decision process, where the belief state is represented by the posterior of a cubature Kalman filter (CKF). The reward function is designed to address two conflicting objectives: minimising the absolute target position estimation error (Euclidean distance) and maintaining CKF estimation consistency (Mahalanobis distance). The reward is formulated as a geometric interpolation between the two objectives on the Pareto front, parametrised by a weighting factor $β\in [0,1]$. The policy is implemented as a deep Q-network (DQN) trained over 50,000 episodes. Performance is evaluated over 5,000 Monte Carlo episodes and compared against two baselines: the perpendicular-to-bearing heuristic and the D-optimal Fisher information maximisation criterion. The results show that the DQN policy at $β= 0.7$ achieves the best trade-off between accuracy and robustness: it matches the information-theoretic baseline on mean tracking accuracy while reducing the worst-case error by nearly a factor of ten, owing to the implicit filter-consistency regularisation provided by the Mahalanobis term in the reward.

* 7 pages, 2 figures, 3 tables

Via

Access Paper or Ask Questions

Chernoff fusion of Bernoulli Gaussian max filters

Oct 30, 2024

Zhijin Chen, Branko Ristic, Du Yong Kim

Abstract:Statistical dependencies between information sources are rarely known, yet in practical distributed tracking schemes, they must be taken into account in order to prevent track divergences. Chernoff fusion is well-known and universally accepted method that can address the problem of track fusion when the statistical dependence between the fusing sources is unknown. In this paper we derive the exact Chernoff fusion equations for Bernoulli Gaussian max filters. These filters have been recently derived in the framework of possibility theory, as the analog of the Bernoulli Gaussian sum filters. The main motivation for the possibilistic approach is that it effectively deals with imprecise mathematical models (e.g. dynamics, measurements) used in tracking algorithms. The paper also demonstrates the proposed possibilistic fusion scheme in the absence of knowledge about statistical dependence.

Via

Access Paper or Ask Questions

Credal Valuation Networks for Machine Reasoning Under Uncertainty

Aug 04, 2022

Branko Ristic, Alessio Benavoli, Sanjeev Arulampalam

Figure 1 for Credal Valuation Networks for Machine Reasoning Under Uncertainty

Figure 2 for Credal Valuation Networks for Machine Reasoning Under Uncertainty

Figure 3 for Credal Valuation Networks for Machine Reasoning Under Uncertainty

Figure 4 for Credal Valuation Networks for Machine Reasoning Under Uncertainty

Abstract:Contemporary undertakings provide limitless opportunities for widespread application of machine reasoning and artificial intelligence in situations characterised by uncertainty, hostility and sheer volume of data. The paper develops a valuation network as a graphical system for higher-level fusion and reasoning under uncertainty in support of the human operators. Valuations, which are mathematical representation of (uncertain) knowledge and collected data, are expressed as credal sets, defined as coherent interval probabilities in the framework of imprecise probability theory. The basic operations with such credal sets, combination and marginalisation, are defined to satisfy the axioms of a valuation algebra. A practical implementation of the credal valuation network is discussed and its utility demonstrated on a small scale example.

* 16 pages, 3 figures

Via

Access Paper or Ask Questions

Autonomous search for a diffusive source in an unknown environment

Jun 07, 2013

Branko Ristic, Alex Skvortsov, Andrew Walker

Figure 1 for Autonomous search for a diffusive source in an unknown environment

Figure 2 for Autonomous search for a diffusive source in an unknown environment

Figure 3 for Autonomous search for a diffusive source in an unknown environment

Figure 4 for Autonomous search for a diffusive source in an unknown environment

Abstract:The paper presents an approach to olfactory search for a diffusive emitting source of tracer (e.g. aerosol, gas) in an environment with unknown map of randomly placed and shaped obstacles. The measurements of tracer concentration are sporadic, noisy and without directional information. The search domain is discretised and modelled by a finite two-dimensional lattice. The links is the lattice represent the traversable paths for emitted particles and for the searcher. A missing link in the lattice indicates a blocked paths, due to the walls or obstacles. The searcher must simultaneously estimate the source parameters, the map of the search domain and its own location within the map. The solution is formulated in the sequential Bayesian framework and implemented as a Rao-Blackwellised particle filter with information-driven motion control. The numerical results demonstrate the concept and its performance.

* 11 pages, 7 figures

Via

Access Paper or Ask Questions

Performance Evaluation of Random Set Based Pedestrian Tracking Algorithms

Oct 25, 2012

Branko Ristic, Jamie Sherrah, Ángel F. García-Fernández

Figure 1 for Performance Evaluation of Random Set Based Pedestrian Tracking Algorithms

Figure 2 for Performance Evaluation of Random Set Based Pedestrian Tracking Algorithms

Figure 3 for Performance Evaluation of Random Set Based Pedestrian Tracking Algorithms

Figure 4 for Performance Evaluation of Random Set Based Pedestrian Tracking Algorithms

Abstract:The paper evaluates the error performance of three random finite set based multi-object trackers in the context of pedestrian video tracking. The evaluation is carried out using a publicly available video dataset of 4500 frames (town centre street) for which the ground truth is available. The input to all pedestrian tracking algorithms is an identical set of head and body detections, obtained using the Histogram of Oriented Gradients (HOG) detector. The tracking error is measured using the recently proposed OSPA metric for tracks, adopted as the only known mathematically rigorous metric for measuring the distance between two sets of tracks. A comparative analysis is presented under various conditions.

* 6 pages, 3 figures

Via

Access Paper or Ask Questions