Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Giovanni Bacci

MM Algorithms to Estimate Parameters in Continuous-time Markov Chains

Feb 16, 2023

Giovanni Bacci, Anna Ingólfsdóttir, Kim G. Larsen, Raphaël Reynouard

Figure 1 for MM Algorithms to Estimate Parameters in Continuous-time Markov Chains

Figure 2 for MM Algorithms to Estimate Parameters in Continuous-time Markov Chains

Figure 3 for MM Algorithms to Estimate Parameters in Continuous-time Markov Chains

Figure 4 for MM Algorithms to Estimate Parameters in Continuous-time Markov Chains

Abstract:Continuous-time Markov chains (CTMCs) are popular modeling formalism that constitutes the underlying semantics for real-time probabilistic systems such as queuing networks, stochastic process algebras, and calculi for systems biology. Prism and Storm are popular model checking tools that provide a number of powerful analysis techniques for CTMCs. These tools accept models expressed as the parallel composition of a number of modules interacting with each other. The outcome of the analysis is strongly dependent on the parameter values used in the model which govern the timing and probability of events of the resulting CTMC. However, for some applications, parameter values have to be empirically estimated from partially-observable executions. In this work, we address the problem of estimating parameter values of CTMCs expressed as Prism models from a number of partially-observable executions. We introduce the class parametric CTMCs -- CTMCs where transition rates are polynomial functions over a set of parameters -- as an abstraction of CTMCs covering a large class of Prism models. Then, building on a theory of algorithms known by the initials MM, for minorization-maximization, we present iterative maximum likelihood estimation algorithms for parametric CTMCs covering two learning scenarios: when both state-labels and dwell times are observable, or just state-labels are. We conclude by illustrating the use of our technique in a simple but non-trivial case study: the analysis of the spread of COVID-19 in presence of lockdown countermeasures.

Via

Access Paper or Ask Questions

Active Learning of Markov Decision Processes using Baum-Welch algorithm (Extended)

Oct 06, 2021

Giovanni Bacci, Anna Ingólfsdóttir, Kim Larsen, Raphaël Reynouard

Figure 1 for Active Learning of Markov Decision Processes using Baum-Welch algorithm (Extended)

Figure 2 for Active Learning of Markov Decision Processes using Baum-Welch algorithm (Extended)

Figure 3 for Active Learning of Markov Decision Processes using Baum-Welch algorithm (Extended)

Figure 4 for Active Learning of Markov Decision Processes using Baum-Welch algorithm (Extended)

Abstract:Cyber-physical systems (CPSs) are naturally modelled as reactive systems with nondeterministic and probabilistic dynamics. Model-based verification techniques have proved effective in the deployment of safety-critical CPSs. Central for a successful application of such techniques is the construction of an accurate formal model for the system. Manual construction can be a resource-demanding and error-prone process, thus motivating the design of automata learning algorithms to synthesise a system model from observed system behaviours. This paper revisits and adapts the classic Baum-Welch algorithm for learning Markov decision processes and Markov chains. For the case of MDPs, which typically demand more observations, we present a model-based active learning sampling strategy that choses examples which are most informative w.r.t.\ the current model hypothesis. We empirically compare our approach with state-of-the-art tools and demonstrate that the proposed active learning procedure can significantly reduce the number of observations required to obtain accurate models.

* 7 pages, 7 figures, submitted and accepted (short) to ICMLA 2021

Via

Access Paper or Ask Questions

Approximating Euclidean by Imprecise Markov Decision Processes

Jun 26, 2020

Manfred Jaeger, Giorgio Bacci, Giovanni Bacci, Kim Guldstrand Larsen, Peter Gjøl Jensen

Figure 1 for Approximating Euclidean by Imprecise Markov Decision Processes

Figure 2 for Approximating Euclidean by Imprecise Markov Decision Processes

Figure 3 for Approximating Euclidean by Imprecise Markov Decision Processes

Figure 4 for Approximating Euclidean by Imprecise Markov Decision Processes

Abstract:Euclidean Markov decision processes are a powerful tool for modeling control problems under uncertainty over continuous domains. Finite state imprecise, Markov decision processes can be used to approximate the behavior of these infinite models. In this paper we address two questions: first, we investigate what kind of approximation guarantees are obtained when the Euclidean process is approximated by finite state approximations induced by increasingly fine partitions of the continuous state space. We show that for cost functions over finite time horizons the approximations become arbitrarily precise. Second, we use imprecise Markov decision process approximations as a tool to analyse and validate cost functions and strategies obtained by reinforcement learning. We find that, on the one hand, our new theoretical results validate basic design choices of a previously proposed reinforcement learning approach. On the other hand, the imprecise Markov decision process approximations reveal some inaccuracies in the learned cost functions.

Via

Access Paper or Ask Questions

**L*-Based Learning of Markov Decision Processes**

Jun 28, 2019

Martin Tappler, Bernhard K. Aichernig, Giovanni Bacci, Maria Eichlseder, Kim G. Larsen

Figure 1 for L*-Based Learning of Markov Decision Processes

Figure 2 for L*-Based Learning of Markov Decision Processes

Figure 3 for L*-Based Learning of Markov Decision Processes

Figure 4 for L*-Based Learning of Markov Decision Processes

Abstract:Automata learning techniques automatically generate system models from test observations. These techniques usually fall into two categories: passive and active. Passive learning uses a predetermined data set, e.g., system logs. In contrast, active learning actively queries the system under learning, which is considered more efficient. An influential active learning technique is Angluin's L* algorithm for regular languages which inspired several generalisations from DFAs to other automata-based modelling formalisms. In this work, we study L*-based learning of deterministic Markov decision processes, first assuming an ideal setting with perfect information. Then, we relax this assumption and present a novel learning algorithm that collects information by sampling system traces via testing. Experiments with the implementation of our sampling-based algorithm suggest that it achieves better accuracy than state-of-the-art passive learning techniques with the same amount of test data. Unlike existing learning algorithms with predefined states, our algorithm learns the complete model structure including the states.

* an extended version of a conference paper accepted for presentation at FM 2019, the 23rd international symposium on formal methods

Via

Access Paper or Ask Questions