Alessandro Betti

i-EM S.r.l

Evaluating Continual Learning Algorithms by Generating 3D Virtual Environments

Sep 16, 2021
Enrico Meloni, Alessandro Betti, Lapo Faggi, Simone Marullo, Matteo Tiezzi, Stefano Melacci

Continual learning refers to the ability of humans and animals to incrementally learn over time in a given environment. Trying to simulate this learning process in machines is a challenging task, also due to the inherent difficulty in creating conditions for designing continuously evolving dynamics that are typical of the real world. Many existing research works usually involve training and testing of virtual agents on datasets of static images or short videos, considering sequences of distinct learning tasks. However, in order to devise continual learning algorithms that operate in more realistic conditions, it is fundamental to gain access to rich, fully customizable and controlled experimental playgrounds. Focussing on the specific case of vision, we thus propose to leverage recent advances in 3D virtual environments in order to approach the automatic generation of potentially life-long dynamic scenes with photo-realistic appearance. Scenes are composed of objects that move along variable routes with different and fully customizable timings, and randomness can also be included in their evolution. A novel element of this paper is that scenes are described in a parametric way, thus allowing the user to fully control the visual complexity of the input stream the agent perceives. These general principles are concretely implemented by exploiting a recently published 3D virtual environment. The user can generate scenes without needing strong skills in computer graphics, since all the generation facilities are exposed through a simple high-level Python interface. We publicly share the proposed generator.

* 8 pages, 7 figures, accepted at the 1st International Workshop on Continual Semi-Supervised Learning (CSSL) @ IJCAI 2021
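
To illustrate what a parametric description of a moving object might look like, here is a minimal sketch in plain Python. It is not the interface of the generator described in the abstract: the function name `route_position` and its parameters (`waypoints`, `speed`, `jitter`, `rng`) are hypothetical, chosen only to show how a route, a timing and optional randomness can be captured by a few numbers.

```python
import math
import random


def route_position(waypoints, speed, t, jitter=0.0, rng=None):
    """Position at time t of an object looping over waypoints at constant speed.

    Hypothetical helper, not part of any published generator.
    waypoints: list of (x, y, z) tuples, visited in order and then closed in a loop.
    speed:     distance units travelled per second.
    jitter:    standard deviation of Gaussian noise added to each coordinate.
    """
    # Build the segments of the closed route and drop zero-length ones.
    segments = []
    for i, start in enumerate(waypoints):
        end = waypoints[(i + 1) % len(waypoints)]
        length = math.dist(start, end)
        if length > 0:
            segments.append((start, end, length))
    if not segments:
        return tuple(waypoints[0])

    # Distance travelled along the loop, wrapped to one lap.
    total = sum(length for _, _, length in segments)
    d = (t * speed) % total

    # Walk the segments until the one containing distance d is found.
    pos = segments[0][0]
    for start, end, length in segments:
        if d <= length:
            u = d / length
            pos = tuple(a + u * (b - a) for a, b in zip(start, end))
            break
        d -= length

    # Optional randomness in the evolution of the scene.
    if jitter > 0:
        rng = rng or random.Random(0)
        pos = tuple(c + rng.gauss(0.0, jitter) for c in pos)
    return pos


square = [(0, 0, 0), (1, 0, 0), (1, 1, 0), (0, 1, 0)]
for t in (0.5, 1.5, 4.5):
    print(t, route_position(square, speed=1.0, t=t))
```

Changing `speed` alters the timing, changing `waypoints` alters the route, and a non-zero `jitter` adds randomness, which is the kind of control over visual complexity the abstract refers to.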

An Optimal Control Approach to Learning in SIDARTHE Epidemic model

Oct 28, 2020
Andrea Zugarini, Enrico Meloni, Alessandro Betti, Andrea Panizza, Marco Corneli, Marco Gori

The COVID-19 outbreak has stimulated interest in novel epidemiological models that predict the course of the epidemic, so as to help plan effective control strategies. In particular, in order to properly interpret the available data, it has become clear that one must go beyond most classic epidemiological models and consider models that, like the recently proposed SIDARTHE, offer a richer description of the stages of infection. The problem of learning the parameters of these models is of crucial importance, especially when assuming that they are time-variant, which further enriches their effectiveness. In this paper we propose a general approach for learning time-variant parameters of dynamic compartmental models from epidemic data. We formulate the problem in terms of a functional risk that depends on the learning variables through the solutions of a dynamic system. The resulting variational problem is then solved by using a gradient flow on a suitable, regularized functional. We forecast the epidemic evolution in Italy and France. Results indicate that the model provides reliable and challenging predictions over all available data, and they highlight the fundamental role of the strategy chosen for the time-variant parameters.

* 11 pages, 7 figures, submitted to TNNLS

Developing Constrained Neural Units Over Time

Sep 01, 2020
Alessandro Betti, Marco Gori, Simone Marullo, Stefano Melacci

In this paper we present a foundational study on a constrained method that defines learning problems with Neural Networks in the context of the principle of least cognitive action, which very much resembles the principle of least action in mechanics. Starting from a general approach to enforcing constraints in the dynamical laws of learning, this work focuses on an alternative way of defining Neural Networks that differs from the majority of existing approaches. In particular, the structure of the neural architecture is defined by means of a special class of constraints that are also extended to the interaction with data, leading to "architectural" and "input-related" constraints, respectively. The proposed theory is cast into the time domain, in which data are presented to the network in an ordered manner, which makes this study an important step toward alternative ways of processing continuous streams of data with Neural Networks. The connection with the classic Backpropagation-based update rule of the weights of networks is discussed, showing that there are conditions under which our approach degenerates to Backpropagation. Moreover, the theory is experimentally evaluated on a simple problem that allows us to deeply study several aspects of the theory itself and to show the soundness of the model.

Wave Propagation of Visual Stimuli in Focus of Attention

Jun 19, 2020
Lapo Faggi, Alessandro Betti, Dario Zanca, Stefano Melacci, Marco Gori

Fast reactions to changes in the surrounding visual environment require efficient attention mechanisms to reallocate computational resources to the most relevant locations in the visual field. While current computational models keep improving their predictive ability thanks to the increasing availability of data, they still struggle to approximate the effectiveness and efficiency exhibited by foveated animals. In this paper, we present a biologically-plausible computational model of focus of attention that exhibits spatiotemporal locality and that is very well-suited for parallel and distributed implementations. Attention emerges as a wave propagation process originated by visual stimuli corresponding to details and motion information. The resulting field obeys the principle of "inhibition of return" so as not to get stuck in potential holes. An accurate experimentation of the model shows that it achieves top-level performance in scanpath prediction tasks. This can easily be understood in light of a theoretical result that we establish in the paper, where we prove that as the velocity of wave propagation goes to infinity, the proposed model reduces to recently proposed state-of-the-art gravitational models of focus of attention.

Focus of Attention Improves Information Transfer in Visual Features

Jun 16, 2020
Matteo Tiezzi, Stefano Melacci, Alessandro Betti, Marco Maggini, Marco Gori

Unsupervised learning from continuous visual streams is a challenging problem that cannot be naturally and efficiently managed in the classic batch-mode setting of computation. The information stream must be carefully processed according to an appropriate spatio-temporal distribution of the visual data, while most learning approaches commonly assume uniform probability density. In this paper we focus on unsupervised learning for transferring visual information in a truly online setting by using a computational model that is inspired by the principle of least action in physics. The maximization of the mutual information is carried out by a temporal process which yields online estimation of the entropy terms. The model, which is based on second-order differential equations, maximizes the information transfer from the input to a discrete space of symbols related to the visual features of the input, whose computation is supported by hidden neurons. In order to better structure the input probability distribution, we use a human-like focus of attention model that, coherently with the information maximization model, is also based on second-order differential equations. We provide experimental results to support the theory by showing that the spatio-temporal filtering induced by the focus of attention allows the system to globally transfer more information from the input stream over the focused areas and, in some contexts, over the whole frames with respect to the unfiltered case that yields uniform probability distributions.
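
The quantity being maximized in the abstract above, the mutual information between the input and a discrete set of symbols, can be written as the entropy of the average symbol distribution minus the average entropy of the per-input distributions. The sketch below is a plain batch estimate under stated assumptions, not the online, differential-equation-based procedure of the paper: the function names and the `weights` argument (standing in for a non-uniform, attention-driven input distribution) are illustrative only.

```python
import math


def entropy(p):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(q * math.log(q) for q in p if q > 0)


def mutual_information(cond_probs, weights=None):
    """Estimate I(X; Y) = H(Y) - H(Y | X) from per-input symbol distributions.

    cond_probs: list of distributions p(y | x_i), one per input location.
    weights:    relative importance of each input (assumed to sum to a
                positive value); uniform when omitted.
    """
    n = len(cond_probs)
    if weights is None:
        weights = [1.0] * n
    total = sum(weights)
    weights = [w / total for w in weights]

    # Marginal distribution over symbols: weighted average of the conditionals.
    k = len(cond_probs[0])
    marginal = [sum(w * p[j] for w, p in zip(weights, cond_probs)) for j in range(k)]

    # Conditional entropy: weighted average of the per-input entropies.
    cond_entropy = sum(w * entropy(p) for w, p in zip(weights, cond_probs))
    return entropy(marginal) - cond_entropy


# Two inputs mapped confidently to different symbols carry log(2) nats.
print(mutual_information([[1.0, 0.0], [0.0, 1.0]]))
# Two inputs with identical, uninformative outputs carry no information.
print(mutual_information([[0.5, 0.5], [0.5, 0.5]]))
```

Passing larger `weights` for attended locations changes which inputs dominate both entropy terms, which is one simple way to picture the effect of replacing a uniform input distribution with one shaped by a focus of attention.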

Local Propagation in Constraint-based Neural Network

Feb 18, 2020
Giuseppe Marra, Matteo Tiezzi, Stefano Melacci, Alessandro Betti, Marco Maggini, Marco Gori

In this paper we study a constraint-based representation of neural network architectures. We cast the learning problem in the Lagrangian framework and we investigate a simple optimization procedure that is well suited to fulfil the so-called architectural constraints, learning from the available supervisions. The computational structure of the proposed Local Propagation (LP) algorithm is based on the search for saddle points in the adjoint space composed of weights, neural outputs, and Lagrange multipliers. All the updates of the model variables are locally performed, so that LP is fully parallelizable over the neural units, circumventing the classic problem of gradient vanishing in deep networks. The implementation of popular neural models is described in the context of LP, together with those conditions that trace a natural connection with Backpropagation. We also investigate the setting in which we tolerate bounded violations of the architectural constraints, and we provide experimental evidence that LP is a feasible approach to train shallow and deep networks, opening the road to further investigations on more complex architectures, easily describable by constraints.

Real-Time target detection in maritime scenarios based on YOLOv3 model

Feb 10, 2020
Alessandro Betti, Benedetto Michelozzi, Andrea Bracci, Andrea Masini

In this work a novel ship dataset is proposed, consisting of more than 56k images of marine vessels collected by means of web-scraping and including 12 ship categories. A YOLOv3 single-stage detector based on the Keras API is built on top of this dataset. Current results on four categories (cargo ship, naval ship, oil ship and tug ship) show Average Precision up to 96% for an Intersection over Union (IoU) of 0.5 and satisfactory detection performance up to an IoU of 0.8. A Data Analytics GUI service based on the Qt framework and the Darknet-53 engine is also implemented in order to simplify the deployment process and to analyse massive amounts of images, even for people without Data Science expertise.

* Paper presented at the 9th International Symposium on Optronics in Defence & Security, 28-30 January 2020 (OPTRO2020, Paris). Oral Presentation
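
The IoU thresholds quoted in the abstract above (0.5 and 0.8) decide when a predicted box counts as a correct detection. A minimal sketch of that check follows; it assumes boxes given as `(x1, y1, x2, y2)` corner coordinates, and the helper names `iou` and `is_true_positive` are illustrative, not taken from the paper's code.

```python
def iou(box_a, box_b):
    """Intersection over Union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlap rectangle, clamped at zero when disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(pred_box, gt_box, threshold=0.5):
    """A prediction matches a ground-truth box when their IoU reaches the threshold."""
    return iou(pred_box, gt_box) >= threshold


pred, truth = (0, 0, 2, 2), (0, 0, 2, 1)
print(iou(pred, truth))                      # overlap 2, union 4
print(is_true_positive(pred, truth, 0.5))    # accepted at the looser threshold
print(is_true_positive(pred, truth, 0.8))    # rejected at the stricter one
```

Raising the threshold from 0.5 to 0.8 demands tighter localization, which is why precision figures are typically reported per threshold.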

Backprop Diffusion is Biologically Plausible

Dec 10, 2019
Alessandro Betti, Marco Gori

The Backpropagation algorithm relies on the abstraction of using a neural model that gets rid of the notion of time, since the input is mapped instantaneously to the output. In this paper, we claim that this abstraction of ignoring time, along with the abrupt input changes that occur when feeding the training set, are in fact the reasons why, in some papers, the biological plausibility of Backprop is regarded as an arguable issue. We show that as soon as a deep feedforward network operates with neurons with time-delayed response, the backprop weight update turns out to be the basic equation of a biologically plausible diffusion process based on forward-backward waves. We also show that such a process very well approximates the gradient for inputs that are not too fast with respect to the depth of the network. These remarks somewhat disclose the diffusion process behind the backprop equation and lead us to interpret the corresponding algorithm as a degeneration of a more general diffusion process that also takes place in neural networks with cyclic connections.

* 6 pages, 3 figures

Condition monitoring and early diagnostics methodologies for hydropower plants

Nov 13, 2019
Alessandro Betti, Emanuele Crisostomi, Gianluca Paolinelli, Antonio Piazzi, Fabrizio Ruffini, Mauro Tucci

Hydropower plants are one of the most convenient options for power generation, as they generate energy exploiting a renewable source, they have relatively low operating and maintenance costs, and they may be used to provide ancillary services, exploiting the large reservoirs of available water. The recent advances in Information and Communication Technologies (ICT) and in machine learning methodologies are seen as fundamental enablers to upgrade and modernize the current operation of most hydropower plants, in terms of condition monitoring, early diagnostics and eventually predictive maintenance. While very few works, or running technologies, have been documented so far for the hydro case, in this paper we propose a novel Key Performance Indicator (KPI) that we have recently developed and tested on operating hydropower plants. In particular, we show that after more than one year of operation it has been able to identify several faults, and to support the operation and maintenance tasks of plant operators. Also, we show that the proposed KPI outperforms conventional multivariable process control charts, like the Hotelling $T^2$ index.

* 8 pages, 4 figures. This work has been submitted to the Elsevier Renewable Energy for possible publication
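
For context on the baseline named in the abstract above, the Hotelling $T^2$ statistic measures how far a new observation $x$ lies from the mean $\mu$ of healthy reference data, scaled by the sample covariance $S$: $T^2 = (x - \mu)^\top S^{-1} (x - \mu)$. The sketch below restricts itself to two monitored signals so that the covariance inverse can be written by hand; the function name and the toy readings are assumptions for illustration, and the paper's own KPI is a different method that is not reproduced here.

```python
from statistics import mean


def hotelling_t2_2d(reference, x):
    """Hotelling T^2 of observation x against bivariate reference data.

    reference: list of (a, b) readings collected under normal operation.
    x:         the (a, b) reading to be scored.
    """
    n = len(reference)
    m0 = mean(r[0] for r in reference)
    m1 = mean(r[1] for r in reference)

    # Entries of the unbiased sample covariance matrix S.
    s00 = sum((r[0] - m0) ** 2 for r in reference) / (n - 1)
    s11 = sum((r[1] - m1) ** 2 for r in reference) / (n - 1)
    s01 = sum((r[0] - m0) * (r[1] - m1) for r in reference) / (n - 1)

    det = s00 * s11 - s01 ** 2
    if det <= 0:
        raise ValueError("reference covariance is singular")

    # d^T S^{-1} d, using the closed-form inverse of a 2x2 matrix.
    d0, d1 = x[0] - m0, x[1] - m1
    return (s11 * d0 * d0 - 2 * s01 * d0 * d1 + s00 * d1 * d1) / det


healthy = [(0, 0), (2, 0), (0, 2), (2, 2)]
print(hotelling_t2_2d(healthy, (1, 1)))  # reading at the reference mean
print(hotelling_t2_2d(healthy, (3, 1)))  # reading shifted along the first signal
```

In a control chart, each new reading is scored this way and flagged when the score exceeds a limit chosen from the reference data.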

A Scalable Predictive Maintenance Model for Detecting Wind Turbine Component Failures Based on SCADA Data

Oct 22, 2019
Lorenzo Gigoni, Alessandro Betti, Mauro Tucci, Emanuele Crisostomi

In this work, a novel predictive maintenance system is presented and applied to the main components of wind turbines. The proposed model is based on machine learning and statistical process control tools applied to SCADA (Supervisory Control And Data Acquisition) data of critical components. The test campaign was divided into two stages: a first, two-year-long offline test, and a second, one-year-long real-time test. The offline test used historical faults from six wind farms located in Italy and Romania, corresponding to a total of 150 wind turbines and an overall installed nominal power of 283 MW. The results demonstrate outstanding capabilities of anomaly prediction up to 2 months before unscheduled device downtime. Furthermore, the real-time 12-month test confirms the ability of the proposed system to detect several anomalies, therefore allowing the operators to identify the root causes and to schedule maintenance actions before reaching a catastrophic stage.

* Paper presented at the conference IEEE PES General Meeting 2019, August 4-8 (Atlanta, USA)
