Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

German I. Parisi

On the role of neurogenesis in overcoming catastrophic forgetting

Nov 06, 2018
German I. Parisi, Xu Ji, Stefan Wermter

Figure 1 for On the role of neurogenesis in overcoming catastrophic forgetting

Figure 2 for On the role of neurogenesis in overcoming catastrophic forgetting

Figure 3 for On the role of neurogenesis in overcoming catastrophic forgetting

Figure 4 for On the role of neurogenesis in overcoming catastrophic forgetting

Lifelong learning capabilities are crucial for artificial autonomous agents operating on real-world data, which is typically non-stationary and temporally correlated. In this work, we demonstrate that dynamically grown networks outperform static networks in incremental learning scenarios, even when bounded by the same amount of memory in both cases. Learning is unsupervised in our models, a condition that additionally makes training more challenging whilst increasing the realism of the study, since humans are able to learn without dense manual annotation. Our results on artificial neural networks reinforce that structural plasticity constitutes effective prevention against catastrophic forgetting in non-stationary environments, as well as empirically supporting the importance of neurogenesis in the mammalian brain.

* Submitted to NIPS'18 Workshop on Continual Learning

Via

Access Paper or Ask Questions

Lifelong Learning of Spatiotemporal Representations with Dual-Memory Recurrent Self-Organization

Oct 30, 2018
German I. Parisi, Jun Tani, Cornelius Weber, Stefan Wermter

Figure 1 for Lifelong Learning of Spatiotemporal Representations with Dual-Memory Recurrent Self-Organization

Figure 2 for Lifelong Learning of Spatiotemporal Representations with Dual-Memory Recurrent Self-Organization

Figure 3 for Lifelong Learning of Spatiotemporal Representations with Dual-Memory Recurrent Self-Organization

Figure 4 for Lifelong Learning of Spatiotemporal Representations with Dual-Memory Recurrent Self-Organization

Artificial autonomous agents and robots interacting in complex environments are required to continually acquire and fine-tune knowledge over sustained periods of time. The ability to learn from continuous streams of information is referred to as lifelong learning and represents a long-standing challenge for neural network models due to catastrophic forgetting. Computational models of lifelong learning typically alleviate catastrophic forgetting in experimental scenarios with given datasets of static images and limited complexity, thereby differing significantly from the conditions artificial agents are exposed to. In more natural settings, sequential information may become progressively available over time and access to previous experience may be restricted. In this paper, we propose a dual-memory self-organizing architecture for lifelong learning scenarios. The architecture comprises two growing recurrent networks with the complementary tasks of learning object instances (episodic memory) and categories (semantic memory). Both growing networks can expand in response to novel sensory experience: the episodic memory learns fine-grained spatiotemporal representations of object instances in an unsupervised fashion while the semantic memory uses task-relevant signals to regulate structural plasticity levels and develop more compact representations from episodic experience. For the consolidation of knowledge in the absence of external sensory input, the episodic memory periodically replays trajectories of neural reactivations. We evaluate the proposed model on the CORe50 benchmark dataset for continuous object recognition, showing that we significantly outperform current methods of lifelong learning in three different incremental learning scenarios

Via

Access Paper or Ask Questions

Assessing the Contribution of Semantic Congruency to Multisensory Integration and Conflict Resolution

Oct 15, 2018
Di Fu, Pablo Barros, German I. Parisi, Haiyan Wu, Sven Magg, Xun Liu, Stefan Wermter

Figure 1 for Assessing the Contribution of Semantic Congruency to Multisensory Integration and Conflict Resolution

Figure 2 for Assessing the Contribution of Semantic Congruency to Multisensory Integration and Conflict Resolution

Figure 3 for Assessing the Contribution of Semantic Congruency to Multisensory Integration and Conflict Resolution

Figure 4 for Assessing the Contribution of Semantic Congruency to Multisensory Integration and Conflict Resolution

The efficient integration of multisensory observations is a key property of the brain that yields the robust interaction with the environment. However, artificial multisensory perception remains an open issue especially in situations of sensory uncertainty and conflicts. In this work, we extend previous studies on audio-visual (AV) conflict resolution in complex environments. In particular, we focus on quantitatively assessing the contribution of semantic congruency during an AV spatial localization task. In addition to conflicts in the spatial domain (i.e. spatially misaligned stimuli), we consider gender-specific conflicts with male and female avatars. Our results suggest that while semantically related stimuli affect the magnitude of the visual bias (perceptually shifting the location of the sound towards a semantically congruent visual cue), humans still strongly rely on environmental statistics to solve AV conflicts. Together with previously reported results, this work contributes to a better understanding of how multisensory integration and conflict resolution can be modelled in artificial agents and robots operating in real-world environments.

* Workshop on Crossmodal Learning for Intelligent Robotics at IROS'18, Madrid, Spain

Via

Access Paper or Ask Questions

A Neurorobotic Experiment for Crossmodal Conflict Resolution in Complex Environments

Sep 24, 2018
German I. Parisi, Pablo Barros, Di Fu, Sven Magg, Haiyan Wu, Xun Liu, Stefan Wermter

Figure 1 for A Neurorobotic Experiment for Crossmodal Conflict Resolution in Complex Environments

Figure 2 for A Neurorobotic Experiment for Crossmodal Conflict Resolution in Complex Environments

Figure 3 for A Neurorobotic Experiment for Crossmodal Conflict Resolution in Complex Environments

Figure 4 for A Neurorobotic Experiment for Crossmodal Conflict Resolution in Complex Environments

Crossmodal conflict resolution is crucial for robot sensorimotor coupling through the interaction with the environment, yielding swift and robust behaviour also in noisy conditions. In this paper, we propose a neurorobotic experiment in which an iCub robot exhibits human-like responses in a complex crossmodal environment. To better understand how humans deal with multisensory conflicts, we conducted a behavioural study exposing 33 subjects to congruent and incongruent dynamic audio-visual cues. In contrast to previous studies using simplified stimuli, we designed a scenario with four animated avatars and observed that the magnitude and extension of the visual bias are related to the semantics embedded in the scene, i.e., visual cues that are congruent with environmental statistics (moving lips and vocalization) induce the strongest bias. We implement a deep learning model that processes stereophonic sound, facial features, and body motion to trigger a discrete behavioural response. After training the model, we exposed the iCub to the same experimental conditions as the human subjects, showing that the robot can replicate similar responses in real time. Our interdisciplinary work provides important insights into how crossmodal conflict resolution can be modelled in robots and introduces future research directions for the efficient combination of sensory observations with internally generated knowledge and expectations.

* Accepted at the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2018), Madrid, Spain

Via

Access Paper or Ask Questions

Multi-modal Feedback for Affordance-driven Interactive Reinforcement Learning

Jul 26, 2018
Francisco Cruz, German I. Parisi, Stefan Wermter

Figure 1 for Multi-modal Feedback for Affordance-driven Interactive Reinforcement Learning

Figure 2 for Multi-modal Feedback for Affordance-driven Interactive Reinforcement Learning

Figure 3 for Multi-modal Feedback for Affordance-driven Interactive Reinforcement Learning

Figure 4 for Multi-modal Feedback for Affordance-driven Interactive Reinforcement Learning

Interactive reinforcement learning (IRL) extends traditional reinforcement learning (RL) by allowing an agent to interact with parent-like trainers during a task. In this paper, we present an IRL approach using dynamic audio-visual input in terms of vocal commands and hand gestures as feedback. Our architecture integrates multi-modal information to provide robust commands from multiple sensory cues along with a confidence value indicating the trustworthiness of the feedback. The integration process also considers the case in which the two modalities convey incongruent information. Additionally, we modulate the influence of sensory-driven feedback in the IRL task using goal-oriented knowledge in terms of contextual affordances. We implement a neural network architecture to predict the effect of performed actions with different objects to avoid failed-states, i.e., states from which it is not possible to accomplish the task. In our experimental setup, we explore the interplay of multimodal feedback and task-specific affordances in a robot cleaning scenario. We compare the learning performance of the agent under four different conditions: traditional RL, multi-modal IRL, and each of these two setups with the use of contextual affordances. Our experiments show that the best performance is obtained by using audio-visual feedback with affordancemodulated IRL. The obtained results demonstrate the importance of multi-modal sensory processing integrated with goal-oriented knowledge in IRL tasks.

* Accepted at IEEE IJCNN 2018, Rio de Janeiro, Brazil

Via

Access Paper or Ask Questions

Towards Modeling the Interaction of Spatial-Associative Neural Network Representations for Multisensory Perception

Jul 13, 2018
German I. Parisi, Jonathan Tong, Pablo Barros, Brigitte Röder, Stefan Wermter

Figure 1 for Towards Modeling the Interaction of Spatial-Associative Neural Network Representations for Multisensory Perception

Figure 2 for Towards Modeling the Interaction of Spatial-Associative Neural Network Representations for Multisensory Perception

Our daily perceptual experience is driven by different neural mechanisms that yield multisensory interaction as the interplay between exogenous stimuli and endogenous expectations. While the interaction of multisensory cues according to their spatiotemporal properties and the formation of multisensory feature-based representations have been widely studied, the interaction of spatial-associative neural representations has received considerably less attention. In this paper, we propose a neural network architecture that models the interaction of spatial-associative representations to perform causal inference of audiovisual stimuli. We investigate the spatial alignment of exogenous audiovisual stimuli modulated by associative congruence. In the spatial layer, topographically arranged networks account for the interaction of audiovisual input in terms of population codes. In the associative layer, congruent audiovisual representations are obtained via the experience-driven development of feature-based associations. Levels of congruency are obtained as a by-product of the neurodynamics of self-organizing networks, where the amount of neural activation triggered by the input can be expressed via a nonlinear distance function. Our novel proposal is that activity-driven levels of congruency can be used as top-down modulatory projections to spatially distributed representations of sensory input, e.g. semantically related audiovisual pairs will yield a higher level of integration than unrelated pairs. Furthermore, levels of neural response in unimodal layers may be seen as sensory reliability for the dynamic weighting of crossmodal cues. We describe a series of planned experiments to validate our model in the tasks of multisensory interaction on the basis of semantic congruence and unimodal cue reliability.

* Workshop on Computational Models for Crossmodal Learning, IEEE ICDL-EPIROB 2017, Lisbon, Portugal

Via

Access Paper or Ask Questions

Continual Lifelong Learning with Neural Networks: A Review

Jul 07, 2018
German I. Parisi, Ronald Kemker, Jose L. Part, Christopher Kanan, Stefan Wermter

Figure 1 for Continual Lifelong Learning with Neural Networks: A Review

Figure 2 for Continual Lifelong Learning with Neural Networks: A Review

Figure 3 for Continual Lifelong Learning with Neural Networks: A Review

Figure 4 for Continual Lifelong Learning with Neural Networks: A Review

Humans and animals have the ability to continually acquire and fine-tune knowledge throughout their lifespan. This ability, referred to as lifelong learning, is mediated by a rich set of neurocognitive mechanisms that together contribute to the development and specialization of our sensorimotor skills as well as to the long-term memory consolidation and retrieval without catastrophic forgetting. Consequently, lifelong learning capabilities are crucial for computational learning systems and autonomous agents interacting in the real world and processing continuous streams of information. However, lifelong learning remains a long-standing challenge for machine learning and neural network models since the continual acquisition of incrementally available information from non-stationary data distributions generally leads to catastrophic forgetting or interference. This limitation represents a major drawback also for state-of-the-art deep and shallow neural network models that typically learn representations from stationary batches of training data, thus without accounting for situations in which the number of tasks is not known a priori and the information becomes incrementally available over time. In this review, we critically summarize the main challenges linked to lifelong learning for artificial learning systems and compare existing neural network approaches that alleviate, to different extents, catastrophic forgetting. Although significant advances have been made in domain-specific learning with neural networks, extensive research efforts are required for the development of robust lifelong learning on autonomous agents and robots. We discuss well-established and emerging research motivated by lifelong learning factors in biological systems such as neurosynaptic plasticity, multi-task transfer learning, intrinsically motivated exploration, and crossmodal learning.

Via

Access Paper or Ask Questions

An Incremental Self-Organizing Architecture for Sensorimotor Learning and Prediction

Mar 09, 2018
Luiza Mici, German I. Parisi, Stefan Wermter

Figure 1 for An Incremental Self-Organizing Architecture for Sensorimotor Learning and Prediction

Figure 2 for An Incremental Self-Organizing Architecture for Sensorimotor Learning and Prediction

Figure 3 for An Incremental Self-Organizing Architecture for Sensorimotor Learning and Prediction

Figure 4 for An Incremental Self-Organizing Architecture for Sensorimotor Learning and Prediction

During visuomotor tasks, robots must compensate for temporal delays inherent in their sensorimotor processing systems. Delay compensation becomes crucial in a dynamic environment where the visual input is constantly changing, e.g., during the interacting with a human demonstrator. For this purpose, the robot must be equipped with a prediction mechanism for using the acquired perceptual experience to estimate possible future motor commands. In this paper, we present a novel neural network architecture that learns prototypical visuomotor representations and provides reliable predictions on the basis of the visual input. These predictions are used to compensate for the delayed motor behavior in an online manner. We investigate the performance of our method with a set of experiments comprising a humanoid robot that has to learn and generate visually perceived arm motion trajectories. We evaluate the accuracy in terms of mean prediction error and analyze the response of the network to novel movement demonstrations. Additionally, we report experiments with incomplete data sequences, showing the robustness of the proposed architecture in the case of a noisy and faulty visual sensor.

Via

Access Paper or Ask Questions

Closing the loop on multisensory interactions: A neural architecture for multisensory causal inference and recalibration

Mar 04, 2018
Jonathan Tong, German I. Parisi, Stefan Wermter, Brigitte Röder

Figure 1 for Closing the loop on multisensory interactions: A neural architecture for multisensory causal inference and recalibration

Figure 2 for Closing the loop on multisensory interactions: A neural architecture for multisensory causal inference and recalibration

Figure 3 for Closing the loop on multisensory interactions: A neural architecture for multisensory causal inference and recalibration

Figure 4 for Closing the loop on multisensory interactions: A neural architecture for multisensory causal inference and recalibration

When the brain receives input from multiple sensory systems, it is faced with the question of whether it is appropriate to process the inputs in combination, as if they originated from the same event, or separately, as if they originated from distinct events. Furthermore, it must also have a mechanism through which it can keep sensory inputs calibrated to maintain the accuracy of its internal representations. We have developed a neural network architecture capable of i) approximating optimal multisensory spatial integration, based on Bayesian causal inference, and ii) recalibrating the spatial encoding of sensory systems. The architecture is based on features of the dorsal processing hierarchy, including the spatial tuning properties of unisensory neurons and the convergence of different sensory inputs onto multisensory neurons. Furthermore, we propose that these unisensory and multisensory neurons play dual roles in i) encoding spatial location as separate or integrated estimates and ii) accumulating evidence for the independence or relatedness of multisensory stimuli. We further propose that top-down feedback connections spanning the dorsal pathway play key a role in recalibrating spatial encoding at the level of early unisensory cortices. Our proposed architecture provides possible explanations for a number of human electrophysiological and neuroimaging results and generates testable predictions linking neurophysiology with behaviour.

* The code to implement the network model and simulations can be found online at: https://github.com/jonathan-tong/multisensory-network-model

Via

Access Paper or Ask Questions

A self-organizing neural network architecture for learning human-object interactions

Mar 02, 2018
Luiza Mici, German I. Parisi, Stefan Wermter

Figure 1 for A self-organizing neural network architecture for learning human-object interactions

Figure 2 for A self-organizing neural network architecture for learning human-object interactions

Figure 3 for A self-organizing neural network architecture for learning human-object interactions

Figure 4 for A self-organizing neural network architecture for learning human-object interactions

The visual recognition of transitive actions comprising human-object interactions is a key component for artificial systems operating in natural environments. This challenging task requires jointly the recognition of articulated body actions as well as the extraction of semantic elements from the scene such as the identity of the manipulated objects. In this paper, we present a self-organizing neural network for the recognition of human-object interactions from RGB-D videos. Our model consists of a hierarchy of Grow-When-Required (GWR) networks that learn prototypical representations of body motion patterns and objects, accounting for the development of action-object mappings in an unsupervised fashion. We report experimental results on a dataset of daily activities collected for the purpose of this study as well as on a publicly available benchmark dataset. In line with neurophysiological studies, our self-organizing architecture exhibits higher neural activation for congruent action-object pairs learned during training sessions with respect to synthetically created incongruent ones. We show that our unsupervised model shows competitive classification results on the benchmark dataset with respect to strictly supervised approaches.

Via

Access Paper or Ask Questions