Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Danica Kragic

Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic Platforms

Mar 05, 2021

Ali Ghadirzadeh, Xi Chen, Petra Poklukar, Chelsea Finn, Mårten Björkman, Danica Kragic

Figure 1 for Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic Platforms

Figure 2 for Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic Platforms

Figure 3 for Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic Platforms

Figure 4 for Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic Platforms

Abstract:Reinforcement learning methods can achieve significant performance but require a large amount of training data collected on the same robotic platform. A policy trained with expensive data is rendered useless after making even a minor change to the robot hardware. In this paper, we address the challenging problem of adapting a policy, trained to perform a task, to a novel robotic hardware platform given only few demonstrations of robot motion trajectories on the target robot. We formulate it as a few-shot meta-learning problem where the goal is to find a meta-model that captures the common structure shared across different robotic platforms such that data-efficient adaptation can be performed. We achieve such adaptation by introducing a learning framework consisting of a probabilistic gradient-based meta-learning algorithm that models the uncertainty arising from the few-shot setting with a low-dimensional latent variable. We experimentally evaluate our framework on a simulated reaching and a real-robot picking task using 400 simulated robots generated by varying the physical parameters of an existing set of robotic platforms. Our results show that the proposed method can successfully adapt a trained policy to different robotic platforms with novel physical parameters and the superiority of our meta-learning algorithm compared to state-of-the-art methods for the introduced few-shot policy adaptation problem.

Via

Access Paper or Ask Questions

Graph-based Task-specific Prediction Models for Interactions between Deformable and Rigid Objects

Mar 04, 2021

Zehang Weng, Fabian Paus, Anastasiia Varava, Hang Yin, Tamim Asfour, Danica Kragic

Figure 1 for Graph-based Task-specific Prediction Models for Interactions between Deformable and Rigid Objects

Figure 2 for Graph-based Task-specific Prediction Models for Interactions between Deformable and Rigid Objects

Figure 3 for Graph-based Task-specific Prediction Models for Interactions between Deformable and Rigid Objects

Figure 4 for Graph-based Task-specific Prediction Models for Interactions between Deformable and Rigid Objects

Abstract:Capturing scene dynamics and predicting the future scene state is challenging but essential for robotic manipulation tasks, especially when the scene contains both rigid and deformable objects. In this work, we contribute a simulation environment and generate a novel dataset for task-specific manipulation, involving interactions between rigid objects and a deformable bag. The dataset incorporates a rich variety of scenarios including different object sizes, object numbers and manipulation actions. We approach dynamics learning by proposing an object-centric graph representation and two modules which are Active Prediction Module (APM) and Position Prediction Module (PPM) based on graph neural networks with an encode-process-decode architecture. At the inference stage, we build a two-stage model based on the learned modules for single time step prediction. We combine modules with different prediction horizons into a mixed-horizon model which addresses long-term prediction. In an ablation study, we show the benefits of the two-stage model for single time step prediction and the effectiveness of the mixed-horizon model for long-term prediction tasks. Supplementary material is available at https://github.com/wengzehang/deformable_rigid_interaction_prediction

* IROS 2021 submission, Zehang Weng and Fabian Paus have equal contribution to this paper

Via

Access Paper or Ask Questions

Enabling Visual Action Planning for Object Manipulation through Latent Space Roadmap

Mar 03, 2021

Martina Lippi, Petra Poklukar, Michael C. Welle, Anastasiia Varava, Hang Yin, Alessandro Marino, Danica Kragic

Figure 1 for Enabling Visual Action Planning for Object Manipulation through Latent Space Roadmap

Figure 2 for Enabling Visual Action Planning for Object Manipulation through Latent Space Roadmap

Figure 3 for Enabling Visual Action Planning for Object Manipulation through Latent Space Roadmap

Figure 4 for Enabling Visual Action Planning for Object Manipulation through Latent Space Roadmap

Abstract:We present a framework for visual action planning of complex manipulation tasks with high-dimensional state spaces, focusing on manipulation of deformable objects. We propose a Latent Space Roadmap (LSR) for task planning, a graph-based structure capturing globally the system dynamics in a low-dimensional latent space. Our framework consists of three parts: (1) a Mapping Module (MM) that maps observations, given in the form of images, into a structured latent space extracting the respective states, that generates observations from the latent states, (2) the LSR which builds and connects clusters containing similar states in order to find the latent plans between start and goal states extracted by MM, and (3) the Action Proposal Module that complements the latent plan found by the LSR with the corresponding actions. We present a thorough investigation of our framework on two simulated box stacking tasks and a folding task executed on a real robot.

Via

Access Paper or Ask Questions

Interpretability in Contact-Rich Manipulation via Kinodynamic Images

Feb 23, 2021

Ioanna Mitsioni, Joonatan Mänttäri, Yiannis Karayiannidis, John Folkesson, Danica Kragic

Figure 1 for Interpretability in Contact-Rich Manipulation via Kinodynamic Images

Figure 2 for Interpretability in Contact-Rich Manipulation via Kinodynamic Images

Figure 3 for Interpretability in Contact-Rich Manipulation via Kinodynamic Images

Figure 4 for Interpretability in Contact-Rich Manipulation via Kinodynamic Images

Abstract:Deep Neural Networks (NNs) have been widely utilized in contact-rich manipulation tasks to model the complicated contact dynamics. However, NN-based models are often difficult to decipher which can lead to seemingly inexplicable behaviors and unidentifiable failure cases. In this work, we address the interpretability of NN-based models by introducing the kinodynamic images. We propose a methodology that creates images from the kinematic and dynamic data of a contact-rich manipulation task. Our formulation visually reflects the task's state by encoding its kinodynamic variations and temporal evolution. By using images as the state representation, we enable the application of interpretability modules that were previously limited to vision-based tasks. We use this representation to train Convolution-based Networks and we extract interpretations of the model's decisions with Grad-CAM, a technique that produces visual explanations. Our method is versatile and can be applied to any classification problem using synchronous features in manipulation to visually interpret which parts of the input drive the model's decisions and distinguish its failure modes. We evaluate this approach on two examples of real-world contact-rich manipulation: pushing and cutting, with known and unknown objects. Finally, we demonstrate that our method enables both detailed visual inspections of sequences in a task, as well as high-level evaluations of a model's behavior and tendencies. Data and code for this work are available at https://github.com/imitsioni/interpretable_manipulation.

Via

Access Paper or Ask Questions

Sequential Topological Representations for Predictive Models of Deformable Objects

Nov 23, 2020

Rika Antonova, Anastasiia Varava, Peiyang Shi, J. Frederico Carvalho, Danica Kragic

Figure 1 for Sequential Topological Representations for Predictive Models of Deformable Objects

Figure 2 for Sequential Topological Representations for Predictive Models of Deformable Objects

Figure 3 for Sequential Topological Representations for Predictive Models of Deformable Objects

Figure 4 for Sequential Topological Representations for Predictive Models of Deformable Objects

Abstract:Deformable objects present a formidable challenge for robotic manipulation due to the lack of canonical low-dimensional representations and the difficulty of capturing, predicting, and controlling such objects. We construct compact topological representations to capture the state of highly deformable objects that are topologically nontrivial. We develop an approach that tracks the evolution of this topological state through time. Under several mild assumptions, we prove that the topology of the scene and its evolution can be recovered from point clouds representing the scene. Our further contribution is a method to learn predictive models that take a sequence of past point cloud observations as input and predict a sequence of topological states, conditioned on target/future control actions. Our experiments with highly deformable objects in simulation show that the proposed multistep predictive models yield more precise results than those obtained from computational topology libraries. These models can leverage patterns inferred across various objects and offer fast multistep predictions suitable for real-time applications.

* The first two authors made equal contributions

Via

Access Paper or Ask Questions

Learning Stable Normalizing-Flow Control for Robotic Manipulation

Oct 30, 2020

Shahbaz Abdul Khader, Hang Yin, Pietro Falco, Danica Kragic

Figure 1 for Learning Stable Normalizing-Flow Control for Robotic Manipulation

Figure 2 for Learning Stable Normalizing-Flow Control for Robotic Manipulation

Figure 3 for Learning Stable Normalizing-Flow Control for Robotic Manipulation

Figure 4 for Learning Stable Normalizing-Flow Control for Robotic Manipulation

Abstract:Reinforcement Learning (RL) of robotic manipulation skills, despite its impressive successes, stands to benefit from incorporating domain knowledge from control theory. One of the most important properties that is of interest is control stability. Ideally, one would like to achieve stability guarantees while staying within the framework of state-of-the-art deep RL algorithms. Such a solution does not exist in general, especially one that scales to complex manipulation tasks. We contribute towards closing this gap by introducing $\textit{normalizing-flow}$ control structure, that can be deployed in any latest deep RL algorithms. While stable exploration is not guaranteed, our method is designed to ultimately produce deterministic controllers with provable stability. In addition to demonstrating our method on challenging contact-rich manipulation tasks, we also show that it is possible to achieve considerable exploration efficiency--reduced state space coverage and actuation efforts--without losing learning efficiency.

* 7 pages, 8 figures

Via

Access Paper or Ask Questions

Data-efficient visuomotor policy training using reinforcement learning and generative models

Jul 26, 2020

Ali Ghadirzadeh, Petra Poklukar, Ville Kyrki, Danica Kragic, Mårten Björkman

Figure 1 for Data-efficient visuomotor policy training using reinforcement learning and generative models

Figure 2 for Data-efficient visuomotor policy training using reinforcement learning and generative models

Figure 3 for Data-efficient visuomotor policy training using reinforcement learning and generative models

Figure 4 for Data-efficient visuomotor policy training using reinforcement learning and generative models

Abstract:We present a data-efficient framework for solving deep visuomotor sequential decision-making problems which exploits the combination of reinforcement learning (RL) with the latent variable generative models. Our framework trains deep visuomotor policies by introducing an action latent variable such that the feed-forward policy search can be divided into two parts: (1) training a sub-policy that outputs a distribution over the action latent variable given a state of the system, and (2) training a generative model that outputs a sequence of motor actions given a latent action representation. Our approach enables safe exploration and alleviates the data-inefficiency problem as it exploits prior knowledge about valid sequences of motor actions. Moreover, by evaluating the quality of the generative models we are able to predict the performance of the RL policy training prior to the actual training on the physical robot. We achieve this by defining two novel measures, disentanglement and local linearity, for assessing the quality of generative models' latent spaces, and complementing them with the existing measures for evaluation of generative models. We demonstrate the efficiency of our approach on a picking task using several different generative models and determine which of their properties have the most influence on the final policy training.

Via

Access Paper or Ask Questions

Human-centered collaborative robots with deep reinforcement learning

Jul 02, 2020

Ali Ghadirzadeh, Xi Chen, Wenjie Yin, Zhengrong Yi, Mårten Björkman, Danica Kragic

Figure 1 for Human-centered collaborative robots with deep reinforcement learning

Figure 2 for Human-centered collaborative robots with deep reinforcement learning

Figure 3 for Human-centered collaborative robots with deep reinforcement learning

Figure 4 for Human-centered collaborative robots with deep reinforcement learning

Abstract:We present a reinforcement learning based framework for human-centered collaborative systems. The framework is proactive and balances the benefits of timely actions with the risk of taking improper actions by minimizing the total time spent to complete the task. The framework is learned end-to-end in an unsupervised fashion addressing the perception uncertainties and decision making in an integrated manner. The framework is shown to provide more fluent coordination between human and robot partners on an example task of packaging compared to alternatives for which perception and decision-making systems are learned independently, using supervised learning. The foremost benefit of the proposed approach is that it allows for fast adaptation to new human partners and tasks since tedious annotation of motion data is avoided and the learning is performed on-line.

Via

Access Paper or Ask Questions

Analytic Manifold Learning: Unifying and Evaluating Representations for Continuous Control

Jun 15, 2020

Rika Antonova, Maksim Maydanskiy, Danica Kragic, Sam Devlin, Katja Hofmann

Figure 1 for Analytic Manifold Learning: Unifying and Evaluating Representations for Continuous Control

Figure 2 for Analytic Manifold Learning: Unifying and Evaluating Representations for Continuous Control

Figure 3 for Analytic Manifold Learning: Unifying and Evaluating Representations for Continuous Control

Figure 4 for Analytic Manifold Learning: Unifying and Evaluating Representations for Continuous Control

Abstract:We address the problem of learning reusable state representations from streaming high-dimensional observations. This is important for areas like Reinforcement Learning (RL), which yields non-stationary data distributions during training. We make two key contributions. First, we propose an evaluation suite that measures alignment between latent and true low-dimensional states. We benchmark several widely used unsupervised learning approaches. This uncovers the strengths and limitations of existing approaches that impose additional constraints/objectives on the latent space. Our second contribution is a unifying mathematical formulation for learning latent relations. We learn analytic relations on source domains, then use these relations to help structure the latent space when learning on target domains. This formulation enables a more general, flexible and principled way of shaping the latent space. It formalizes the notion of learning independent relations, without imposing restrictive simplifying assumptions or requiring domain-specific information. We present mathematical properties, concrete algorithms for implementation and experimental validation of successful learning and transfer of latent relations.

Via

Access Paper or Ask Questions

The effect of Target Normalization and Momentum on Dying ReLU

May 13, 2020

Isac Arnekvist, J. Frederico Carvalho, Danica Kragic, Johannes A. Stork

Figure 1 for The effect of Target Normalization and Momentum on Dying ReLU

Figure 2 for The effect of Target Normalization and Momentum on Dying ReLU

Figure 3 for The effect of Target Normalization and Momentum on Dying ReLU

Figure 4 for The effect of Target Normalization and Momentum on Dying ReLU

Abstract:Optimizing parameters with momentum, normalizing data values, and using rectified linear units (ReLUs) are popular choices in neural network (NN) regression. Although ReLUs are popular, they can collapse to a constant function and "die", effectively removing their contribution from the model. While some mitigations are known, the underlying reasons of ReLUs dying during optimization are currently poorly understood. In this paper, we consider the effects of target normalization and momentum on dying ReLUs. We find empirically that unit variance targets are well motivated and that ReLUs die more easily, when target variance approaches zero. To further investigate this matter, we analyze a discrete-time linear autonomous system, and show theoretically how this relates to a model with a single ReLU and how common properties can result in dying ReLU. We also analyze the gradients of a single-ReLU model to identify saddle points and regions corresponding to dying ReLU and how parameters evolve into these regions when momentum is used. Finally, we show empirically that this problem persist, and is aggravated, for deeper models including residual networks.

Via

Access Paper or Ask Questions