Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mykel J. Kochenderfer

Stanford University

Diffusion-Based Failure Sampling for Cyber-Physical Systems

Jun 20, 2024

Harrison Delecki, Marc R. Schlichting, Mansur Arief, Anthony Corso, Marcell Vazquez-Chanlatte, Mykel J. Kochenderfer

Figure 1 for Diffusion-Based Failure Sampling for Cyber-Physical Systems

Figure 2 for Diffusion-Based Failure Sampling for Cyber-Physical Systems

Figure 3 for Diffusion-Based Failure Sampling for Cyber-Physical Systems

Figure 4 for Diffusion-Based Failure Sampling for Cyber-Physical Systems

Abstract:Validating safety-critical autonomous systems in high-dimensional domains such as robotics presents a significant challenge. Existing black-box approaches based on Markov chain Monte Carlo may require an enormous number of samples, while methods based on importance sampling often rely on simple parametric families that may struggle to represent the distribution over failures. We propose to sample the distribution over failures using a conditional denoising diffusion model, which has shown success in complex high-dimensional problems such as robotic task planning. We iteratively train a diffusion model to produce state trajectories closer to failure. We demonstrate the effectiveness of our approach on high-dimensional robotic validation tasks, improving sample efficiency and mode coverage compared to existing black-box techniques.

* Under review at RA-L

Via

Access Paper or Ask Questions

Distributed Online Planning for Min-Max Problems in Networked Markov Games

May 29, 2024

Alexandros E. Tzikas, Jinkyoo Park, Mykel J. Kochenderfer, Ross E. Allen

Figure 1 for Distributed Online Planning for Min-Max Problems in Networked Markov Games

Figure 2 for Distributed Online Planning for Min-Max Problems in Networked Markov Games

Figure 3 for Distributed Online Planning for Min-Max Problems in Networked Markov Games

Abstract:Min-max problems are important in multi-agent sequential decision-making because they improve the performance of the worst-performing agent in the network. However, solving the multi-agent min-max problem is challenging. We propose a modular, distributed, online planning-based algorithm that is able to approximate the solution of the min-max objective in networked Markov games, assuming that the agents communicate within a network topology and the transition and reward functions are neighborhood-dependent. This set-up is encountered in the multi-robot setting. Our method consists of two phases at every planning step. In the first phase, each agent obtains sample returns based on its local reward function, by performing online planning. Using the samples from online planning, each agent constructs a concave approximation of its underlying local return as a function of only the action of its neighborhood at the next planning step. In the second phase, the agents deploy a distributed optimization framework that converges to the optimal immediate next action for each agent, based on the function approximations of the first phase. We demonstrate our algorithm's performance through formation control simulations.

* Accepted to appear in the IEEE Robotics and Automation Letters

Via

Access Paper or Ask Questions

SEEK: Semantic Reasoning for Object Goal Navigation in Real World Inspection Tasks

May 16, 2024

Muhammad Fadhil Ginting, Sung-Kyun Kim, David D. Fan, Matteo Palieri, Mykel J. Kochenderfer, Ali-akbar Agha-Mohammadi

Figure 1 for SEEK: Semantic Reasoning for Object Goal Navigation in Real World Inspection Tasks

Figure 2 for SEEK: Semantic Reasoning for Object Goal Navigation in Real World Inspection Tasks

Figure 3 for SEEK: Semantic Reasoning for Object Goal Navigation in Real World Inspection Tasks

Figure 4 for SEEK: Semantic Reasoning for Object Goal Navigation in Real World Inspection Tasks

Abstract:This paper addresses the problem of object-goal navigation in autonomous inspections in real-world environments. Object-goal navigation is crucial to enable effective inspections in various settings, often requiring the robot to identify the target object within a large search space. Current object inspection methods fall short of human efficiency because they typically cannot bootstrap prior and common sense knowledge as humans do. In this paper, we introduce a framework that enables robots to use semantic knowledge from prior spatial configurations of the environment and semantic common sense knowledge. We propose SEEK (Semantic Reasoning for Object Inspection Tasks) that combines semantic prior knowledge with the robot's observations to search for and navigate toward target objects more efficiently. SEEK maintains two representations: a Dynamic Scene Graph (DSG) and a Relational Semantic Network (RSN). The RSN is a compact and practical model that estimates the probability of finding the target object across spatial elements in the DSG. We propose a novel probabilistic planning framework to search for the object using relational semantic knowledge. Our simulation analyses demonstrate that SEEK outperforms the classical planning and Large Language Models (LLMs)-based methods that are examined in this study in terms of efficiency for object-goal inspection tasks. We validated our approach on a physical legged robot in urban environments, showcasing its practicality and effectiveness in real-world inspection scenarios.

Via

Access Paper or Ask Questions

ConstrainedZero: Chance-Constrained POMDP Planning using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints

May 01, 2024

Robert J. Moss, Arec Jamgochian, Johannes Fischer, Anthony Corso, Mykel J. Kochenderfer

Figure 1 for ConstrainedZero: Chance-Constrained POMDP Planning using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints

Figure 2 for ConstrainedZero: Chance-Constrained POMDP Planning using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints

Figure 3 for ConstrainedZero: Chance-Constrained POMDP Planning using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints

Figure 4 for ConstrainedZero: Chance-Constrained POMDP Planning using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints

Abstract:To plan safely in uncertain environments, agents must balance utility with safety constraints. Safe planning problems can be modeled as a chance-constrained partially observable Markov decision process (CC-POMDP) and solutions often use expensive rollouts or heuristics to estimate the optimal value and action-selection policy. This work introduces the ConstrainedZero policy iteration algorithm that solves CC-POMDPs in belief space by learning neural network approximations of the optimal value and policy with an additional network head that estimates the failure probability given a belief. This failure probability guides safe action selection during online Monte Carlo tree search (MCTS). To avoid overemphasizing search based on the failure estimates, we introduce $\Delta$-MCTS, which uses adaptive conformal inference to update the failure threshold during planning. The approach is tested on a safety-critical POMDP benchmark, an aircraft collision avoidance system, and the sustainability problem of safe CO$_2$ storage. Results show that by separating safety constraints from the objective we can achieve a target level of safety without optimizing the balance between rewards and costs.

* In Proceedings of the 2024 International Joint Conference on Artificial Intelligence (IJCAI)

Via

Access Paper or Ask Questions

Robotic Learning for Adaptive Informative Path Planning

Apr 15, 2024

Marija Popovic, Joshua Ott, Julius Rückin, Mykel J. Kochenderfer

Abstract:Adaptive informative path planning (AIPP) is important to many robotics applications, enabling mobile robots to efficiently collect useful data about initially unknown environments. In addition, learning-based methods are increasingly used in robotics to enhance adaptability, versatility, and robustness across diverse and complex tasks. Our survey explores research on applying robotic learning to AIPP, bridging the gap between these two research fields. We begin by providing a unified mathematical framework for general AIPP problems. Next, we establish two complementary taxonomies of current work from the perspectives of (i) learning algorithms and (ii) robotic applications. We explore synergies, recent trends, and highlight the benefits of learning-based methods in AIPP frameworks. Finally, we discuss key challenges and promising future directions to enable more generally applicable and robust robotic data-gathering systems through learning. We provide a comprehensive catalogue of papers reviewed in our survey, including publicly available repositories, to facilitate future studies in the field.

* 22 pages, 1 figure

Via

Access Paper or Ask Questions

Addressing Myopic Constrained POMDP Planning with Recursive Dual Ascent

Mar 26, 2024

Paula Stocco, Suhas Chundi, Arec Jamgochian, Mykel J. Kochenderfer

Figure 1 for Addressing Myopic Constrained POMDP Planning with Recursive Dual Ascent

Figure 2 for Addressing Myopic Constrained POMDP Planning with Recursive Dual Ascent

Figure 3 for Addressing Myopic Constrained POMDP Planning with Recursive Dual Ascent

Figure 4 for Addressing Myopic Constrained POMDP Planning with Recursive Dual Ascent

Abstract:Lagrangian-guided Monte Carlo tree search with global dual ascent has been applied to solve large constrained partially observable Markov decision processes (CPOMDPs) online. In this work, we demonstrate that these global dual parameters can lead to myopic action selection during exploration, ultimately leading to suboptimal decision making. To address this, we introduce history-dependent dual variables that guide local action selection and are optimized with recursive dual ascent. We empirically compare the performance of our approach on a motivating toy example and two large CPOMDPs, demonstrating improved exploration, and ultimately, safer outcomes.

* Accepted to the 2024 International Conference on Automated Planning and Scheduling (ICAPS)

Via

Access Paper or Ask Questions

Entropy-regularized Point-based Value Iteration

Feb 14, 2024

Harrison Delecki, Marcell Vazquez-Chanlatte, Esen Yel, Kyle Wray, Tomer Arnon, Stefan Witwicki, Mykel J. Kochenderfer

Figure 1 for Entropy-regularized Point-based Value Iteration

Figure 2 for Entropy-regularized Point-based Value Iteration

Figure 3 for Entropy-regularized Point-based Value Iteration

Figure 4 for Entropy-regularized Point-based Value Iteration

Abstract:Model-based planners for partially observable problems must accommodate both model uncertainty during planning and goal uncertainty during objective inference. However, model-based planners may be brittle under these types of uncertainty because they rely on an exact model and tend to commit to a single optimal behavior. Inspired by results in the model-free setting, we propose an entropy-regularized model-based planner for partially observable problems. Entropy regularization promotes policy robustness for planning and objective inference by encouraging policies to be no more committed to a single action than necessary. We evaluate the robustness and objective inference performance of entropy-regularized policies in three problem domains. Our results show that entropy-regularized policies outperform non-entropy-regularized baselines in terms of higher expected returns under modeling errors and higher accuracy during objective inference.

Via

Access Paper or Ask Questions

Approximate Sequential Optimization for Informative Path Planning

Feb 13, 2024

Joshua Ott, Mykel J. Kochenderfer, Stephen Boyd

Figure 1 for Approximate Sequential Optimization for Informative Path Planning

Figure 2 for Approximate Sequential Optimization for Informative Path Planning

Figure 3 for Approximate Sequential Optimization for Informative Path Planning

Figure 4 for Approximate Sequential Optimization for Informative Path Planning

Abstract:We consider the problem of finding an informative path through a graph, given initial and terminal nodes and a given maximum path length. We assume that a linear noise corrupted measurement is taken at each node of an underlying unknown vector that we wish to estimate. The informativeness is measured by the reduction in uncertainty in our estimate, evaluated using several metrics. We present a convex relaxation for this informative path planning problem, which we can readily solve to obtain a bound on the possible performance. We develop an approximate sequential method where the path is constructed segment by segment through dynamic programming. This involves solving an orienteering problem, with the node reward acting as a surrogate for informativeness, taking the first step, and then repeating the process. The method scales to very large problem instances and achieves performance not too far from the bound produced by the convex relaxation. We also demonstrate our method's ability to handle adaptive objectives, multimodal sensing, and multi-agent variations of the informative path planning problem.

Via

Access Paper or Ask Questions

Semantic Belief Behavior Graph: Enabling Autonomous Robot Inspection in Unknown Environments

Jan 30, 2024

Muhammad Fadhil Ginting, David D. Fan, Sung-Kyun Kim, Mykel J. Kochenderfer, Ali-akbar Agha-mohammadi

Figure 1 for Semantic Belief Behavior Graph: Enabling Autonomous Robot Inspection in Unknown Environments

Figure 2 for Semantic Belief Behavior Graph: Enabling Autonomous Robot Inspection in Unknown Environments

Figure 3 for Semantic Belief Behavior Graph: Enabling Autonomous Robot Inspection in Unknown Environments

Figure 4 for Semantic Belief Behavior Graph: Enabling Autonomous Robot Inspection in Unknown Environments

Abstract:This paper addresses the problem of autonomous robotic inspection in complex and unknown environments. This capability is crucial for efficient and precise inspections in various real-world scenarios, even when faced with perceptual uncertainty and lack of prior knowledge of the environment. Existing methods for real-world autonomous inspections typically rely on predefined targets and waypoints and often fail to adapt to dynamic or unknown settings. In this work, we introduce the Semantic Belief Behavior Graph (SB2G) framework as a novel approach to semantic-aware autonomous robot inspection. SB2G generates a control policy for the robot, featuring behavior nodes that encapsulate various semantic-based policies designed for inspecting different classes of objects. We design an active semantic search behavior to guide the robot in locating objects for inspection while reducing semantic information uncertainty. The edges in the SB2G encode transitions between these behaviors. We validate our approach through simulation and real-world urban inspections using a legged robotic platform. Our results show that SB2G enables a more efficient inspection policy, exhibiting performance comparable to human-operated inspections.

Via

Access Paper or Ask Questions

Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers

Jan 29, 2024

Alexandros E. Tzikas, Licio Romao, Mert Pilanci, Alessandro Abate, Mykel J. Kochenderfer

Figure 1 for Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers

Figure 2 for Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers

Figure 3 for Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers

Figure 4 for Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers

Abstract:Many machine learning applications require operating on a spatially distributed dataset. Despite technological advances, privacy considerations and communication constraints may prevent gathering the entire dataset in a central unit. In this paper, we propose a distributed sampling scheme based on the alternating direction method of multipliers, which is commonly used in the optimization literature due to its fast convergence. In contrast to distributed optimization, distributed sampling allows for uncertainty quantification in Bayesian inference tasks. We provide both theoretical guarantees of our algorithm's convergence and experimental evidence of its superiority to the state-of-the-art. For our theoretical results, we use convex optimization tools to establish a fundamental inequality on the generated local sample iterates. This inequality enables us to show convergence of the distribution associated with these iterates to the underlying target distribution in Wasserstein distance. In simulation, we deploy our algorithm on linear and logistic regression tasks and illustrate its fast convergence compared to existing gradient-based methods.

Via

Access Paper or Ask Questions