Marco Pavone

Stanford University and

Interaction-Dynamics-Aware Perception Zones for Obstacle Detection Safety Evaluation

Jun 24, 2022

Sever Topan, Karen Leung, Yuxiao Chen, Pritish Tupekar, Edward Schmerling, Jonas Nilsson, Michael Cox, Marco Pavone

Abstract:To enable safe autonomous vehicle (AV) operations, it is critical that an AV's obstacle detection module can reliably detect obstacles that pose a safety threat (i.e., are safety-critical). It is therefore desirable that the evaluation metric for the perception system captures the safety-criticality of objects. Unfortunately, existing perception evaluation metrics tend to make strong assumptions about the objects and ignore the dynamic interactions between agents, and thus do not accurately capture real-world safety risks. To address these shortcomings, we introduce an interaction-dynamics-aware obstacle detection evaluation metric that accounts for closed-loop dynamic interactions between an ego vehicle and obstacles in the scene. Borrowing from optimal control theory, namely Hamilton-Jacobi reachability, we present a computationally tractable method for constructing a "safety zone": a region in state space that defines where safety-critical obstacles lie for the purpose of defining safety metrics. Our proposed safety zone is mathematically complete, and can be easily computed to reflect a variety of safety requirements. Using an off-the-shelf detection algorithm from the nuScenes detection challenge leaderboard, we demonstrate that our approach is computationally lightweight, and can better capture safety-critical perception errors than a baseline approach.

* Accepted to Intelligent Vehicles Symposium 2022

ScePT: Scene-consistent, Policy-based Trajectory Predictions for Planning

Jun 18, 2022

Yuxiao Chen, Boris Ivanovic, Marco Pavone

Abstract:Trajectory prediction is a critical functionality of autonomous systems that share environments with uncontrolled agents, one prominent example being self-driving vehicles. Currently, most prediction methods do not enforce scene consistency, i.e., there are a substantial number of self-collisions between predicted trajectories of different agents in the scene. Moreover, many approaches generate individual trajectory predictions per agent instead of joint trajectory predictions of the whole scene, which makes downstream planning difficult. In this work, we present ScePT, a policy planning-based trajectory prediction model that generates accurate, scene-consistent trajectory predictions suitable for autonomous system motion planning. It explicitly enforces scene consistency and learns an agent interaction policy that can be used for conditional prediction. Experiments on multiple real-world pedestrian and autonomous vehicle datasets show that ScePT matches current state-of-the-art prediction accuracy with significantly improved scene consistency. We also demonstrate ScePT's ability to work with a downstream contingency planner.

Robust-RRT: Probabilistically-Complete Motion Planning for Uncertain Nonlinear Systems

May 16, 2022

Albert Wu, Thomas Lew, Kiril Solovey, Edward Schmerling, Marco Pavone

Abstract:Robust motion planning entails computing a global motion plan that is safe under all possible uncertainty realizations, be it in the system dynamics, the robot's initial position, or with respect to external disturbances. Current approaches for robust motion planning either lack theoretical guarantees, or make restrictive assumptions on the system dynamics and uncertainty distributions. In this paper, we address these limitations by proposing the robust rapidly-exploring random-tree (Robust-RRT) algorithm, which integrates forward reachability analysis directly into sampling-based control trajectory synthesis. We prove that Robust-RRT is probabilistically complete (PC) for nonlinear Lipschitz continuous dynamical systems with bounded uncertainty. In other words, Robust-RRT eventually finds a robust motion plan that is feasible under all possible uncertainty realizations, assuming such a plan exists. Because we explicitly consider the time evolution of reachable sets along control trajectories, our analysis applies even to unstable systems that admit only short-horizon feasible plans, and PC extends to unstabilizable systems. To the best of our knowledge, this is the most general PC proof for robust sampling-based motion planning, in terms of the types of uncertainties and dynamical systems it can handle. Considering that an exact computation of reachable sets can be computationally expensive for some dynamical systems, we incorporate sampling-based reachability analysis into Robust-RRT and demonstrate our robust planner on nonlinear, underactuated, and hybrid systems.

* 16 pages of main text + 5 pages of appendix, 5 figures, submitted to the 2022 International Symposium on Robotics Research

Second-Order Sensitivity Analysis for Bilevel Optimization

May 04, 2022

Robert Dyro, Edward Schmerling, Nikos Arechiga, Marco Pavone

Abstract:In this work we derive a second-order approach to bilevel optimization, a type of mathematical programming in which the solution to a parameterized optimization problem (the "lower" problem) is itself to be optimized (in the "upper" problem) as a function of the parameters. Many existing approaches to bilevel optimization employ first-order sensitivity analysis, based on the implicit function theorem (IFT), for the lower problem to derive a gradient of the lower problem solution with respect to its parameters; this IFT gradient is then used in a first-order optimization method for the upper problem. This paper extends this sensitivity analysis to provide second-order derivative information of the lower problem (which we call the IFT Hessian), enabling the use of faster-converging second-order optimization methods at the upper level. Our analysis shows that (i) much of the computation already used to produce the IFT gradient can be reused for the IFT Hessian, (ii) error bounds derived for the IFT gradient readily apply to the IFT Hessian, and (iii) computing IFT Hessians can significantly reduce overall computation by extracting more information from each lower level solve. We corroborate our findings and demonstrate the broad range of applications of our method by applying it to problem instances of least squares hyperparameter auto-tuning, multi-class SVM auto-tuning, and inverse optimal control.

* Proceedings of The 25th International Conference on Artificial Intelligence and Statistics, PMLR 151:9166-9181, 2022
* 16 pages, 6 figures
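
The first-order IFT sensitivity mentioned in the abstract can be illustrated on a toy problem. The sketch below is not taken from the paper: it assumes a hypothetical scalar lower problem, min over w of 0.5*(a*w - b)^2 + 0.5*lam*w^2, and the function names and default values are illustrative only. It differentiates the stationarity condition to obtain the gradient of the lower solution with respect to the parameter lam, then compares the result against a finite difference.

```python
def lower_solution(lam, a=2.0, b=3.0):
    """Closed-form minimizer of 0.5*(a*w - b)**2 + 0.5*lam*w**2 over w."""
    return a * b / (a * a + lam)


def ift_gradient(lam, a=2.0, b=3.0):
    """Derivative of the lower solution w*(lam) via the implicit function theorem.

    Stationarity condition: g(w, lam) = a*(a*w - b) + lam*w = 0.
    IFT: dw/dlam = -(dg/dw)^(-1) * dg/dlam, with dg/dw = a*a + lam and dg/dlam = w.
    """
    w = lower_solution(lam, a, b)
    dg_dw = a * a + lam
    dg_dlam = w
    return -dg_dlam / dg_dw


def finite_difference(lam, a=2.0, b=3.0, eps=1e-6):
    """Central finite difference of w*(lam), used only as a sanity check."""
    hi = lower_solution(lam + eps, a, b)
    lo = lower_solution(lam - eps, a, b)
    return (hi - lo) / (2.0 * eps)


if __name__ == "__main__":
    print(ift_gradient(1.0), finite_difference(1.0))
```

In this toy case the lower problem has a closed form, so the IFT gradient can be checked directly; in general the lower solution would come from a numerical solver, and only the stationarity condition would be differentiated.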

Safe Reinforcement Learning Using Black-Box Reachability Analysis

Apr 15, 2022

Mahmoud Selim, Amr Alanwar, Shreyas Kousik, Grace Gao, Marco Pavone, Karl H. Johansson

Abstract:Reinforcement learning (RL) is capable of sophisticated motion planning and control for robots in uncertain environments. However, state-of-the-art deep RL approaches typically lack safety guarantees, especially when the robot and environment models are unknown. To justify widespread deployment, robots must respect safety constraints without sacrificing performance. Thus, we propose a Black-box Reachability-based Safety Layer (BRSL) with three main components: (1) data-driven reachability analysis for a black-box robot model, (2) a trajectory rollout planner that predicts future actions and observations using an ensemble of neural networks trained online, and (3) a differentiable polytope collision check between the reachable set and obstacles that enables correcting unsafe actions. In simulation, BRSL outperforms other state-of-the-art safe RL methods on a Turtlebot 3, a quadrotor, and a trajectory-tracking point mass with an unsafe set adjacent to the area of highest reward.

Control-oriented meta-learning

Apr 14, 2022

Spencer M. Richards, Navid Azizan, Jean-Jacques Slotine, Marco Pavone

Abstract:Real-time adaptation is imperative to the control of robots operating in complex, dynamic environments. Adaptive control laws can endow even nonlinear systems with good trajectory tracking performance, provided that any uncertain dynamics terms are linearly parameterizable with known nonlinear features. However, it is often difficult to specify such features a priori, such as for aerodynamic disturbances on rotorcraft or interaction forces between a manipulator arm and various objects. In this paper, we turn to data-driven modeling with neural networks to learn, offline from past data, an adaptive controller with an internal parametric model of these nonlinear features. Our key insight is that we can better prepare the controller for deployment with control-oriented meta-learning of features in closed-loop simulation, rather than regression-oriented meta-learning of features to fit input-output data. Specifically, we meta-learn the adaptive controller with closed-loop tracking simulation as the base-learner and the average tracking error as the meta-objective. With both fully-actuated and underactuated nonlinear planar rotorcraft subject to wind, we demonstrate that our adaptive controller outperforms other controllers trained with regression-oriented meta-learning when deployed in closed-loop for trajectory tracking control.

* First published in Robotics: Science and Systems (RSS) 2021. This extended version is under review for a special issue in the International Journal of Robotics Research (IJRR). arXiv admin note: substantial text overlap with arXiv:2103.04490

Online Learning for Traffic Routing under Unknown Preferences

Mar 31, 2022

Devansh Jalota, Karthik Gopalakrishnan, Navid Azizan, Ramesh Johari, Marco Pavone

Abstract:In transportation networks, users typically choose routes in a decentralized and self-interested manner to minimize their individual travel costs, which, in practice, often results in inefficient overall outcomes for society. As a result, there has been a growing interest in designing road tolling schemes to cope with these efficiency losses and steer users toward a system-efficient traffic pattern. However, the efficacy of road tolling schemes often relies on having access to complete information on users' trip attributes, such as their origin-destination (O-D) travel information and their values of time, which may not be available in practice. Motivated by this practical consideration, we propose an online learning approach to set tolls in a traffic network to drive heterogeneous users with different values of time toward a system-efficient traffic pattern. In particular, we develop a simple yet effective algorithm that adjusts tolls at each time period solely based on the observed aggregate flows on the roads of the network without relying on any additional trip attributes of users, thereby preserving user privacy. In the setting where the O-D pairs and values of time of users are drawn i.i.d. at each period, we show that our approach obtains an expected regret and road capacity violation of $O(\sqrt{T})$, where $T$ is the number of periods over which tolls are updated. Our regret guarantee is relative to an offline oracle that has complete information on users' trip attributes. We further establish an $\Omega(\sqrt{T})$ lower bound on the regret of any algorithm, showing that our algorithm is optimal up to constants. Finally, we demonstrate the superior performance of our approach relative to several benchmarks on a real-world transportation network, thereby highlighting its practical applicability.
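
The abstract does not state the toll update rule, so the following is only a minimal sketch of what a flow-based update could look like, under the assumption of a projected, gradient-style step: each road's toll rises when its observed aggregate flow exceeds capacity and falls (but never below zero) otherwise. The function name `update_tolls`, the `step` parameter, and the numbers are hypothetical and not taken from the paper.

```python
def update_tolls(tolls, flows, capacities, step):
    """One hypothetical toll update using only observed aggregate road flows.

    Assumed rule (not from the paper): toll <- max(0, toll + step * (flow - capacity)).
    No per-user trip attributes (origin, destination, value of time) are used.
    """
    return [
        max(0.0, toll + step * (flow - capacity))
        for toll, flow, capacity in zip(tolls, flows, capacities)
    ]


if __name__ == "__main__":
    # Road 0 is over capacity, so its toll increases; road 1 is under capacity,
    # so its toll decreases and is clipped at zero.
    print(update_tolls([1.0, 0.1], [12.0, 5.0], [10.0, 8.0], step=0.5))
```

The point of the sketch is the information structure: the update reads only per-road flow totals, which is what allows such a scheme to avoid collecting individual trip data.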

Motron: Multimodal Probabilistic Human Motion Forecasting

Mar 25, 2022

Tim Salzmann, Marco Pavone, Markus Ryll

Abstract:Autonomous systems and humans are increasingly sharing the same space. Robots work side by side or even hand in hand with humans to balance each other's limitations. Such cooperative interactions are ever more sophisticated. Thus, the ability to reason not just about a human's center of gravity position, but also about their granular motion, is an important prerequisite for human-robot interaction. However, many algorithms ignore the multimodal nature of human motion or neglect uncertainty in their motion forecasts. We present Motron, a multimodal, probabilistic, graph-structured model that captures the multimodality of human motion using probabilistic methods while being able to output deterministic maximum-likelihood motions and corresponding confidence values for each mode. Our model aims to be tightly integrated with the robotic planning-control-interaction loop, outputting physically feasible human motions and being computationally efficient. We demonstrate the performance of our model on several challenging real-world motion forecasting datasets, outperforming a wide array of generative/variational methods while providing state-of-the-art single-output motions if required. It does both while using significantly less computational power than state-of-the-art algorithms.

* CVPR 2022

Neural-MPC: Deep Learning Model Predictive Control for Quadrotors and Agile Robotic Platforms

Mar 15, 2022

Tim Salzmann, Elia Kaufmann, Marco Pavone, Davide Scaramuzza, Markus Ryll

Abstract:Model Predictive Control (MPC) has become a popular framework in embedded control for high-performance autonomous systems. However, to achieve good control performance using MPC, an accurate dynamics model is key. To maintain real-time operation, the dynamics models used on embedded systems have been limited to simple first-principle models, which substantially limits their representational power. In contrast to such simple models, machine learning approaches such as neural networks have been shown to accurately model even complex dynamic effects purely from data, but their large computational complexity has hindered their combination with fast real-time iteration loops. With this work, we present Neural-MPC, a framework to efficiently integrate large, complex neural network architectures as dynamics models within a model-predictive control pipeline. Our experiments, performed in simulation and the real world on a highly agile quadrotor platform, demonstrate up to 83% reduction in positional tracking error when compared to state-of-the-art MPC approaches without neural network dynamics.

* submitted to RA-L

A Unified View of SDP-based Neural Network Verification through Completely Positive Programming

Mar 06, 2022

Robin Brown, Edward Schmerling, Navid Azizan, Marco Pavone

Abstract:Verifying that input-output relationships of a neural network conform to prescribed operational specifications is a key enabler towards deploying these networks in safety-critical applications. Semidefinite programming (SDP)-based approaches to Rectified Linear Unit (ReLU) network verification transcribe this problem into an optimization problem, where the accuracy of any such formulation reflects the level of fidelity in how the neural network computation is represented, as well as the relaxations of intractable constraints. While the literature contains much progress on improving the tightness of SDP formulations while maintaining tractability, comparatively little work has been devoted to the other extreme, i.e., how to most accurately capture the original verification problem before SDP relaxation. In this work, we develop an exact, convex formulation of verification as a completely positive program (CPP), and provide analysis showing that our formulation is minimal -- the removal of any constraint fundamentally misrepresents the neural network computation. We leverage our formulation to provide a unifying view of existing approaches, and give insight into the source of large relaxation gaps observed in some cases.
