We present a framework for bi-level trajectory optimization in which a system's dynamics are encoded as the solution to a constrained optimization problem and smooth gradients of this lower-level problem are passed to an upper-level trajectory optimizer. This optimization-based dynamics representation enables constraint handling, additional variables, and non-smooth forces to be abstracted away from the upper-level optimizer, and allows classical unconstrained optimizers to synthesize trajectories for more complex systems. We provide a path-following method for efficient evaluation of constrained dynamics and utilize the implicit-function theorem to compute smooth gradients of this representation. We demonstrate the framework on systems from the locomotion, aerospace, and manipulation domains, including an acrobot with joint limits, a cart-pole subject to Coulomb friction, a Raibert hopper, rocket landing with thrust limits, and a planar-push task, modeling each with optimization-based dynamics and optimizing trajectories with iterative LQR.
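As a minimal illustration of the gradient computation described above (the damped-pendulum model, residual, and step size are illustrative assumptions, not the paper's implementation), an implicit-Euler step can be posed as the root of a residual r(x_next, x, u) = 0, solved with Newton's method, and then differentiated via the implicit-function theorem:

```python
# Minimal sketch (not the paper's implementation): differentiate an
# implicitly defined dynamics step via the implicit-function theorem.
import numpy as np

def residual(x_next, x, u, h=0.05):
    # Implicit-Euler residual for a damped pendulum: r(x_next, x, u) = 0
    q, qd = x_next
    f = np.array([qd, -9.81 * np.sin(q) - 0.1 * qd + u])
    return x_next - x - h * f

def residual_jacobians(x_next, x, u, h=0.05, eps=1e-6):
    # Finite-difference Jacobians of r w.r.t. x_next and u (for clarity only)
    n = x_next.size
    Jx_next = np.zeros((n, n))
    for i in range(n):
        d = np.zeros(n); d[i] = eps
        Jx_next[:, i] = (residual(x_next + d, x, u, h) - residual(x_next - d, x, u, h)) / (2 * eps)
    Ju = (residual(x_next, x, u + eps, h) - residual(x_next, x, u - eps, h)) / (2 * eps)
    return Jx_next, Ju.reshape(n, 1)

def step(x, u, h=0.05):
    # Newton's method: solve r(x_next, x, u) = 0 for x_next
    x_next = x.copy()
    for _ in range(20):
        r = residual(x_next, x, u, h)
        Jx_next, _ = residual_jacobians(x_next, x, u, h)
        x_next = x_next - np.linalg.solve(Jx_next, r)
    # Implicit-function theorem: dx_next/du = -(dr/dx_next)^{-1} dr/du
    Jx_next, Ju = residual_jacobians(x_next, x, u, h)
    dxnext_du = -np.linalg.solve(Jx_next, Ju)
    return x_next, dxnext_du

x_next, grad = step(np.array([0.3, 0.0]), 0.0)
print(x_next, grad.ravel())
```

The same pattern applies when the lower-level problem includes constraints or additional variables, which is what allows the upper-level optimizer to treat the solver as a smooth dynamics function.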
We propose a novel framework for learning stabilizable nonlinear dynamical systems for continuous control tasks in robotics. The key contribution is a control-theoretic regularizer for dynamics fitting rooted in the notion of stabilizability, a constraint which guarantees the existence of robust tracking controllers for arbitrary open-loop trajectories generated with the learned system. Leveraging tools from contraction theory and statistical learning in Reproducing Kernel Hilbert Spaces, we formulate stabilizable dynamics learning as a functional optimization with convex objective and bi-convex functional constraints. Under a mild structural assumption and relaxation of the functional constraints to sampling-based constraints, we derive the optimal solution with a modified Representer theorem. Finally, we utilize random matrix feature approximations to reduce the dimensionality of the search parameters and formulate an iterative convex optimization algorithm that jointly fits the dynamics functions and searches for a certificate of stabilizability. We validate the proposed algorithm in simulation for a planar quadrotor, and on a quadrotor hardware testbed emulating planar dynamics. We verify, both in simulation and on hardware, significantly improved trajectory generation and tracking performance with the control-theoretic regularized model over models learned using traditional regression techniques, especially when learning from small supervised datasets. The results support the conjecture that the use of stabilizability constraints as a form of regularization can help prune the hypothesis space in a manner that is tailored to the downstream task of trajectory generation and feedback control, resulting in models that are not only dramatically better conditioned, but also data efficient.
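The random-feature approximation mentioned above can be sketched as follows; this is a minimal illustration with an assumed pendulum dataset and Gaussian-kernel features, and it omits the stabilizability constraint that distinguishes the proposed method, so it shows only the plain regression that the constraint regularizes:

```python
# Minimal sketch (assumptions throughout): fit dynamics x_dot ~ W^T phi(x, u)
# with random Fourier features; the stabilizability constraint is omitted.
import numpy as np

rng = np.random.default_rng(0)

def random_features(Z, Omega, b):
    # Random Fourier features approximating a Gaussian-kernel RKHS
    return np.sqrt(2.0 / Omega.shape[1]) * np.cos(Z @ Omega + b)

# Toy data: (state, control) -> state derivative for a damped pendulum
N, n_x, n_u, D = 200, 2, 1, 100
X = rng.uniform(-2, 2, (N, n_x))
U = rng.uniform(-1, 1, (N, n_u))
Xdot = np.column_stack([X[:, 1], -9.81 * np.sin(X[:, 0]) - 0.1 * X[:, 1] + U[:, 0]])

Z = np.hstack([X, U])
Omega = rng.normal(0.0, 1.0, (n_x + n_u, D))
b = rng.uniform(0.0, 2 * np.pi, D)
Phi = random_features(Z, Omega, b)

# Ridge regression for the feature weights (closed form)
lam = 1e-3
W = np.linalg.solve(Phi.T @ Phi + lam * np.eye(D), Phi.T @ Xdot)
print("training RMSE:", np.sqrt(np.mean((Phi @ W - Xdot) ** 2)))
```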
Modern Machine Translation (MT) systems perform consistently well on clean, in-domain text. However, most human-generated text, particularly in the realm of social media, is full of typos, slang, dialect, idiolect, and other noise which can have a disastrous impact on the accuracy of the output translation. In this paper, we leverage the Machine Translation of Noisy Text (MTNT) dataset to enhance the robustness of MT systems by emulating naturally occurring noise in otherwise clean data. By synthesizing noise in this manner, we are ultimately able to make a vanilla MT system resilient to naturally occurring noise and to partially mitigate the resulting loss in accuracy.
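A minimal sketch of this noise-synthesis idea follows; the noise types and rates are illustrative assumptions, not the exact procedure used with the MTNT data:

```python
# Illustrative sketch (assumed noise types and rates): inject character-level
# "natural-looking" noise into clean text before training/fine-tuning.
import random

def add_noise(sentence, p_drop=0.03, p_swap=0.03, p_dup=0.02, seed=0):
    rng = random.Random(seed)
    chars = list(sentence)
    out = []
    i = 0
    while i < len(chars):
        r = rng.random()
        if r < p_drop:                                   # drop a character (typo)
            i += 1
            continue
        if r < p_drop + p_swap and i + 1 < len(chars):   # swap adjacent characters
            out.extend([chars[i + 1], chars[i]])
            i += 2
            continue
        if r < p_drop + p_swap + p_dup:                  # duplicate a character
            out.extend([chars[i], chars[i]])
            i += 1
            continue
        out.append(chars[i])
        i += 1
    return "".join(out)

clean = "this is a perfectly clean sentence for the translation system"
print(add_noise(clean))
```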
We propose a novel framework for learning stabilizable nonlinear dynamical systems for continuous control tasks in robotics. The key idea is to develop a new control-theoretic regularizer for dynamics fitting rooted in the notion of stabilizability, which guarantees that the learned system can be accompanied by a robust controller capable of stabilizing any open-loop trajectory that the system may generate. By leveraging tools from contraction theory, statistical learning, and convex optimization, we provide a general and tractable semi-supervised algorithm to learn stabilizable dynamics, which can be applied to complex underactuated systems. We validate the proposed algorithm on a simulated planar quadrotor system and observe notably improved trajectory generation and tracking performance with the control-theoretic regularized model over models learned using traditional regression techniques, especially when using a small number of demonstration examples. The results presented illustrate the need to infuse standard model-based reinforcement learning algorithms with concepts drawn from nonlinear control theory for improved reliability.
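For background, the stabilizability regularizer builds on the standard contraction condition; the sketch below states it for the autonomous case, whereas the control-system form used in the paper (involving the learned dynamics and input matrix) is more involved:

```latex
% Standard contraction condition (background sketch, autonomous case):
% a metric M(x) \succ 0 and rate \lambda > 0 certify that neighboring
% trajectories of \dot{x} = f(x) converge to one another exponentially.
\dot{M}(x) + \left(\frac{\partial f}{\partial x}\right)^{\!\top} M(x)
  + M(x)\,\frac{\partial f}{\partial x} \preceq -2\lambda\, M(x)
```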
In the pursuit of real-time motion planning, a commonly adopted practice is to compute a trajectory by running a planning algorithm on a simplified, low-dimensional dynamical model, and then employ a feedback tracking controller that tracks such a trajectory by accounting for the full, high-dimensional system dynamics. While this strategy of planning with model mismatch generally yields fast computation times, there are no guarantees of dynamic feasibility, which hampers application to safety-critical systems. Building upon recent work that addressed this problem through the lens of Hamilton-Jacobi (HJ) reachability, we devise an algorithmic framework whereby one computes, offline, for a pair of "planner" (i.e., low-dimensional) and "tracking" (i.e., high-dimensional) models, a feedback tracking controller and associated tracking bound. This bound is then used as a safety margin when generating motion plans via the low-dimensional model. Specifically, we harness the computational tool of sum-of-squares (SOS) programming to design a bilinear optimization algorithm for the computation of the feedback tracking controller and associated tracking bound. The algorithm is demonstrated via numerical experiments, with an emphasis on investigating the trade-off between the increased computational scalability afforded by SOS and its intrinsic conservativeness. Collectively, our results enable scaling the appealing strategy of planning with model mismatch to systems that are beyond the reach of HJ analysis, while maintaining safety guarantees.
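The conditions being certified can be sketched as follows; the notation here is assumed for illustration, with e denoting the planner-tracker error state and d the planner's input treated as a bounded disturbance, and the exact formulation in the paper differs in detail:

```latex
% Sketch of the invariance condition behind the SOS tracking bound:
% find a certificate V(e), controller u = k(e), and level \rho such that the
% sublevel set {V(e) <= \rho} is invariant for the error dynamics
% \dot{e} = g(e, u, d) for all admissible planner inputs d.
V(e) = \rho \;\wedge\; d \in \mathcal{D}
  \;\Longrightarrow\;
  \frac{\partial V}{\partial e}\, g\bigl(e, k(e), d\bigr) \le 0
```

Because the unknowns V, k, and the SOS multipliers that encode this implication appear in products, the resulting program is bilinear, motivating the alternation scheme described above; the tracking bound used as a safety margin is then obtained from the invariant sublevel set.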
The literature on Inverse Reinforcement Learning (IRL) typically assumes that humans take actions in order to minimize the expected value of a cost function, i.e., that humans are risk neutral. Yet, in practice, humans are often far from being risk neutral. To fill this gap, the objective of this paper is to devise a framework for risk-sensitive IRL in order to explicitly account for a human's risk sensitivity. To this end, we propose a flexible class of models based on coherent risk measures, which allow us to capture an entire spectrum of risk preferences from risk-neutral to worst-case. We propose efficient non-parametric algorithms based on linear programming and semi-parametric algorithms based on maximum likelihood for inferring a human's underlying risk measure and cost function for a rich class of static and dynamic decision-making settings. The resulting approach is demonstrated on a simulated driving game with ten human participants. Our method is able to infer and mimic a wide range of qualitatively different driving styles from highly risk-averse to risk-neutral in a data-efficient manner. Moreover, comparisons of the Risk-Sensitive (RS) IRL approach with a risk-neutral model show that the RS-IRL framework more accurately captures observed participant behavior both qualitatively and quantitatively, especially in scenarios where catastrophic outcomes such as collisions can occur.
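A small sketch of one coherent risk measure in this spectrum, the conditional value-at-risk (CVaR), using the standard Rockafellar-Uryasev sample estimator; the cost distribution below is synthetic and purely illustrative:

```python
# Illustrative sketch (not the paper's inference algorithm): CVaR is one
# coherent risk measure; sweeping alpha interpolates from risk-neutral
# (alpha = 1, plain expectation) to worst-case (alpha -> 0).
import numpy as np

def cvar(costs, alpha):
    # Rockafellar-Uryasev sample estimator: mean of the worst alpha-fraction
    costs = np.asarray(costs, dtype=float)
    var = np.quantile(costs, 1.0 - alpha)                # value-at-risk threshold
    return var + np.mean(np.maximum(costs - var, 0.0)) / alpha

rng = np.random.default_rng(0)
costs = rng.lognormal(mean=0.0, sigma=0.8, size=10_000)  # heavy-tailed costs
for alpha in (1.0, 0.5, 0.1, 0.01):
    print(f"alpha={alpha}: CVaR={cvar(costs, alpha):.3f}")
```

Sweeping alpha from 1 toward 0 moves the evaluation from the risk-neutral expectation toward the worst case, which is the spectrum of risk preferences the inference algorithms search over.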
This paper presents decentralized algorithms for formation control of multiple robots in three dimensions. Specifically, we leverage the mathematical properties of cyclic pursuit along with results from contraction and partial contraction theory to design decentralized control algorithms that ensure global convergence to symmetric formations. We first consider regular polygon formations as a base case, and then extend the results to Johnson solid and other polygonal mesh formations. The algorithms are further augmented to allow control over formation size and avoid collisions with other robots in the formation. The robustness properties of the algorithms are assessed in the presence of bounded additive disturbances and their effect on the quality of the formation is quantified. Finally, we present a general methodology for embedding the control laws on complex dynamical systems, in this case, quadcopters, and validate this approach via simulations and experiments on a fleet of quadcopters.
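A minimal planar sketch of the cyclic-pursuit building block follows; the agent count, gain, and rotation offset are illustrative, and the paper's control laws operate in three dimensions and additionally handle formation size and collision avoidance:

```python
# Minimal 2D cyclic-pursuit sketch (illustrative gains and agent count):
# each agent pursues its successor along a direction rotated by theta,
# driving the group toward an evenly spaced circular formation.
import numpy as np

def rot(theta):
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s], [s, c]])

n, k, theta, dt, steps = 6, 1.0, np.pi / 6, 0.02, 5000
rng = np.random.default_rng(1)
x = rng.uniform(-1, 1, (n, 2))                 # random initial positions

R = rot(theta)
for _ in range(steps):
    # u_i = k * R(theta) @ (x_{i+1} - x_i), indices taken cyclically
    u = k * (R @ (np.roll(x, -1, axis=0) - x).T).T
    x = x + dt * u

d = np.linalg.norm(np.roll(x, -1, axis=0) - x, axis=1)
print("neighbor distances:", np.round(d, 3))   # roughly equal -> symmetric formation
```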