An effective understanding of the environment and accurate trajectory prediction of surrounding dynamic obstacles are indispensable for intelligent mobile systems (e.g., autonomous vehicles and social robots) to achieve safe and high-quality planning when navigating highly interactive and crowded scenarios. Due to frequent interactions and uncertainty in the scene evolution, the prediction system should be able to reason about relations among different entities and provide a distribution over future trajectories for each agent. In this paper, we propose a generic generative neural system (called STG-DAT) for multi-agent trajectory prediction involving heterogeneous agents. The system advances explicit interaction modeling by incorporating relational inductive biases with a dynamic graph representation, and it leverages both trajectory and scene context information. We also employ an efficient kinematic constraint layer for vehicle trajectory prediction, which not only ensures physical feasibility but also improves model performance. Moreover, the proposed prediction model can be easily adopted by multi-target tracking frameworks; empirical results show that it improves tracking accuracy. The proposed system is evaluated on three public benchmark datasets for trajectory prediction, where the agents cover pedestrians, cyclists, and on-road vehicles. The experimental results demonstrate that our model outperforms various baseline approaches in terms of both prediction and tracking accuracy.
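STG-DAT's learned dynamic graph is beyond a short snippet, but the basic idea of relational reasoning over an agent-interaction graph can be sketched as one round of message passing. Everything below (feature dimensions, the fixed adjacency, the tanh update, the weight matrices) is an illustrative assumption, not the paper's architecture:

```python
import numpy as np

def message_passing(node_feats, adj, w_self, w_nbr):
    """One round of message passing on an agent-interaction graph (sketch).
    node_feats: (n_agents, d) agent embeddings; adj: (n_agents, n_agents)
    nonnegative edge weights. Each agent aggregates a weighted mean of its
    neighbors' features, then combines it with its own state."""
    norm = adj / np.maximum(adj.sum(axis=1, keepdims=True), 1e-8)
    messages = norm @ node_feats            # neighbor aggregation
    return np.tanh(node_feats @ w_self + messages @ w_nbr)

rng = np.random.default_rng(0)
n, d = 4, 8                                 # 4 agents, 8-dim embeddings
feats = rng.normal(size=(n, d))
adj = (rng.uniform(size=(n, n)) < 0.5).astype(float)
np.fill_diagonal(adj, 0.0)                  # self term handled by w_self
out = message_passing(feats, adj, rng.normal(size=(d, d)), rng.normal(size=(d, d)))
print(out.shape)                            # one updated embedding per agent
```

In a dynamic-graph model, `adj` would itself be recomputed (or learned) at each time step as agents move, rather than being fixed as it is here.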
Many manipulation tasks require robots to interact with unknown environments. In such applications, the ability to adapt the impedance according to different task phases and environment constraints is crucial for safety and performance. Although many approaches based on deep reinforcement learning (RL) and learning from demonstration (LfD) have been proposed to obtain variable impedance skills on contact-rich manipulation tasks, these skills are typically task-specific and can be sensitive to changes in task settings. This paper proposes an inverse reinforcement learning (IRL) based approach to recover both the variable impedance policy and the reward function from expert demonstrations. We explore different action spaces for the reward function to achieve a more general representation of expert variable impedance skills. Experiments on two variable impedance tasks (Peg-in-Hole and Cup-on-Plate) were conducted both in simulation and on a real FANUC LR Mate 200iD/7L industrial robot. Comparisons with behavior cloning and force-based IRL show that the reward function learned in the gain action space transfers better than the one learned in the force space. Experiment videos are available at https://msc.berkeley.edu/research/impedance-irl.html.
In this paper, a practical fractional-order variable-gain super-twisting algorithm (PFVSTA) is proposed to improve the tracking performance of wafer stages for semiconductor manufacturing. Based on sliding mode control (SMC), the proposed PFVSTA enhances tracking performance in three respects: 1) alleviating the chattering phenomenon via the super-twisting algorithm and a novel fractional-order sliding surface~(FSS) design, 2) improving the dynamics of states on the sliding surface, with fast response and small overshoot, via the designed novel FSS, and 3) compensating for disturbances via a variable-gain control law. Based on practical conditions, this paper analyzes the stability of the controller and illustrates the theoretical principle for compensating the uncertainties caused by accelerations. Moreover, numerical simulations prove the effectiveness of the proposed sliding surface and control scheme, and they agree with the theoretical analysis. Finally, practice-based comparative experiments are conducted. The results show that the proposed PFVSTA achieves much better tracking performance than conventional methods from various perspectives.
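The fractional-order surface and variable gains of PFVSTA do not fit in a short snippet, but the standard (integer-order) super-twisting law the method builds on can be sketched in a few lines. The toy plant, gains `k1`, `k2`, and disturbance below are illustrative choices, not the paper's wafer-stage setup:

```python
import numpy as np

def super_twisting_step(s, v, k1, k2, dt):
    """One Euler step of the standard super-twisting control law:
    u = -k1*|s|^(1/2)*sign(s) + v,   dv/dt = -k2*sign(s)."""
    u = -k1 * np.sqrt(abs(s)) * np.sign(s) + v
    v = v - k2 * np.sign(s) * dt
    return u, v

# Toy first-order plant: s_dot = u + d(t), with a bounded matched disturbance.
dt, steps = 1e-3, 5000
s, v, k1, k2 = 1.0, 0.0, 2.0, 2.0       # illustrative gains
for i in range(steps):
    d = 0.5 * np.sin(i * dt)            # disturbance with bounded derivative
    u, v = super_twisting_step(s, v, k1, k2, dt)
    s += (u + d) * dt                   # Euler integration of the plant
# s is driven into a small neighborhood of zero despite d(t)
```

Because the discontinuous sign term acts only on the integrator `v`, the control signal `u` itself is continuous, which is the mechanism behind the chattering reduction the abstract refers to.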
The precise motion control of a multi-degree-of-freedom~(DOF) robot manipulator is always challenging due to its nonlinear dynamics, disturbances, and uncertainties. Because most manipulators are controlled by digital signals, a novel higher-order sliding mode controller in discrete-time form with time delay estimation is proposed in this paper. The dynamic model of the manipulator used in the design allows proper handling of the nonlinearities, uncertainties, and disturbances involved in the problem. Specifically, parametric uncertainties and disturbances are handled by the time delay estimation, and the nonlinearity of the manipulator is addressed by the feedback structure of the controller. The combination of a terminal sliding mode surface and a higher-order control scheme guarantees a fast response with a small chattering amplitude. Moreover, the controller is designed with a modified sliding mode surface and a variable-gain structure, further enhancing its performance. We also analyze the conditions that guarantee the stability of the closed-loop system. Finally, simulation and experimental results prove that the proposed control scheme achieves precise performance on a robot manipulator system.
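The core trick of time delay estimation is to treat the lumped uncertainty as slowly varying and estimate it from the previous sample's measured response. A minimal sketch on a hypothetical 1-DOF plant, with all numbers (inertia, disturbance, desired acceleration) chosen purely for illustration:

```python
# Time delay estimation (TDE) sketch on a 1-DOF plant: m*a = u + d, where d is
# an unknown lumped disturbance. TDE assumes d varies slowly over one sample:
#     d_hat(k) = m_bar * a(k-1) - u(k-1)
# For illustration only, m_bar equals the true inertia and d is constant, so
# the estimate converges after a single step.
m_true, m_bar, d_true, a_des = 2.0, 2.0, 1.0, 0.5
u_prev = a_prev = d_hat = 0.0
for k in range(10):
    d_hat = m_bar * a_prev - u_prev     # disturbance estimate from last sample
    u = m_bar * a_des - d_hat           # command desired accel., cancel d_hat
    a = (u + d_true) / m_true           # true plant response
    u_prev, a_prev = u, a
# a tracks a_des and d_hat recovers d_true after the first sample
```

In the actual controller, this estimate is combined with the higher-order sliding mode term, which mops up the residual estimation error when `d` is not constant and `m_bar` is not exact.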
To obtain precise motion control of wafer stages, an adaptive neural network and fractional-order super-twisting control strategy is proposed. Based on sliding mode control (SMC), the proposed controller aims to address two challenges in SMC: 1) reducing the chattering phenomenon, and 2) attenuating the influence of model uncertainties and disturbances. For the first challenge, a fractional-order terminal sliding mode surface and a super-twisting algorithm are integrated into the SMC design. To attenuate uncertainties and disturbances, an add-on control structure based on a radial basis function (RBF) neural network is introduced. Stability analysis of the closed-loop control system is provided. Finally, experiments on a wafer stage testbed are conducted, showing that the proposed controller robustly improves tracking performance in the presence of uncertainties and disturbances compared with conventional and prior controllers.
Peg-in-hole assembly is a challenging contact-rich manipulation task, and there is no general solution for identifying the relative position and orientation between the peg and the hole. In this paper, we propose a novel method to classify contact poses based on a sequence of contact measurements. When the peg contacts the hole under pose uncertainty, a tilt-then-rotate strategy is applied, and the contacts are measured as a group of patterns that encode the contact pose. A convolutional neural network (CNN) is trained to classify the contact poses according to these patterns. Finally, an admittance controller guides the peg toward the error direction and completes the peg-in-hole assembly. Simulations and experiments show that the proposed method can be applied to peg-in-hole assembly for different geometries. We also demonstrate that the method alleviates the sim-to-real gap.
Autonomous vehicles need to handle various traffic conditions and make safe and efficient decisions and maneuvers. However, on the one hand, a single optimization/sampling-based motion planner cannot efficiently generate safe trajectories in real time, particularly when there are many interactive vehicles nearby. On the other hand, end-to-end learning methods cannot guarantee the safety of their outcomes. To address this challenge, we propose a hierarchical behavior planning framework with a set of low-level safe controllers and a high-level reinforcement learning algorithm (H-CtRL) as a coordinator for the low-level controllers. Safety is guaranteed by the low-level optimization/sampling-based controllers, while the high-level reinforcement learning algorithm makes H-CtRL an adaptive and efficient behavior planner. To train and test the proposed algorithm, we built a simulator that can reproduce traffic scenes from real-world datasets. H-CtRL proves effective in various realistic simulation scenarios, with satisfactory performance in terms of both safety and efficiency.
Autonomous vehicles (AVs) need to interact with other traffic participants who can be either cooperative or aggressive, attentive or inattentive. Such different characteristics can lead to quite different interactive behaviors. Hence, to achieve safe and efficient autonomous driving, AVs need to account for such uncertainties when planning their own behaviors. In this paper, we formulate the behavior planning problem as a partially observable Markov decision process (POMDP) in which the cooperativeness of other traffic participants is treated as an unobservable state. For different cooperativeness levels, we learn human behavior models from real traffic data via the principle of maximum likelihood. Based on these models, the POMDP is solved by Monte Carlo tree search. We verify the proposed algorithm both in simulation and on real traffic data in a lane-change scenario, and the results show that it can successfully complete lane changes without collisions.
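The "unobservable cooperativeness" state is maintained as a belief that is updated from observed actions; inside the tree search this is just a Bayes update against the learned behavior models. A minimal sketch with two hypothetical levels and made-up likelihood numbers (in the paper these come from maximum-likelihood fits to real traffic data):

```python
import numpy as np

# Hypothetical cooperativeness levels and P(observed action | level).
# The action names and probabilities are illustrative, not fitted values.
levels = ["cooperative", "aggressive"]
likelihood = {
    "yield":      np.array([0.8, 0.2]),   # P(yield | level)
    "accelerate": np.array([0.2, 0.8]),   # P(accelerate | level)
}

def update_belief(belief, observation):
    """Bayes update of the belief over the unobservable cooperativeness."""
    post = belief * likelihood[observation]
    return post / post.sum()

belief = np.array([0.5, 0.5])             # uniform prior
for obs in ["yield", "yield", "accelerate"]:
    belief = update_belief(belief, obs)
print(belief)                             # mass shifts toward "cooperative"
```

The planner then chooses lane-change actions against this belief, so a driver who has yielded twice is treated as likely cooperative even after one aggressive observation.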
The availability of many real-world driving datasets is a key reason behind the recent progress of object detection algorithms in autonomous driving. However, object labels can be ambiguous or even erroneous due to the error-prone annotation process or sensor observation noise. Current public object detection datasets only provide deterministic object labels without considering their inherent uncertainty, and the same holds for common training processes and evaluation metrics for object detectors. As a result, an in-depth evaluation among different object detection methods remains challenging, and the training process of object detectors is sub-optimal, especially in probabilistic object detection. In this work, we infer the uncertainty in bounding box labels from LiDAR point clouds with a generative model, and we define a new representation of the probabilistic bounding box through a spatial uncertainty distribution. Comprehensive experiments show that the proposed model reflects complex environmental noises in LiDAR perception as well as label quality. Furthermore, we propose Jaccard IoU (JIoU) as a new evaluation metric that extends IoU by incorporating label uncertainty. We conduct an in-depth comparison among several LiDAR-based object detectors using the JIoU metric. Finally, we incorporate the proposed label uncertainty into a loss function to train a probabilistic object detector and improve its detection accuracy. We verify the proposed methods on two public datasets (KITTI, Waymo), as well as on simulation data. Code is released at https://bit.ly/2W534yo.
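The idea behind extending IoU with label uncertainty can be sketched as a Jaccard index between two spatial distributions on a grid, which reduces to ordinary IoU when both boxes are deterministic. This grid-based form is an assumed simplification for illustration; the paper's exact JIoU definition may differ:

```python
import numpy as np

def jiou(p_pred, p_label):
    """Jaccard index between two nonnegative occupancy maps on the same grid
    (assumed sketch of the JIoU idea): sum(min) / sum(max)."""
    inter = np.minimum(p_pred, p_label).sum()
    union = np.maximum(p_pred, p_label).sum()
    return inter / union

# Hard (deterministic) boxes reduce to ordinary IoU ...
a = np.zeros((10, 10)); a[2:6, 2:6] = 1.0   # 4x4 box
b = np.zeros((10, 10)); b[4:8, 4:8] = 1.0   # 4x4 box, 2x2 overlap
print(jiou(a, b))                           # 4/28, same as IoU

# ... while a soft label spreads mass to reflect label uncertainty.
soft = b * 0.5
print(jiou(a, soft))                        # overlap is discounted
```

Under this formulation, a prediction overlapping an uncertain label region is penalized less harshly than under hard IoU, which is what makes the metric sensitive to label quality.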
We present Sparse R-CNN, a purely sparse method for object detection in images. Existing works on object detection rely heavily on dense object candidates, such as $k$ anchor boxes pre-defined on every grid cell of an image feature map of size $H\times W$. In our method, however, a fixed sparse set of $N$ learned object proposals is provided to the object recognition head to perform classification and localization. By reducing the $HWk$ (up to hundreds of thousands) hand-designed object candidates to $N$ (e.g., 100) learnable proposals, Sparse R-CNN completely avoids all effort related to object candidate design and many-to-one label assignment. More importantly, final predictions are output directly without non-maximum suppression post-processing. Sparse R-CNN demonstrates accuracy, run-time, and training convergence performance on par with well-established detector baselines on the challenging COCO dataset, e.g., achieving 44.5 AP with the standard $3\times$ training schedule and running at 22 fps with a ResNet-50 FPN model. We hope our work can inspire re-thinking of the dense-prior convention in object detectors. The code is available at: https://github.com/PeizeSun/SparseR-CNN.
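The scale gap between dense candidates and learned proposals can be made concrete with a few lines. The feature-map size below is an arbitrary illustration (not from the paper), and the random initialization only stands in for what are, in the real model, parameters updated by back-propagation:

```python
import numpy as np

# Dense detectors tile k anchors on every cell of an HxW feature map ...
H, W, k = 100, 152, 9          # illustrative feature-map size and anchor count
num_dense = H * W * k          # 136800 hand-designed candidates
print(num_dense)

# ... whereas a sparse detector starts from N learned proposal boxes.
# Sketch: N boxes in normalized (cx, cy, w, h) form, randomly initialized here;
# in Sparse R-CNN these are learnable parameters, not fixed priors.
N = 100
rng = np.random.default_rng(0)
proposals = rng.uniform(0.0, 1.0, size=(N, 4))
print(num_dense // N)          # three orders of magnitude fewer candidates
```

Because each of the $N$ proposals is matched one-to-one with a ground-truth object during training, duplicate predictions are suppressed by learning rather than by an NMS post-processing step.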