Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mykel J. Kochenderfer

Conditional Approximate Normalizing Flows for Joint Multi-Step Probabilistic Electricity Demand Forecasting

Jan 08, 2022
Arec Jamgochian, Di Wu, Kunal Menda, Soyeon Jung, Mykel J. Kochenderfer

Figure 1 for Conditional Approximate Normalizing Flows for Joint Multi-Step Probabilistic Electricity Demand Forecasting

Figure 2 for Conditional Approximate Normalizing Flows for Joint Multi-Step Probabilistic Electricity Demand Forecasting

Figure 3 for Conditional Approximate Normalizing Flows for Joint Multi-Step Probabilistic Electricity Demand Forecasting

Figure 4 for Conditional Approximate Normalizing Flows for Joint Multi-Step Probabilistic Electricity Demand Forecasting

Some real-world decision-making problems require making probabilistic forecasts over multiple steps at once. However, methods for probabilistic forecasting may fail to capture correlations in the underlying time-series that exist over long time horizons as errors accumulate. One such application is with resource scheduling under uncertainty in a grid environment, which requires forecasting electricity demand that is inherently noisy, but often cyclic. In this paper, we introduce the conditional approximate normalizing flow (CANF) to make probabilistic multi-step time-series forecasts when correlations are present over long time horizons. We first demonstrate our method's efficacy on estimating the density of a toy distribution, finding that CANF improves the KL divergence by one-third compared to that of a Gaussian mixture model while still being amenable to explicit conditioning. We then use a publicly available household electricity consumption dataset to showcase the effectiveness of CANF on joint probabilistic multi-step forecasting. Empirical results show that conditional approximate normalizing flows outperform other methods in terms of multi-step forecast accuracy and lead to up to 10x better scheduling decisions. Our implementation is available at https://github.com/sisl/JointDemandForecasting.

Via

Access Paper or Ask Questions

Multi-Vehicle Control in Roundabouts using Decentralized Game-Theoretic Planning

Jan 08, 2022
Arec Jamgochian, Kunal Menda, Mykel J. Kochenderfer

Figure 1 for Multi-Vehicle Control in Roundabouts using Decentralized Game-Theoretic Planning

Figure 2 for Multi-Vehicle Control in Roundabouts using Decentralized Game-Theoretic Planning

Figure 3 for Multi-Vehicle Control in Roundabouts using Decentralized Game-Theoretic Planning

Figure 4 for Multi-Vehicle Control in Roundabouts using Decentralized Game-Theoretic Planning

Safe navigation in dense, urban driving environments remains an open problem and an active area of research. Unlike typical predict-then-plan approaches, game-theoretic planning considers how one vehicle's plan will affect the actions of another. Recent work has demonstrated significant improvements in the time required to find local Nash equilibria in general-sum games with nonlinear objectives and constraints. When applied trivially to driving, these works assume all vehicles in a scene play a game together, which can result in intractable computation times for dense traffic. We formulate a decentralized approach to game-theoretic planning by assuming that agents only play games within their observational vicinity, which we believe to be a more reasonable assumption for human driving. Games are played in parallel for all strongly connected components of an interaction graph, significantly reducing the number of players and constraints in each game, and therefore the time required for planning. We demonstrate that our approach can achieve collision-free, efficient driving in urban environments by comparing performance against an adaptation of the Intelligent Driver Model and centralized game-theoretic planning when navigating roundabouts in the INTERACTION dataset. Our implementation is available at http://github.com/sisl/DecNashPlanning.

* Presented in the Artificial Intelligence for Autonomous Driving (AI4AD) Workshop at the 30th International Joint Conference on Artificial Intelligence (IJCAI 2021)

Via

Access Paper or Ask Questions

Dyadic Sex Composition and Task Classification Using fNIRS Hyperscanning Data

Dec 07, 2021
Liam A. Kruse, Allan L. Reiss, Mykel J. Kochenderfer, Stephanie Balters

Figure 1 for Dyadic Sex Composition and Task Classification Using fNIRS Hyperscanning Data

Figure 2 for Dyadic Sex Composition and Task Classification Using fNIRS Hyperscanning Data

Figure 3 for Dyadic Sex Composition and Task Classification Using fNIRS Hyperscanning Data

Figure 4 for Dyadic Sex Composition and Task Classification Using fNIRS Hyperscanning Data

Hyperscanning with functional near-infrared spectroscopy (fNIRS) is an emerging neuroimaging application that measures the nuanced neural signatures underlying social interactions. Researchers have assessed the effect of sex and task type (e.g., cooperation versus competition) on inter-brain coherence during human-to-human interactions. However, no work has yet used deep learning-based approaches to extract insights into sex and task-based differences in an fNIRS hyperscanning context. This work proposes a convolutional neural network-based approach to dyadic sex composition and task classification for an extensive hyperscanning dataset with $N = 222$ participants. Inter-brain signal similarity computed using dynamic time warping is used as the input data. The proposed approach achieves a maximum classification accuracy of greater than $80$ percent, thereby providing a new avenue for exploring and understanding complex brain behavior.

* 20th IEEE International Conference on Machine Learning and Applications

Via

Access Paper or Ask Questions

Autonomous Attack Mitigation for Industrial Control Systems

Nov 03, 2021
John Mern, Kyle Hatch, Ryan Silva, Cameron Hickert, Tamim Sookoor, Mykel J. Kochenderfer

Figure 1 for Autonomous Attack Mitigation for Industrial Control Systems

Figure 2 for Autonomous Attack Mitigation for Industrial Control Systems

Figure 3 for Autonomous Attack Mitigation for Industrial Control Systems

Figure 4 for Autonomous Attack Mitigation for Industrial Control Systems

Defending computer networks from cyber attack requires timely responses to alerts and threat intelligence. Decisions about how to respond involve coordinating actions across multiple nodes based on imperfect indicators of compromise while minimizing disruptions to network operations. Currently, playbooks are used to automate portions of a response process, but often leave complex decision-making to a human analyst. In this work, we present a deep reinforcement learning approach to autonomous response and recovery in large industrial control networks. We propose an attention-based neural architecture that is flexible to the size of the network under protection. To train and evaluate the autonomous defender agent, we present an industrial control network simulation environment suitable for reinforcement learning. Experiments show that the learned agent can effectively mitigate advanced attacks that progress with few observable signals over several months before execution. The proposed deep reinforcement learning approach outperforms a fully automated playbook method in simulation, taking less disruptive actions while also defending more nodes on the network. The learned policy is also more robust to changes in attacker behavior than playbook approaches.

* 11 pages

Via

Access Paper or Ask Questions

Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

Oct 27, 2021
Phil Chen, Masha Itkina, Ransalu Senanayake, Mykel J. Kochenderfer

Figure 1 for Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

Figure 2 for Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

Figure 3 for Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

Figure 4 for Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

Many applications of generative models rely on the marginalization of their high-dimensional output probability distributions. Normalization functions that yield sparse probability distributions can make exact marginalization more computationally tractable. However, sparse normalization functions usually require alternative loss functions for training since the log-likelihood is undefined for sparse probability distributions. Furthermore, many sparse normalization functions often collapse the multimodality of distributions. In this work, we present $\textit{ev-softmax}$, a sparse normalization function that preserves the multimodality of probability distributions. We derive its properties, including its gradient in closed-form, and introduce a continuous family of approximations to $\textit{ev-softmax}$ that have full support and can be trained with probabilistic loss functions such as negative log-likelihood and Kullback-Leibler divergence. We evaluate our method on a variety of generative models, including variational autoencoders and auto-regressive architectures. Our method outperforms existing dense and sparse normalization techniques in distributional accuracy. We demonstrate that $\textit{ev-softmax}$ successfully reduces the dimensionality of probability distributions while maintaining multimodality.

* Accepted to NeurIPS 2021. Code is available at https://github.com/sisl/EvSoftmax

Via

Access Paper or Ask Questions

Interpretable Local Tree Surrogate Policies

Sep 16, 2021
John Mern, Sidhart Krishnan, Anil Yildiz, Kyle Hatch, Mykel J. Kochenderfer

Figure 1 for Interpretable Local Tree Surrogate Policies

Figure 2 for Interpretable Local Tree Surrogate Policies

Figure 3 for Interpretable Local Tree Surrogate Policies

Figure 4 for Interpretable Local Tree Surrogate Policies

High-dimensional policies, such as those represented by neural networks, cannot be reasonably interpreted by humans. This lack of interpretability reduces the trust users have in policy behavior, limiting their use to low-impact tasks such as video games. Unfortunately, many methods rely on neural network representations for effective learning. In this work, we propose a method to build predictable policy trees as surrogates for policies such as neural networks. The policy trees are easily human interpretable and provide quantitative predictions of future behavior. We demonstrate the performance of this approach on several simulated tasks.

* pre-print, submitted to AAAI 2022 Conference, 7 pages

Via

Access Paper or Ask Questions

Multi-Agent Variational Occlusion Inference Using People as Sensors

Sep 05, 2021
Masha Itkina, Ye-Ji Mun, Katherine Driggs-Campbell, Mykel J. Kochenderfer

Figure 1 for Multi-Agent Variational Occlusion Inference Using People as Sensors

Figure 2 for Multi-Agent Variational Occlusion Inference Using People as Sensors

Figure 3 for Multi-Agent Variational Occlusion Inference Using People as Sensors

Figure 4 for Multi-Agent Variational Occlusion Inference Using People as Sensors

Autonomous vehicles must reason about spatial occlusions in urban environments to ensure safety without being overly cautious. Prior work explored occlusion inference from observed social behaviors of road agents. Inferring occupancy from agent behaviors is an inherently multimodal problem; a driver may behave in the same manner for different occupancy patterns ahead of them (e.g., a driver may move at constant speed in traffic or on an open road). Past work, however, does not account for this multimodality, thus neglecting to model this source of aleatoric uncertainty in the relationship between driver behaviors and their environment. We propose an occlusion inference method that characterizes observed behaviors of human agents as sensor measurements, and fuses them with those from a standard sensor suite. To capture the aleatoric uncertainty, we train a conditional variational autoencoder with a discrete latent space to learn a multimodal mapping from observed driver trajectories to an occupancy grid representation of the view ahead of the driver. Our method handles multi-agent scenarios, combining measurements from multiple observed drivers using evidential theory to solve the sensor fusion problem. Our approach is validated on a real-world dataset, outperforming baselines and demonstrating real-time capable performance. Our code is available at https://github.com/sisl/MultiAgentVariationalOcclusionInference .

* 21 pages, 11 figures

Via

Access Paper or Ask Questions

OVERT: An Algorithm for Safety Verification of Neural Network Control Policies for Nonlinear Systems

Aug 03, 2021
Chelsea Sidrane, Amir Maleki, Ahmed Irfan, Mykel J. Kochenderfer

Figure 1 for OVERT: An Algorithm for Safety Verification of Neural Network Control Policies for Nonlinear Systems

Figure 2 for OVERT: An Algorithm for Safety Verification of Neural Network Control Policies for Nonlinear Systems

Figure 3 for OVERT: An Algorithm for Safety Verification of Neural Network Control Policies for Nonlinear Systems

Figure 4 for OVERT: An Algorithm for Safety Verification of Neural Network Control Policies for Nonlinear Systems

Deep learning methods can be used to produce control policies, but certifying their safety is challenging. The resulting networks are nonlinear and often very large. In response to this challenge, we present OVERT: a sound algorithm for safety verification of nonlinear discrete-time closed loop dynamical systems with neural network control policies. The novelty of OVERT lies in combining ideas from the classical formal methods literature with ideas from the newer neural network verification literature. The central concept of OVERT is to abstract nonlinear functions with a set of optimally tight piecewise linear bounds. Such piecewise linear bounds are designed for seamless integration into ReLU neural network verification tools. OVERT can be used to prove bounded-time safety properties by either computing reachable sets or solving feasibility queries directly. We demonstrate various examples of safety verification for several classical benchmark examples. OVERT compares favorably to existing methods both in computation time and in tightness of the reachable set.

* 44 pages, under review

Via

Access Paper or Ask Questions

Finding Failures in High-Fidelity Simulation using Adaptive Stress Testing and the Backward Algorithm

Jul 27, 2021
Mark Koren, Ahmed Nassar, Mykel J. Kochenderfer

Figure 1 for Finding Failures in High-Fidelity Simulation using Adaptive Stress Testing and the Backward Algorithm

Figure 2 for Finding Failures in High-Fidelity Simulation using Adaptive Stress Testing and the Backward Algorithm

Figure 3 for Finding Failures in High-Fidelity Simulation using Adaptive Stress Testing and the Backward Algorithm

Figure 4 for Finding Failures in High-Fidelity Simulation using Adaptive Stress Testing and the Backward Algorithm

Validating the safety of autonomous systems generally requires the use of high-fidelity simulators that adequately capture the variability of real-world scenarios. However, it is generally not feasible to exhaustively search the space of simulation scenarios for failures. Adaptive stress testing (AST) is a method that uses reinforcement learning to find the most likely failure of a system. AST with a deep reinforcement learning solver has been shown to be effective in finding failures across a range of different systems. This approach generally involves running many simulations, which can be very expensive when using a high-fidelity simulator. To improve efficiency, we present a method that first finds failures in a low-fidelity simulator. It then uses the backward algorithm, which trains a deep neural network policy using a single expert demonstration, to adapt the low-fidelity failures to high-fidelity. We have created a series of autonomous vehicle validation case studies that represent some of the ways low-fidelity and high-fidelity simulators can differ, such as time discretization. We demonstrate in a variety of case studies that this new AST approach is able to find failures with significantly fewer high-fidelity simulation steps than are needed when just running AST directly in high-fidelity. As a proof of concept, we also demonstrate AST on NVIDIA's DriveSim simulator, an industry state-of-the-art high-fidelity simulator for finding failures in autonomous vehicles.

* Accepted to IROS 2021

Via

Access Paper or Ask Questions

3D Radar Velocity Maps for Uncertain Dynamic Environments

Jul 23, 2021
Ransalu Senanayake, Kyle Beltran Hatch, Jason Zheng, Mykel J. Kochenderfer

Figure 1 for 3D Radar Velocity Maps for Uncertain Dynamic Environments

Figure 2 for 3D Radar Velocity Maps for Uncertain Dynamic Environments

Figure 3 for 3D Radar Velocity Maps for Uncertain Dynamic Environments

Figure 4 for 3D Radar Velocity Maps for Uncertain Dynamic Environments

Future urban transportation concepts include a mixture of ground and air vehicles with varying degrees of autonomy in a congested environment. In such dynamic environments, occupancy maps alone are not sufficient for safe path planning. Safe and efficient transportation requires reasoning about the 3D flow of traffic and properly modeling uncertainty. Several different approaches can be taken for developing 3D velocity maps. This paper explores a Bayesian approach that captures our uncertainty in the map given training data. The approach involves projecting spatial coordinates into a high-dimensional feature space and then applying Bayesian linear regression to make predictions and quantify uncertainty in our estimates. On a collection of air and ground datasets, we demonstrate that this approach is effective and more scalable than several alternative approaches.

* Accepted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021

Via

Access Paper or Ask Questions