Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sriram Sankaranarayanan

Optimal Abstractions for Verifying Properties of Kolmogorov-Arnold Networks (KANs)

Feb 06, 2026

Noah Schwartz, Chandra Kanth Nagesh, Sriram Sankaranarayanan, Ramneet Kaur, Tuhin Sahai, Susmit Jha

Abstract:We present a novel approach for verifying properties of Kolmogorov-Arnold Networks (KANs), a class of neural networks characterized by nonlinear, univariate activation functions typically implemented as piecewise polynomial splines or Gaussian processes. Our method creates mathematical ``abstractions'' by replacing each KAN unit with a piecewise affine (PWA) function, providing both local and global error estimates between the original network and its approximation. These abstractions enable property verification by encoding the problem as a Mixed Integer Linear Program (MILP), determining whether outputs satisfy specified properties when inputs belong to a given set. A critical challenge lies in balancing the number of pieces in the PWA approximation: too many pieces add binary variables that make verification computationally intractable, while too few pieces create excessive error margins that yield uninformative bounds. Our key contribution is a systematic framework that exploits KAN structure to find optimal abstractions. By combining dynamic programming at the unit level with a knapsack optimization across the network, we minimize the total number of pieces while guaranteeing specified error bounds. This approach determines the optimal approximation strategy for each unit while maintaining overall accuracy requirements. Empirical evaluation across multiple KAN benchmarks demonstrates that the upfront analysis costs of our method are justified by superior verification results.

Via

Access Paper or Ask Questions

Anticipating Oblivious Opponents in Stochastic Games

Sep 18, 2024

Shadi Tasdighi Kalat, Sriram Sankaranarayanan, Ashutosh Trivedi

Figure 1 for Anticipating Oblivious Opponents in Stochastic Games

Figure 2 for Anticipating Oblivious Opponents in Stochastic Games

Figure 3 for Anticipating Oblivious Opponents in Stochastic Games

Figure 4 for Anticipating Oblivious Opponents in Stochastic Games

Abstract:We present an approach for systematically anticipating the actions and policies employed by \emph{oblivious} environments in concurrent stochastic games, while maximizing a reward function. Our main contribution lies in the synthesis of a finite \emph{information state machine} whose alphabet ranges over the actions of the environment. Each state of the automaton is mapped to a belief state about the policy used by the environment. We introduce a notion of consistency that guarantees that the belief states tracked by our automaton stays within a fixed distance of the precise belief state obtained by knowledge of the full history. We provide methods for checking consistency of an automaton and a synthesis approach which upon successful termination yields such a machine. We show how the information state machine yields an MDP that serves as the starting point for computing optimal policies for maximizing a reward function defined over plays. We present an experimental evaluation over benchmark examples including human activity data for tasks such as cataract surgery and furniture assembly, wherein our approach successfully anticipates the policies and actions of the environment in order to maximize the reward.

Via

Access Paper or Ask Questions

Large Language Models Enable Automated Formative Feedback in Human-Robot Interaction Tasks

May 25, 2024

Emily Jensen, Sriram Sankaranarayanan, Bradley Hayes

Abstract:We claim that LLMs can be paired with formal analysis methods to provide accessible, relevant feedback for HRI tasks. While logic specifications are useful for defining and assessing a task, these representations are not easily interpreted by non-experts. Luckily, LLMs are adept at generating easy-to-understand text that explains difficult concepts. By integrating task assessment outcomes and other contextual information into an LLM prompt, we can effectively synthesize a useful set of recommendations for the learner to improve their performance.

* Presented at Human-LLM Interaction Workshop at HRI 2024

Via

Access Paper or Ask Questions

Automated Assessment and Adaptive Multimodal Formative Feedback Improves Psychomotor Skills Training Outcomes in Quadrotor Teleoperation

May 24, 2024

Emily Jensen, Sriram Sankaranarayanan, Bradley Hayes

Abstract:The workforce will need to continually upskill in order to meet the evolving demands of industry, especially working with robotic and autonomous systems. Current training methods are not scalable and do not adapt to the skills that learners already possess. In this work, we develop a system that automatically assesses learner skill in a quadrotor teleoperation task using temporal logic task specifications. This assessment is used to generate multimodal feedback based on the principles of effective formative feedback. Participants perceived the feedback positively. Those receiving formative feedback viewed the feedback as more actionable compared to receiving summary statistics. Participants in the multimodal feedback condition were more likely to achieve a safe landing and increased their safe landings more over the experiment compared to other feedback conditions. Finally, we identify themes to improve adaptive feedback and discuss and how training for complex psychomotor tasks can be integrated with learning theories.

* Under review at Human-Agent Interaction 2024 conference

Via

Access Paper or Ask Questions

Worst-Case Convergence Time of ML Algorithms via Extreme Value Theory

Apr 10, 2024

Saeid Tizpaz-Niari, Sriram Sankaranarayanan

Figure 1 for Worst-Case Convergence Time of ML Algorithms via Extreme Value Theory

Figure 2 for Worst-Case Convergence Time of ML Algorithms via Extreme Value Theory

Figure 3 for Worst-Case Convergence Time of ML Algorithms via Extreme Value Theory

Figure 4 for Worst-Case Convergence Time of ML Algorithms via Extreme Value Theory

Abstract:This paper leverages the statistics of extreme values to predict the worst-case convergence times of machine learning algorithms. Timing is a critical non-functional property of ML systems, and providing the worst-case converge times is essential to guarantee the availability of ML and its services. However, timing properties such as worst-case convergence times (WCCT) are difficult to verify since (1) they are not encoded in the syntax or semantics of underlying programming languages of AI, (2) their evaluations depend on both algorithmic implementations and underlying systems, and (3) their measurements involve uncertainty and noise. Therefore, prevalent formal methods and statistical models fail to provide rich information on the amounts and likelihood of WCCT. Our key observation is that the timing information we seek represents the extreme tail of execution times. Therefore, extreme value theory (EVT), a statistical discipline that focuses on understanding and predicting the distribution of extreme values in the tail of outcomes, provides an ideal framework to model and analyze WCCT in the training and inference phases of ML paradigm. Building upon the mathematical tools from EVT, we propose a practical framework to predict the worst-case timing properties of ML. Over a set of linear ML training algorithms, we show that EVT achieves a better accuracy for predicting WCCTs than relevant statistical methods such as the Bayesian factor. On the set of larger machine learning training algorithms and deep neural network inference, we show the feasibility and usefulness of EVT models to accurately predict WCCTs, their expected return periods, and their likelihood.

* In 3rd International Conference on AI Engineering: Software Engineering for AI (CAIN 2024)

Via

Access Paper or Ask Questions

Certifiably-correct Control Policies for Safe Learning and Adaptation in Assistive Robotics

Mar 12, 2023

Keyvan Majd, Geoffrey Clark, Tanmay Khandait, Siyu Zhou, Sriram Sankaranarayanan, Georgios Fainekos, Heni Ben Amor

Figure 1 for Certifiably-correct Control Policies for Safe Learning and Adaptation in Assistive Robotics

Figure 2 for Certifiably-correct Control Policies for Safe Learning and Adaptation in Assistive Robotics

Figure 3 for Certifiably-correct Control Policies for Safe Learning and Adaptation in Assistive Robotics

Figure 4 for Certifiably-correct Control Policies for Safe Learning and Adaptation in Assistive Robotics

Abstract:Guaranteeing safety in human-centric applications is critical in robot learning as the learned policies may demonstrate unsafe behaviors in formerly unseen scenarios. We present a framework to locally repair an erroneous policy network to satisfy a set of formal safety constraints using Mixed Integer Quadratic Programming (MIQP). Our MIQP formulation explicitly imposes the safety constraints to the learned policy while minimizing the original loss function. The policy network is then verified to be locally safe. We demonstrate the application of our framework to derive safe policies for a robotic lower-leg prosthesis.

* Appeared in the 36th Conference on Neural Information Processing Systems (NeurIPS) - Robot Learning Workshop. arXiv admin note: substantial text overlap with arXiv:2303.04431

Via

Access Paper or Ask Questions

Safe Robot Learning in Assistive Devices through Neural Network Repair

Mar 08, 2023

Keyvan Majd, Geoffrey Clark, Tanmay Khandait, Siyu Zhou, Sriram Sankaranarayanan, Georgios Fainekos, Heni Ben Amor

Figure 1 for Safe Robot Learning in Assistive Devices through Neural Network Repair

Figure 2 for Safe Robot Learning in Assistive Devices through Neural Network Repair

Figure 3 for Safe Robot Learning in Assistive Devices through Neural Network Repair

Figure 4 for Safe Robot Learning in Assistive Devices through Neural Network Repair

Abstract:Assistive robotic devices are a particularly promising field of application for neural networks (NN) due to the need for personalization and hard-to-model human-machine interaction dynamics. However, NN based estimators and controllers may produce potentially unsafe outputs over previously unseen data points. In this paper, we introduce an algorithm for updating NN control policies to satisfy a given set of formal safety constraints, while also optimizing the original loss function. Given a set of mixed-integer linear constraints, we define the NN repair problem as a Mixed Integer Quadratic Program (MIQP). In extensive experiments, we demonstrate the efficacy of our repair method in generating safe policies for a lower-leg prosthesis.

Via

Access Paper or Ask Questions

Mathematical Models of Human Drivers Using Artificial Risk Fields

May 24, 2022

Emily Jensen, Maya Luster, Hansol Yoon, Brandon Pitts, Sriram Sankaranarayanan

Figure 1 for Mathematical Models of Human Drivers Using Artificial Risk Fields

Figure 2 for Mathematical Models of Human Drivers Using Artificial Risk Fields

Figure 3 for Mathematical Models of Human Drivers Using Artificial Risk Fields

Figure 4 for Mathematical Models of Human Drivers Using Artificial Risk Fields

Abstract:In this paper, we use the concept of artificial risk fields to predict how human operators control a vehicle in response to upcoming road situations. A risk field assigns a non-negative risk measure to the state of the system in order to model how close that state is to violating a safety property, such as hitting an obstacle or exiting the road. Using risk fields, we construct a stochastic model of the operator that maps from states to likely actions. We demonstrate our approach on a driving task wherein human subjects are asked to drive a car inside a realistic driving simulator while avoiding obstacles placed on the road. We show that the most likely risk field given the driving data is obtained by solving a convex optimization problem. Next, we apply the inferred risk fields to generate distinct driving behaviors while comparing predicted trajectories against ground truth measurements. We observe that the risk fields are excellent at predicting future trajectory distributions with high prediction accuracy for up to twenty seconds prediction horizons. At the same time, we observe some challenges such as the inability to account for how drivers choose to accelerate/decelerate based on the road conditions.

* 8 pages, 4 figures, submitted to Intelligent Transportation Systems Conference

Via

Access Paper or Ask Questions

Local Repair of Neural Networks Using Optimization

Sep 28, 2021

Keyvan Majd, Siyu Zhou, Heni Ben Amor, Georgios Fainekos, Sriram Sankaranarayanan

Figure 1 for Local Repair of Neural Networks Using Optimization

Figure 2 for Local Repair of Neural Networks Using Optimization

Figure 3 for Local Repair of Neural Networks Using Optimization

Figure 4 for Local Repair of Neural Networks Using Optimization

Abstract:In this paper, we propose a framework to repair a pre-trained feed-forward neural network (NN) to satisfy a set of properties. We formulate the properties as a set of predicates that impose constraints on the output of NN over the target input domain. We define the NN repair problem as a Mixed Integer Quadratic Program (MIQP) to adjust the weights of a single layer subject to the given predicates while minimizing the original loss function over the original training domain. We demonstrate the application of our framework in bounding an affine transformation, correcting an erroneous NN in classification, and bounding the inputs of a NN controller.

Via

Access Paper or Ask Questions

Static analysis of ReLU neural networks with tropical polyhedra

Aug 23, 2021

Eric Goubault, Sébastien Palumby, Sylvie Putot, Louis Rustenholz, Sriram Sankaranarayanan

Figure 1 for Static analysis of ReLU neural networks with tropical polyhedra

Figure 2 for Static analysis of ReLU neural networks with tropical polyhedra

Figure 3 for Static analysis of ReLU neural networks with tropical polyhedra

Figure 4 for Static analysis of ReLU neural networks with tropical polyhedra

Abstract:This paper studies the problem of range analysis for feedforward neural networks, which is a basic primitive for applications such as robustness of neural networks, compliance to specifications and reachability analysis of neural-network feedback systems. Our approach focuses on ReLU (rectified linear unit) feedforward neural nets that present specific difficulties: approaches that exploit derivatives do not apply in general, the number of patterns of neuron activations can be quite large even for small networks, and convex approximations are generally too coarse. In this paper, we employ set-based methods and abstract interpretation that have been very successful in coping with similar difficulties in classical program verification. We present an approach that abstracts ReLU feedforward neural networks using tropical polyhedra. We show that tropical polyhedra can efficiently abstract ReLU activation function, while being able to control the loss of precision due to linear computations. We show how the connection between ReLU networks and tropical rational functions can provide approaches for range analysis of ReLU neural networks.

Via

Access Paper or Ask Questions