Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sylvie Thiébaux

Learning Lifted Action Models from Unsupervised Visual Traces

Apr 21, 2026

Kai Xi, Stephen Gould, Sylvie Thiébaux

Abstract:Efficient construction of models capturing the preconditions and effects of actions is essential for applying AI planning in real-world domains. Extensive prior work has explored learning such models from high-level descriptions of state and/or action sequences. In this paper, we tackle a more challenging setting: learning lifted action models from sequences of state images, without action observation. We propose a deep learning framework that jointly learns state prediction, action prediction, and a lifted action model. We also introduce a mixed-integer linear program (MILP) to prevent prediction collapse and self-reinforcing errors among predictions. The MILP takes the predicted states, actions, and action model over a subset of traces and solves for logically consistent states, actions, and action model that are as close as possible to the original predictions. Pseudo-labels extracted from the MILP solution are then used to guide further training. Experiments across multiple domains show that integrating MILP-based correction helps the model escape local optima and converge toward globally consistent solutions.

* Accepted to the 36th International Conference on Automated Planning and Scheduling (ICAPS-26)

Via

Access Paper or Ask Questions

On the Ability of Transformers to Verify Plans

Mar 20, 2026

Yash Sarrof, Yupei Du, Katharina Stein, Alexander Koller, Sylvie Thiébaux, Michael Hahn

Abstract:Transformers have shown inconsistent success in AI planning tasks, and theoretical understanding of when generalization should be expected has been limited. We take important steps towards addressing this gap by analyzing the ability of decoder-only models to verify whether a given plan correctly solves a given planning instance. To analyse the general setting where the number of objects -- and thus the effective input alphabet -- grows at test time, we introduce C*-RASP, an extension of C-RASP designed to establish length generalization guarantees for transformers under the simultaneous growth in sequence length and vocabulary size. Our results identify a large class of classical planning domains for which transformers can provably learn to verify long plans, and structural properties that significantly affects the learnability of length generalizable solutions. Empirical experiments corroborate our theory.

Via

Access Paper or Ask Questions

Exploring Plan Space through Conversation: An Agentic Framework for LLM-Mediated Explanations in Planning

Mar 02, 2026

Guilhem Fouilhé, Rebecca Eifler, Antonin Poché, Sylvie Thiébaux, Nicholas Asher

Abstract:When automating plan generation for a real-world sequential decision problem, the goal is often not to replace the human planner, but to facilitate an iterative reasoning and elicitation process, where the human's role is to guide the AI planner according to their preferences and expertise. In this context, explanations that respond to users' questions are crucial to improve their understanding of potential solutions and increase their trust in the system. To enable natural interaction with such a system, we present a multi-agent Large Language Model (LLM) architecture that is agnostic to the explanation framework and enables user- and context-dependent interactive explanations. We also describe an instantiation of this framework for goal-conflict explanations, which we use to conduct a user study comparing the LLM-powered interaction with a baseline template-based explanation interface.

* Preprint

Via

Access Paper or Ask Questions

AI Planning: A Primer and Survey (Preliminary Report)

Dec 07, 2024

Dillon Z. Chen, Pulkit Verma, Siddharth Srivastava, Michael Katz, Sylvie Thiébaux

Figure 1 for AI Planning: A Primer and Survey (Preliminary Report)

Abstract:Automated decision-making is a fundamental topic that spans multiple sub-disciplines in AI: reinforcement learning (RL), AI planning (AP), foundation models, and operations research, among others. Despite recent efforts to ``bridge the gaps'' between these communities, there remain many insights that have not yet transcended the boundaries. Our goal in this paper is to provide a brief and non-exhaustive primer on ideas well-known in AP, but less so in other sub-disciplines. We do so by introducing the classical AP problem and representation, and extensions that handle uncertainty and time through the Markov Decision Process formalism. Next, we survey state-of-the-art techniques and ideas for solving AP problems, focusing on their ability to exploit problem structure. Lastly, we cover subfields within AP for learning structure from unstructured inputs and learning to generalise to unseen scenarios and situations.

Via

Access Paper or Ask Questions

Graph Learning for Planning: The Story Thus Far and Open Challenges

Dec 03, 2024

Dillon Z. Chen, Mingyu Hao, Sylvie Thiébaux, Felipe Trevizan

Figure 1 for Graph Learning for Planning: The Story Thus Far and Open Challenges

Figure 2 for Graph Learning for Planning: The Story Thus Far and Open Challenges

Figure 3 for Graph Learning for Planning: The Story Thus Far and Open Challenges

Figure 4 for Graph Learning for Planning: The Story Thus Far and Open Challenges

Abstract:Graph learning is naturally well suited for use in planning due to its ability to exploit relational structures exhibited in planning domains and to take as input planning instances with arbitrary number of objects. In this paper, we study the usage of graph learning for planning thus far by studying the theoretical and empirical effects on learning and planning performance of (1) graph representations of planning tasks, (2) graph learning architectures, and (3) optimisation formulations for learning. Our studies accumulate in the GOOSE framework which learns domain knowledge from small planning tasks in order to scale up to much larger planning tasks. In this paper, we also highlight and propose the 5 open challenges in the general Learning for Planning field that we believe need to be addressed for advancing the state-of-the-art.

Via

Access Paper or Ask Questions

Graph Learning for Numeric Planning

Oct 31, 2024

Dillon Z. Chen, Sylvie Thiébaux

Figure 1 for Graph Learning for Numeric Planning

Figure 2 for Graph Learning for Numeric Planning

Figure 3 for Graph Learning for Numeric Planning

Figure 4 for Graph Learning for Numeric Planning

Abstract:Graph learning is naturally well suited for use in symbolic, object-centric planning due to its ability to exploit relational structures exhibited in planning domains and to take as input planning instances with arbitrary numbers of objects. Numeric planning is an extension of symbolic planning in which states may now also exhibit numeric variables. In this work, we propose data-efficient and interpretable machine learning models for learning to solve numeric planning tasks. This involves constructing a new graph kernel for graphs with both continuous and categorical attributes, as well as new optimisation methods for learning heuristic functions for numeric planning. Experiments show that our graph kernels are vastly more efficient and generalise better than graph neural networks for numeric planning, and also yield competitive coverage performance compared to domain-independent numeric planners. Code is available at https://github.com/DillonZChen/goose

* Extended version of NeurIPS 2024 paper

Via

Access Paper or Ask Questions

Novelty Heuristics, Multi-Queue Search, and Portfolios for Numeric Planning

Apr 11, 2024

Dillon Z. Chen, Sylvie Thiébaux

Figure 1 for Novelty Heuristics, Multi-Queue Search, and Portfolios for Numeric Planning

Figure 2 for Novelty Heuristics, Multi-Queue Search, and Portfolios for Numeric Planning

Figure 3 for Novelty Heuristics, Multi-Queue Search, and Portfolios for Numeric Planning

Figure 4 for Novelty Heuristics, Multi-Queue Search, and Portfolios for Numeric Planning

Abstract:Heuristic search is a powerful approach for solving planning problems and numeric planning is no exception. In this paper, we boost the performance of heuristic search for numeric planning with various powerful techniques orthogonal to improving heuristic informedness: numeric novelty heuristics, the Manhattan distance heuristic, and exploring the use of multi-queue search and portfolios for combining heuristics.

* Extended version of SoCS 2024 paper

Via

Access Paper or Ask Questions

Return to Tradition: Learning Reliable Heuristics with Classical Machine Learning

Mar 25, 2024

Dillon Z. Chen, Felipe Trevizan, Sylvie Thiébaux

Figure 1 for Return to Tradition: Learning Reliable Heuristics with Classical Machine Learning

Figure 2 for Return to Tradition: Learning Reliable Heuristics with Classical Machine Learning

Figure 3 for Return to Tradition: Learning Reliable Heuristics with Classical Machine Learning

Figure 4 for Return to Tradition: Learning Reliable Heuristics with Classical Machine Learning

Abstract:Current approaches for learning for planning have yet to achieve competitive performance against classical planners in several domains, and have poor overall performance. In this work, we construct novel graph representations of lifted planning tasks and use the WL algorithm to generate features from them. These features are used with classical machine learning methods which have up to 2 orders of magnitude fewer parameters and train up to 3 orders of magnitude faster than the state-of-the-art deep learning for planning models. Our novel approach, WL-GOOSE, reliably learns heuristics from scratch and outperforms the $h^{\text{FF}}$ heuristic in a fair competition setting. It also outperforms or ties with LAMA on 4 out of 10 domains on coverage and 7 out of 10 domains on plan quality. WL-GOOSE is the first learning for planning model which achieves these feats. Furthermore, we study the connections between our novel WL feature generation method, previous theoretically flavoured learning architectures, and Description Logic Features for planning.

* Extended version of ICAPS 2024 paper

Via

Access Paper or Ask Questions

Learning Domain-Independent Heuristics for Grounded and Lifted Planning

Dec 20, 2023

Dillon Z. Chen, Sylvie Thiébaux, Felipe Trevizan

Figure 1 for Learning Domain-Independent Heuristics for Grounded and Lifted Planning

Figure 2 for Learning Domain-Independent Heuristics for Grounded and Lifted Planning

Figure 3 for Learning Domain-Independent Heuristics for Grounded and Lifted Planning

Figure 4 for Learning Domain-Independent Heuristics for Grounded and Lifted Planning

Abstract:We present three novel graph representations of planning tasks suitable for learning domain-independent heuristics using Graph Neural Networks (GNNs) to guide search. In particular, to mitigate the issues caused by large grounded GNNs we present the first method for learning domain-independent heuristics with only the lifted representation of a planning task. We also provide a theoretical analysis of the expressiveness of our models, showing that some are more powerful than STRIPS-HGN, the only other existing model for learning domain-independent heuristics. Our experiments show that our heuristics generalise to much larger problems than those in the training set, vastly surpassing STRIPS-HGN heuristics.

* Extended version of AAAI 2024 paper

Via

Access Paper or Ask Questions

A More General Theory of Diagnosis from First Principles

Sep 28, 2023

Alban Grastien, Patrik Haslum, Sylvie Thiébaux

Figure 1 for A More General Theory of Diagnosis from First Principles

Figure 2 for A More General Theory of Diagnosis from First Principles

Figure 3 for A More General Theory of Diagnosis from First Principles

Figure 4 for A More General Theory of Diagnosis from First Principles

Abstract:Model-based diagnosis has been an active research topic in different communities including artificial intelligence, formal methods, and control. This has led to a set of disparate approaches addressing different classes of systems and seeking different forms of diagnoses. In this paper, we resolve such disparities by generalising Reiter's theory to be agnostic to the types of systems and diagnoses considered. This more general theory of diagnosis from first principles defines the minimal diagnosis as the set of preferred diagnosis candidates in a search space of hypotheses. Computing the minimal diagnosis is achieved by exploring the space of diagnosis hypotheses, testing sets of hypotheses for consistency with the system's model and the observation, and generating conflicts that rule out successors and other portions of the search space. Under relatively mild assumptions, our algorithms correctly compute the set of preferred diagnosis candidates. The main difficulty here is that the search space is no longer a powerset as in Reiter's theory, and that, as consequence, many of the implicit properties (such as finiteness of the search space) no longer hold. The notion of conflict also needs to be generalised and we present such a more general notion. We present two implementations of these algorithms, using test solvers based on satisfiability and heuristic search, respectively, which we evaluate on instances from two real world discrete event problems. Despite the greater generality of our theory, these implementations surpass the special purpose algorithms designed for discrete event systems, and enable solving instances that were out of reach of existing diagnosis approaches.

Via

Access Paper or Ask Questions