Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Marcus Dees

Integrating Domain Knowledge into Process Discovery Using Large Language Models

Oct 08, 2025

Ali Norouzifar, Humam Kourani, Marcus Dees, Wil van der Aalst

Abstract:Process discovery aims to derive process models from event logs, providing insights into operational behavior and forming a foundation for conformance checking and process improvement. However, models derived solely from event data may not accurately reflect the real process, as event logs are often incomplete or affected by noise, and domain knowledge, an important complementary resource, is typically disregarded. As a result, the discovered models may lack reliability for downstream tasks. We propose an interactive framework that incorporates domain knowledge, expressed in natural language, into the process discovery pipeline using Large Language Models (LLMs). Our approach leverages LLMs to extract declarative rules from textual descriptions provided by domain experts. These rules are used to guide the IMr discovery algorithm, which recursively constructs process models by combining insights from both the event log and the extracted rules, helping to avoid problematic process structures that contradict domain knowledge. The framework coordinates interactions among the LLM, domain experts, and a set of backend services. We present a fully implemented tool that supports this workflow and conduct an extensive evaluation of multiple LLMs and prompt engineering strategies. Our empirical study includes a case study based on a real-life event log with the involvement of domain experts, who assessed the usability and effectiveness of the framework.

* This paper is currently under review for publication in a journal

Via

Access Paper or Ask Questions

Bridging Domain Knowledge and Process Discovery Using Large Language Models

Aug 30, 2024

Ali Norouzifar, Humam Kourani, Marcus Dees, Wil van der Aalst

Figure 1 for Bridging Domain Knowledge and Process Discovery Using Large Language Models

Figure 2 for Bridging Domain Knowledge and Process Discovery Using Large Language Models

Figure 3 for Bridging Domain Knowledge and Process Discovery Using Large Language Models

Figure 4 for Bridging Domain Knowledge and Process Discovery Using Large Language Models

Abstract:Discovering good process models is essential for different process analysis tasks such as conformance checking and process improvements. Automated process discovery methods often overlook valuable domain knowledge. This knowledge, including insights from domain experts and detailed process documentation, remains largely untapped during process discovery. This paper leverages Large Language Models (LLMs) to integrate such knowledge directly into process discovery. We use rules derived from LLMs to guide model construction, ensuring alignment with both domain knowledge and actual process executions. By integrating LLMs, we create a bridge between process knowledge expressed in natural language and the discovery of robust process models, advancing process discovery methodologies significantly. To showcase the usability of our framework, we conducted a case study with the UWV employee insurance agency, demonstrating its practical benefits and effectiveness.

* This paper is accepted at the AI4BPM 2024 workshop and to be published in their proceedings

Via

Access Paper or Ask Questions