Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Amin Hosseininasab

Memory Efficient Tries for Sequential Pattern Mining

Feb 06, 2022

Amin Hosseininasab, Willem-Jan van Hoeve, Andre A. Cire

Figure 1 for Memory Efficient Tries for Sequential Pattern Mining

Figure 2 for Memory Efficient Tries for Sequential Pattern Mining

Figure 3 for Memory Efficient Tries for Sequential Pattern Mining

Figure 4 for Memory Efficient Tries for Sequential Pattern Mining

Abstract:The rapid and continuous growth of data has increased the need for scalable mining algorithms in unsupervised learning and knowledge discovery. In this paper, we focus on Sequential Pattern Mining (SPM), a fundamental topic in knowledge discovery that faces a well-known memory bottleneck. We examine generic dataset modeling techniques and show how they can be used to improve SPM algorithms in time and memory usage. In particular, we develop trie-based dataset models and associated mining algorithms that can represent as well as effectively mine orders of magnitude larger datasets compared to the state of the art. Numerical results on real-life large-size test instances show that our algorithms are also faster and more memory efficient in practice.

Via

Access Paper or Ask Questions

Constraint-based Sequential Pattern Mining with Decision Diagrams

Nov 14, 2018

Amin Hosseininasab, Willem-Jan van Hoeve, Andre A. Cire

Figure 1 for Constraint-based Sequential Pattern Mining with Decision Diagrams

Figure 2 for Constraint-based Sequential Pattern Mining with Decision Diagrams

Figure 3 for Constraint-based Sequential Pattern Mining with Decision Diagrams

Figure 4 for Constraint-based Sequential Pattern Mining with Decision Diagrams

Abstract:Constrained sequential pattern mining aims at identifying frequent patterns on a sequential database of items while observing constraints defined over the item attributes. We introduce novel techniques for constraint-based sequential pattern mining that rely on a multi-valued decision diagram representation of the database. Specifically, our representation can accommodate multiple item attributes and various constraint types, including a number of non-monotone constraints. To evaluate the applicability of our approach, we develop an MDD-based prefix-projection algorithm and compare its performance against a typical generate-and-check variant, as well as a state-of-the-art constraint-based sequential pattern mining algorithm. Results show that our approach is competitive with or superior to these other methods in terms of scalability and efficiency.

* AAAI2019

Via

Access Paper or Ask Questions