PSP-AMS: Progressive mining of sequential patterns across multiple streams

Bijay Prasad Jaysawal, Jen Wei Huang

Research output: Contribution to journal › Article › peer-review

10 Citations (Scopus)

Abstract

Sequential pattern mining is used to find frequent data sequences over time. When sequential patterns are generated, newly arriving patterns may not be identified as frequent sequential patterns because of the presence of old data and sequences. Progressive sequential pattern mining aims to find the most up-to-date sequential patterns, given that obsolete items are deleted from the sequences. When sequences arrive over multiple data streams, it is difficult to maintain and update the current sequential patterns. Even worse, when sequences across multiple streams are considered, previous methods cannot efficiently compute the frequent sequential patterns. In this work, we propose an efficient algorithm, PSP-AMS, to address this problem. PSP-AMS uses a novel data structure, the PSP-MS-tree, to insert new items, update current items, and delete obsolete items. By maintaining a PSP-MS-tree, PSP-AMS efficiently finds the frequent sequential patterns across multiple streams. The experimental results show that PSP-AMS significantly outperforms previous algorithms for progressive mining of sequential patterns across multiple streams on both synthetic and real data.
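The progressive setting described in the abstract can be illustrated with a deliberately naive sketch. This is not PSP-AMS and does not use a PSP-MS-tree: it recounts from scratch on every call and handles only length-2 patterns. The function name `frequent_pairs`, the `(sequence_id, timestamp, item)` event layout, and the `window` and `min_support` parameters are assumptions made for illustration, not taken from the paper.

```python
from collections import defaultdict
from itertools import combinations


def frequent_pairs(events, now, window, min_support):
    """Naive progressive mining of length-2 sequential patterns.

    events: iterable of (sequence_id, timestamp, item) tuples (hypothetical layout).
    Returns the set of ordered pairs (a, b) such that a occurs strictly
    before b, both inside the window (now - window, now], in at least
    min_support distinct sequences.
    """
    # Delete obsolete items: keep only events inside the current window.
    recent = defaultdict(list)
    for seq_id, ts, item in sorted(events, key=lambda e: e[1]):
        if now - window < ts <= now:
            recent[seq_id].append((ts, item))

    # For each candidate pattern, collect the sequences that contain it.
    support = defaultdict(set)
    for seq_id, items in recent.items():
        for (t1, a), (t2, b) in combinations(items, 2):
            if t1 < t2:  # require strict temporal order
                support[(a, b)].add(seq_id)

    return {pair for pair, ids in support.items() if len(ids) >= min_support}
```

Shrinking `window` drops older events, so a pattern that was frequent over a long window can disappear and a recent one can surface. An incremental structure such as the one the abstract describes would avoid the full recount that this sketch performs each time.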

Original language: English
Article number: a5
Journal: ACM Transactions on Knowledge Discovery from Data
Volume: 13
Issue number: 1
Publication status: Published - 2018 Dec

All Science Journal Classification (ASJC) codes

  • General Computer Science
