SOHUPDS: A single-pass one-phase algorithm for mining high utility patterns over a data stream

Bijay Prasad Jaysawal, Jen Wei Huang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Abstract

High utility pattern mining has emerged to overcome the limitation of frequent pattern mining where only frequency is taken as importance without considering the actual importance of items. Existing algorithms for mining high utility patterns over a data stream are two-phase algorithms that are not scalable due to the large number of candidates generation in the first phase, particularly when the minimum utility threshold is low. Moreover, in the second phase, the algorithm needs to scan the database again to find out actual utility for candidates. In this paper, we propose a novel algorithm SOHUPDS to mine high utility patterns over a data stream with the sliding window technique using the projected database approach. In addition, we propose a data structure IUDataListSW, which stores utility and upper-bound values of the items in the current sliding window. Moreover, IUDataListSW stores position of items in the transaction to get the initial projected database of items efficiently. Furthermore, we propose an update strategy to utilize mined high utility patterns from the previous sliding window to update high utility patterns in the current sliding window. Therefore, SOHUPDS is able to mine high utility patterns over a data stream in a single pass and one phase. Experimental results illustrate that SOHUPDS is more efficient than the state-of-the-art algorithms in terms of execution time as well as memory usage.

Original languageEnglish
Title of host publication35th Annual ACM Symposium on Applied Computing, SAC 2020
PublisherAssociation for Computing Machinery
Pages490-497
Number of pages8
ISBN (Electronic)9781450368667
DOIs
Publication statusPublished - 2020 Mar 30
Event35th Annual ACM Symposium on Applied Computing, SAC 2020 - Brno, Czech Republic
Duration: 2020 Mar 302020 Apr 3

Publication series

NameProceedings of the ACM Symposium on Applied Computing

Conference

Conference35th Annual ACM Symposium on Applied Computing, SAC 2020
Country/TerritoryCzech Republic
CityBrno
Period20-03-3020-04-03

All Science Journal Classification (ASJC) codes

  • Software

Fingerprint

Dive into the research topics of 'SOHUPDS: A single-pass one-phase algorithm for mining high utility patterns over a data stream'. Together they form a unique fingerprint.

Cite this