TY - GEN
T1 - SOHUPDS
T2 - 35th Annual ACM Symposium on Applied Computing, SAC 2020
AU - Jaysawal, Bijay Prasad
AU - Huang, Jen Wei
N1 - Publisher Copyright:
© 2020 ACM.
PY - 2020/3/30
Y1 - 2020/3/30
N2 - High utility pattern mining has emerged to overcome the limitation of frequent pattern mining where only frequency is taken as importance without considering the actual importance of items. Existing algorithms for mining high utility patterns over a data stream are two-phase algorithms that are not scalable due to the large number of candidates generation in the first phase, particularly when the minimum utility threshold is low. Moreover, in the second phase, the algorithm needs to scan the database again to find out actual utility for candidates. In this paper, we propose a novel algorithm SOHUPDS to mine high utility patterns over a data stream with the sliding window technique using the projected database approach. In addition, we propose a data structure IUDataListSW, which stores utility and upper-bound values of the items in the current sliding window. Moreover, IUDataListSW stores position of items in the transaction to get the initial projected database of items efficiently. Furthermore, we propose an update strategy to utilize mined high utility patterns from the previous sliding window to update high utility patterns in the current sliding window. Therefore, SOHUPDS is able to mine high utility patterns over a data stream in a single pass and one phase. Experimental results illustrate that SOHUPDS is more efficient than the state-of-the-art algorithms in terms of execution time as well as memory usage.
AB - High utility pattern mining has emerged to overcome the limitation of frequent pattern mining where only frequency is taken as importance without considering the actual importance of items. Existing algorithms for mining high utility patterns over a data stream are two-phase algorithms that are not scalable due to the large number of candidates generation in the first phase, particularly when the minimum utility threshold is low. Moreover, in the second phase, the algorithm needs to scan the database again to find out actual utility for candidates. In this paper, we propose a novel algorithm SOHUPDS to mine high utility patterns over a data stream with the sliding window technique using the projected database approach. In addition, we propose a data structure IUDataListSW, which stores utility and upper-bound values of the items in the current sliding window. Moreover, IUDataListSW stores position of items in the transaction to get the initial projected database of items efficiently. Furthermore, we propose an update strategy to utilize mined high utility patterns from the previous sliding window to update high utility patterns in the current sliding window. Therefore, SOHUPDS is able to mine high utility patterns over a data stream in a single pass and one phase. Experimental results illustrate that SOHUPDS is more efficient than the state-of-the-art algorithms in terms of execution time as well as memory usage.
UR - http://www.scopus.com/inward/record.url?scp=85083024714&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85083024714&partnerID=8YFLogxK
U2 - 10.1145/3341105.3373928
DO - 10.1145/3341105.3373928
M3 - Conference contribution
AN - SCOPUS:85083024714
T3 - Proceedings of the ACM Symposium on Applied Computing
SP - 490
EP - 497
BT - 35th Annual ACM Symposium on Applied Computing, SAC 2020
PB - Association for Computing Machinery
Y2 - 30 March 2020 through 3 April 2020
ER -