Resource-aware mining with variable granularities in data streams

Wei-Guang Teng, Ming Syan Chen, Philip S. Yu

Research output: Contribution to conferencePaper

12 Citations (Scopus)

Abstract

For data stream applications, both approximation and adaptability are important issues for effective mining. We explore in this paper a fundamental problem that how the limited resources, e.g., memory space and computation power, can be well utilized to produce accurate estimates. Two important features for tracking mined patterns with properly utilized resources are examined. The first issue is temporal granularity which refers to the phenomenon that as time advances, people are more interested in recent events, meaning that more resources can be utilized to explore more recent data with finer granularities. Second, with the mining task of discovering frequent temporal patterns, more resources are expected to be allocated to the processing of those borderline patterns whose statistics, e.g., occurrence frequencies, are close to the specified threshold so as to have proper frequent itemset identification. This feature is called mining with support count granularity. Consequently, algorithm RAM-DS (Resource-Aware Mining for Data Streams) is designed to not only reduce the memory required for data storage but also retain good approximation of target time series. Experimental results have shown that the memory required for storing significant wavelet coefficients is very small and the quality of approximation is stable when performing incremental data updates, indicating that algorithm RAM-DS is feasible and suitable for adaptive mining in data streams.

Original languageEnglish
Pages527-531
Number of pages5
Publication statusPublished - 2004 Jan 1
EventProceedings of the Fourth SIAM International Conference on Data Mining - Lake Buena Vista, FL, United States
Duration: 2004 Apr 222004 Apr 24

Other

OtherProceedings of the Fourth SIAM International Conference on Data Mining
CountryUnited States
CityLake Buena Vista, FL
Period04-04-2204-04-24

All Science Journal Classification (ASJC) codes

  • Mathematics(all)

Fingerprint Dive into the research topics of 'Resource-aware mining with variable granularities in data streams'. Together they form a unique fingerprint.

  • Cite this

    Teng, W-G., Chen, M. S., & Yu, P. S. (2004). Resource-aware mining with variable granularities in data streams. 527-531. Paper presented at Proceedings of the Fourth SIAM International Conference on Data Mining, Lake Buena Vista, FL, United States.