A parallel elastic net clustering algorithm

Tzu Yi Feng, Chun Wei Tsai, Ming Chao Chiang, Chu-Sing Yang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

The elastic net clustering algorithm (ENCA) can typically provide an effective way for classifying non-linearly separable data. However, the computation time it takes will be significantly increased for large datasets. To deal with this issue, a parallel version of the ENCA, built on the Apache Spark framework, called parallel elastic net clustering algorithm (PENCA), is presented in this paper. To evaluate the performance of the proposed algorithm, it is compared with ENCA and two well-known clustering algorithms, k-means and genetic k-means algorithm (GKA). The results show that PENCA not only outperforms k-means and GKA in terms of the accuracy rate, it also provides an efficient way to reduce the response time of ENCA-based clustering algorithms for large-scale datasets.

Original languageEnglish
Title of host publicationProceedings - 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages40-45
Number of pages6
ISBN (Print)9781538685426
DOIs
Publication statusPublished - 2018 Sep 13
Event2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018 - Xi'an, China
Duration: 2018 Aug 172018 Aug 19

Publication series

NameProceedings - 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018

Other

Other2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018
CountryChina
CityXi'an
Period18-08-1718-08-19

Fingerprint

Clustering algorithms
Clustering algorithm
Electric sparks
K-means

All Science Journal Classification (ASJC) codes

  • Information Systems and Management
  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications

Cite this

Feng, T. Y., Tsai, C. W., Chiang, M. C., & Yang, C-S. (2018). A parallel elastic net clustering algorithm. In Proceedings - 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018 (pp. 40-45). [8465523] (Proceedings - 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/SmartIoT.2018.00017
Feng, Tzu Yi ; Tsai, Chun Wei ; Chiang, Ming Chao ; Yang, Chu-Sing. / A parallel elastic net clustering algorithm. Proceedings - 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018. Institute of Electrical and Electronics Engineers Inc., 2018. pp. 40-45 (Proceedings - 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018).
@inproceedings{2099b55654494cdd90b6e5713db39a04,
title = "A parallel elastic net clustering algorithm",
abstract = "The elastic net clustering algorithm (ENCA) can typically provide an effective way for classifying non-linearly separable data. However, the computation time it takes will be significantly increased for large datasets. To deal with this issue, a parallel version of the ENCA, built on the Apache Spark framework, called parallel elastic net clustering algorithm (PENCA), is presented in this paper. To evaluate the performance of the proposed algorithm, it is compared with ENCA and two well-known clustering algorithms, k-means and genetic k-means algorithm (GKA). The results show that PENCA not only outperforms k-means and GKA in terms of the accuracy rate, it also provides an efficient way to reduce the response time of ENCA-based clustering algorithms for large-scale datasets.",
author = "Feng, {Tzu Yi} and Tsai, {Chun Wei} and Chiang, {Ming Chao} and Chu-Sing Yang",
year = "2018",
month = "9",
day = "13",
doi = "10.1109/SmartIoT.2018.00017",
language = "English",
isbn = "9781538685426",
series = "Proceedings - 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "40--45",
booktitle = "Proceedings - 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018",
address = "United States",

}

Feng, TY, Tsai, CW, Chiang, MC & Yang, C-S 2018, A parallel elastic net clustering algorithm. in Proceedings - 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018., 8465523, Proceedings - 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018, Institute of Electrical and Electronics Engineers Inc., pp. 40-45, 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018, Xi'an, China, 18-08-17. https://doi.org/10.1109/SmartIoT.2018.00017

A parallel elastic net clustering algorithm. / Feng, Tzu Yi; Tsai, Chun Wei; Chiang, Ming Chao; Yang, Chu-Sing.

Proceedings - 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018. Institute of Electrical and Electronics Engineers Inc., 2018. p. 40-45 8465523 (Proceedings - 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - A parallel elastic net clustering algorithm

AU - Feng, Tzu Yi

AU - Tsai, Chun Wei

AU - Chiang, Ming Chao

AU - Yang, Chu-Sing

PY - 2018/9/13

Y1 - 2018/9/13

N2 - The elastic net clustering algorithm (ENCA) can typically provide an effective way for classifying non-linearly separable data. However, the computation time it takes will be significantly increased for large datasets. To deal with this issue, a parallel version of the ENCA, built on the Apache Spark framework, called parallel elastic net clustering algorithm (PENCA), is presented in this paper. To evaluate the performance of the proposed algorithm, it is compared with ENCA and two well-known clustering algorithms, k-means and genetic k-means algorithm (GKA). The results show that PENCA not only outperforms k-means and GKA in terms of the accuracy rate, it also provides an efficient way to reduce the response time of ENCA-based clustering algorithms for large-scale datasets.

AB - The elastic net clustering algorithm (ENCA) can typically provide an effective way for classifying non-linearly separable data. However, the computation time it takes will be significantly increased for large datasets. To deal with this issue, a parallel version of the ENCA, built on the Apache Spark framework, called parallel elastic net clustering algorithm (PENCA), is presented in this paper. To evaluate the performance of the proposed algorithm, it is compared with ENCA and two well-known clustering algorithms, k-means and genetic k-means algorithm (GKA). The results show that PENCA not only outperforms k-means and GKA in terms of the accuracy rate, it also provides an efficient way to reduce the response time of ENCA-based clustering algorithms for large-scale datasets.

UR - http://www.scopus.com/inward/record.url?scp=85054480764&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85054480764&partnerID=8YFLogxK

U2 - 10.1109/SmartIoT.2018.00017

DO - 10.1109/SmartIoT.2018.00017

M3 - Conference contribution

AN - SCOPUS:85054480764

SN - 9781538685426

T3 - Proceedings - 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018

SP - 40

EP - 45

BT - Proceedings - 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018

PB - Institute of Electrical and Electronics Engineers Inc.

ER -

Feng TY, Tsai CW, Chiang MC, Yang C-S. A parallel elastic net clustering algorithm. In Proceedings - 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018. Institute of Electrical and Electronics Engineers Inc. 2018. p. 40-45. 8465523. (Proceedings - 2018 IEEE International Conference on Smart Internet of Things, SmartIoT 2018). https://doi.org/10.1109/SmartIoT.2018.00017