TY - JOUR
T1 - Evolution-based tabu search approach to automatic clustering
AU - Pan, Shih Ming
AU - Cheng, Kuo Sheng
N1 - Funding Information:
Manuscript received February 1, 2005; revised September 19, 2005. This work was supported by the National Science Council, Taiwan, R.O.C., under Grant NSC92-2218-E006-068. This paper was recommended by Associate Editor M. Lim.
PY - 2007/9
Y1 - 2007/9
N2 - Traditional clustering algorithms (e.g., the K-means algorithm and its variants) are used only for a fixed number of clusters. However, in many clustering applications, the actual number of clusters is unknown beforehand. The general solution to this type of a clustering problem is that one selects or defines a cluster validity index and performs a traditional clustering algorithm for all possible numbers of clusters in sequence to find the clustering with the best cluster validity. This is tedious and time-consuming work. To easily and effectively determine the optimal number of clusters and, at the same time, construct the clusters with good validity, we propose a framework of automatic clustering algorithms (called ETSAs) that do not require users to give each possible value of required parameters (including the number of clusters). ETSAs treat the number of clusters as a variable, and evolve it to an optimal number. Through experiments conducted on nine test data sets, we compared the ETSA with five traditional clustering algorithms. We demonstrate the superiority of the ETSA in finding the correct number of clusters while constructing clusters with good validity.
AB - Traditional clustering algorithms (e.g., the K-means algorithm and its variants) are used only for a fixed number of clusters. However, in many clustering applications, the actual number of clusters is unknown beforehand. The general solution to this type of a clustering problem is that one selects or defines a cluster validity index and performs a traditional clustering algorithm for all possible numbers of clusters in sequence to find the clustering with the best cluster validity. This is tedious and time-consuming work. To easily and effectively determine the optimal number of clusters and, at the same time, construct the clusters with good validity, we propose a framework of automatic clustering algorithms (called ETSAs) that do not require users to give each possible value of required parameters (including the number of clusters). ETSAs treat the number of clusters as a variable, and evolve it to an optimal number. Through experiments conducted on nine test data sets, we compared the ETSA with five traditional clustering algorithms. We demonstrate the superiority of the ETSA in finding the correct number of clusters while constructing clusters with good validity.
UR - http://www.scopus.com/inward/record.url?scp=34548260888&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34548260888&partnerID=8YFLogxK
U2 - 10.1109/TSMCC.2007.900666
DO - 10.1109/TSMCC.2007.900666
M3 - Article
AN - SCOPUS:34548260888
SN - 1094-6977
VL - 37
SP - 827
EP - 838
JO - IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews
JF - IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews
IS - 5
ER -