An efficient data preprocessing procedure for support vector clustering

Jeen Shing Wang, Jen Chieh Chiang

Research output: Contribution to journalArticlepeer-review

9 Citations (Scopus)


This paper presents an efficient data preprocessing procedure for the support of vector clustering (SVC) to reduce the size of a training dataset. Solving the optimization problem and labeling the data points with cluster labels are time-consuming in the SVC training procedure. This makes using SVC to process large datasets inefficient. We proposed a data preprocessing procedure to solve the problem. The procedure contains a shared nearest neighbor (SNN) algorithm, and utilizes the concept of unit vectors for eliminating insignificant data points from the dataset. Computer simulations have been conducted on artificial and benchmark datasets to demonstrate the effectiveness of the proposed method.

Original languageEnglish
Pages (from-to)705-721
Number of pages17
JournalJournal of Universal Computer Science
Issue number4
Publication statusPublished - 2009 Jul 15

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'An efficient data preprocessing procedure for support vector clustering'. Together they form a unique fingerprint.

Cite this