TY - JOUR
T1 - Two-stage classification methods for microarray data
AU - Wong, Tzu Tsung
AU - Hsu, Ching Han
PY - 2008/1
Y1 - 2008/1
N2 - Gene expression data are a key factor for the success of medical diagnosis, and two-stage classification methods are therefore developed for processing microarray data. The first stage for this kind of classification methods is to select a pre-specified number of genes, which are likely to be the most relevant to the occurrence of a disease, and passes these genes to the second stage for classification. In this paper, we use four gene selection mechanisms and two classification tools to compose eight two-stage classification methods, and test these eight methods on eight microarray data sets for analyzing their performance. The first interesting finding is that the genes chosen by different categories of gene selection mechanisms are less than half in common but result in insignificantly different classification accuracies. A subset-gene-ranking mechanism can be beneficial in classification accuracy, but its computational effort is much heavier. Whether the classification tool employed at the second stage should be accompanied with a dimension reduction technique depends on the characteristics of a data set.
AB - Gene expression data are a key factor for the success of medical diagnosis, and two-stage classification methods are therefore developed for processing microarray data. The first stage for this kind of classification methods is to select a pre-specified number of genes, which are likely to be the most relevant to the occurrence of a disease, and passes these genes to the second stage for classification. In this paper, we use four gene selection mechanisms and two classification tools to compose eight two-stage classification methods, and test these eight methods on eight microarray data sets for analyzing their performance. The first interesting finding is that the genes chosen by different categories of gene selection mechanisms are less than half in common but result in insignificantly different classification accuracies. A subset-gene-ranking mechanism can be beneficial in classification accuracy, but its computational effort is much heavier. Whether the classification tool employed at the second stage should be accompanied with a dimension reduction technique depends on the characteristics of a data set.
UR - http://www.scopus.com/inward/record.url?scp=34248550574&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34248550574&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2006.09.005
DO - 10.1016/j.eswa.2006.09.005
M3 - Article
AN - SCOPUS:34248550574
SN - 0957-4174
VL - 34
SP - 375
EP - 383
JO - Expert Systems With Applications
JF - Expert Systems With Applications
IS - 1
ER -