A gene selection method for microarray data based on risk genes

Tzu-Tsung Wong, Ding Qun Chen

Research output: Contribution to journalArticle

7 Citations (Scopus)

Abstract

Many gene selection methods have been proposed to select a subset of genes that can have a high prediction accuracy for cancer classification, and most set the same preference for all genes. However, many biological reports have pointed out that mutated or flawed genes, named as risk genes, can be one of the major causes of a specific disease. This study proposes a gene selection method based on the risk genes found in biological reports. The information provided by risk genes can reduce the time complexity for gene selection and increase the accuracy of cancer classification. This gene selection method is composed of two stages. Since all risk genes must be chosen, the first stage is to remove the genes that have similar expression levels or functions to risk genes. The next stage is to perform gene selection and gene replacement based on the results of a process that divides the remaining genes into clusters. Based on the test results from four microarray data sets, our gene selection method outperforms those proposed by previous studies, and genes that have the potential to be new risk genes are presented.

Original languageEnglish
Pages (from-to)14065-14071
Number of pages7
JournalExpert Systems With Applications
Volume38
Issue number11
DOIs
Publication statusPublished - 2011 Oct 1

Fingerprint

Microarrays
Genes
Set theory

All Science Journal Classification (ASJC) codes

  • Engineering(all)
  • Computer Science Applications
  • Artificial Intelligence

Cite this

@article{aefbce4a00a24e34897cecc8f641c628,
title = "A gene selection method for microarray data based on risk genes",
abstract = "Many gene selection methods have been proposed to select a subset of genes that can have a high prediction accuracy for cancer classification, and most set the same preference for all genes. However, many biological reports have pointed out that mutated or flawed genes, named as risk genes, can be one of the major causes of a specific disease. This study proposes a gene selection method based on the risk genes found in biological reports. The information provided by risk genes can reduce the time complexity for gene selection and increase the accuracy of cancer classification. This gene selection method is composed of two stages. Since all risk genes must be chosen, the first stage is to remove the genes that have similar expression levels or functions to risk genes. The next stage is to perform gene selection and gene replacement based on the results of a process that divides the remaining genes into clusters. Based on the test results from four microarray data sets, our gene selection method outperforms those proposed by previous studies, and genes that have the potential to be new risk genes are presented.",
author = "Tzu-Tsung Wong and Chen, {Ding Qun}",
year = "2011",
month = "10",
day = "1",
doi = "10.1016/j.eswa.2011.04.215",
language = "English",
volume = "38",
pages = "14065--14071",
journal = "Expert Systems with Applications",
issn = "0957-4174",
publisher = "Elsevier Limited",
number = "11",

}

A gene selection method for microarray data based on risk genes. / Wong, Tzu-Tsung; Chen, Ding Qun.

In: Expert Systems With Applications, Vol. 38, No. 11, 01.10.2011, p. 14065-14071.

Research output: Contribution to journalArticle

TY - JOUR

T1 - A gene selection method for microarray data based on risk genes

AU - Wong, Tzu-Tsung

AU - Chen, Ding Qun

PY - 2011/10/1

Y1 - 2011/10/1

N2 - Many gene selection methods have been proposed to select a subset of genes that can have a high prediction accuracy for cancer classification, and most set the same preference for all genes. However, many biological reports have pointed out that mutated or flawed genes, named as risk genes, can be one of the major causes of a specific disease. This study proposes a gene selection method based on the risk genes found in biological reports. The information provided by risk genes can reduce the time complexity for gene selection and increase the accuracy of cancer classification. This gene selection method is composed of two stages. Since all risk genes must be chosen, the first stage is to remove the genes that have similar expression levels or functions to risk genes. The next stage is to perform gene selection and gene replacement based on the results of a process that divides the remaining genes into clusters. Based on the test results from four microarray data sets, our gene selection method outperforms those proposed by previous studies, and genes that have the potential to be new risk genes are presented.

AB - Many gene selection methods have been proposed to select a subset of genes that can have a high prediction accuracy for cancer classification, and most set the same preference for all genes. However, many biological reports have pointed out that mutated or flawed genes, named as risk genes, can be one of the major causes of a specific disease. This study proposes a gene selection method based on the risk genes found in biological reports. The information provided by risk genes can reduce the time complexity for gene selection and increase the accuracy of cancer classification. This gene selection method is composed of two stages. Since all risk genes must be chosen, the first stage is to remove the genes that have similar expression levels or functions to risk genes. The next stage is to perform gene selection and gene replacement based on the results of a process that divides the remaining genes into clusters. Based on the test results from four microarray data sets, our gene selection method outperforms those proposed by previous studies, and genes that have the potential to be new risk genes are presented.

UR - http://www.scopus.com/inward/record.url?scp=79959925185&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79959925185&partnerID=8YFLogxK

U2 - 10.1016/j.eswa.2011.04.215

DO - 10.1016/j.eswa.2011.04.215

M3 - Article

VL - 38

SP - 14065

EP - 14071

JO - Expert Systems with Applications

JF - Expert Systems with Applications

SN - 0957-4174

IS - 11

ER -