mRNAsi index: Machine learning in mining lung adenocarcinoma stem cell biomarkers

Yitong Zhang, Hui Li, Joseph Ta Chien Tseng, I. Chia Lien, Fenglan Li, Wei Wu

Research output: Contribution to journal › Article › peer-review

21 Citations (Scopus)


Cancer stem cells (CSCs), characterized by self-renewal and unlimited proliferation, lead to therapeutic resistance in lung cancer. In this study, we aimed to investigate the expression of stem cell-related genes in lung adenocarcinoma (LUAD). The stemness index based on mRNA expression (mRNAsi) was used to analyze LUAD cases in The Cancer Genome Atlas (TCGA). First, mRNAsi was analyzed with respect to differential expression, survival, clinical stage, and gender in LUAD. Then, weighted gene co-expression network analysis (WGCNA) was performed to identify stemness-related modules and key genes. The interplay among the key genes was explored at the transcription and protein levels. Enrichment analysis was performed to annotate the functions and pathways of the key genes. The expression levels of the key genes were validated on a pan-cancer scale. Stage-associated gene expression levels and survival probability were also validated. The Gene Expression Omnibus (GEO) database was additionally used for validation. The mRNAsi was significantly upregulated in cancer cases. In general, the mRNAsi score increased with clinical stage and differed significantly by gender. Groups with lower mRNAsi had better overall survival in major LUADs within five years. The distinguished modules and key genes were selected according to their correlations with the mRNAsi. Thirteen key genes (CCNB1, BUB1, BUB1B, CDC20, PLK1, TTK, CDC45, ESPL1, CCNA2, MCM6, ORC1, MCM2, and CHEK1) were enriched in the cell cycle Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway and were also related to cell proliferation Gene Ontology (GO) terms. Eight of the thirteen genes have been reported to be associated with CSC characteristics. However, all of them have previously been overlooked in LUAD. Their expression increased with the pathological stage of LUAD, and these genes were clearly upregulated in the pan-cancer analysis. In the GEO database, only the tumor necrosis factor receptor-associated factor-interacting protein (TRAIP) from the blue module was matched with the stemness microarray data. These key genes were found to be strongly correlated as a whole and could be used as therapeutic targets in the treatment of LUAD by inhibiting stemness features.
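
The abstract states that key genes were selected according to their correlations with the mRNAsi. A minimal sketch of that idea follows, under loud assumptions: it is not the weighted gene co-expression network analysis itself, only a per-gene Pearson correlation against the stemness score. The function names, the 0.8 cutoff, and the toy data are all hypothetical and do not come from the study.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # constant vector: correlation is undefined, treated as 0 here
    return cov / math.sqrt(var_x * var_y)


def rank_genes_by_stemness(expression, mrnasi, min_abs_r=0.8):
    """Return (gene, r) pairs with |r| >= min_abs_r, strongest first.

    expression: dict mapping gene name -> per-sample expression values
    mrnasi:     per-sample stemness scores, in the same sample order
    min_abs_r:  hypothetical cutoff, not a value reported in the abstract
    """
    scored = [(gene, pearson(values, mrnasi)) for gene, values in expression.items()]
    selected = [(gene, r) for gene, r in scored if abs(r) >= min_abs_r]
    return sorted(selected, key=lambda pair: abs(pair[1]), reverse=True)


# Toy data invented for illustration only (not TCGA data, not real genes).
toy_mrnasi = [0.10, 0.25, 0.40, 0.55, 0.70]
toy_expression = {
    "GENE_A": [1.0, 2.0, 3.0, 4.0, 5.0],  # rises with the stemness score
    "GENE_B": [5.0, 4.0, 3.0, 2.0, 1.0],  # falls with the stemness score
    "GENE_C": [2.0, 2.0, 2.0, 2.0, 2.0],  # flat, carries no signal
}
hits = rank_genes_by_stemness(toy_expression, toy_mrnasi)
```

In this sketch both positively and negatively correlated genes pass the cutoff; a real analysis would more likely work on module-level summaries and apply significance testing before naming key genes.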

Original language: English
Article number: 257
Issue number: 3
Publication status: Published - 2020 Mar

All Science Journal Classification (ASJC) codes

  • Genetics
  • Genetics (clinical)

