TY - JOUR
T1 - Model for the distributions of k-mers in DNA sequences
AU - Chen, Yaw Hwang
AU - Nyeo, Su Long
AU - Yeh, Chiung Yuh
PY - 2005/7
Y1 - 2005/7
N2 - The evolutionary features based on the distributions of k-mers in the DNA sequences of various organisms are studied. The organisms are classified into three groups based on their evolutionary periods: (a) E. coli and T. pallidum, (b) yeast, zebrafish, A. thaliana, and fruit fly, (c) mouse, chicken, and human. The distributions of 6-mers of these three groups are shown to be, respectively, (a) unimodal, (b) unimodal with peaks generally shifted to smaller frequencies of occurrence, (c) bimodal. To describe the bimodal feature of the k-mer distributions of group (c), a model based on the cytosine-guanine "CG" content of the DNA sequences is introduced and shown to provide reasonably good agreement.
AB - The evolutionary features based on the distributions of k-mers in the DNA sequences of various organisms are studied. The organisms are classified into three groups based on their evolutionary periods: (a) E. coli and T. pallidum, (b) yeast, zebrafish, A. thaliana, and fruit fly, (c) mouse, chicken, and human. The distributions of 6-mers of these three groups are shown to be, respectively, (a) unimodal, (b) unimodal with peaks generally shifted to smaller frequencies of occurrence, (c) bimodal. To describe the bimodal feature of the k-mer distributions of group (c), a model based on the cytosine-guanine "CG" content of the DNA sequences is introduced and shown to provide reasonably good agreement.
UR - http://www.scopus.com/inward/record.url?scp=27244460374&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=27244460374&partnerID=8YFLogxK
U2 - 10.1103/PhysRevE.72.011908
DO - 10.1103/PhysRevE.72.011908
M3 - Article
C2 - 16090002
AN - SCOPUS:27244460374
SN - 1539-3755
VL - 72
JO - Physical Review E - Statistical, Nonlinear, and Soft Matter Physics
JF - Physical Review E - Statistical, Nonlinear, and Soft Matter Physics
IS - 1
M1 - 011908
ER -