The distance distributions of the four types of bases A, C, G and T in the complete sequences of human genome are shown to have long-tail power-law but short-distance exponential behaviors. However, the random sequences with identical numbers of bases show only short-distance exponential behaviors. The DNA sequence of E. coli, which is much shorter than human's, is shown to exhibit essentially exponential behavior as its corresponding random sequence. Also, DNA sequence of human's with smaller C + G content is seen to enjoy longer power-law tail in its distributions of C and G. The coefficient in the exponential decaying function shows a linear dependent with the C + G content but the decay exponent in the power-law shows no such dependence.
All Science Journal Classification (ASJC) codes
- Agricultural and Biological Sciences (miscellaneous)
- Applied Mathematics