TY - JOUR
T1 - An overview of the Phalaenopsis orchid genome through BAC end sequence analysis
AU - Hsu, Chia Chi
AU - Chung, Yu Lin
AU - Chen, Tien Chih
AU - Lee, Yu Ling
AU - Kuo, Yi Tzu
AU - Tsai, Wen Chieh
AU - Hsiao, Yu Yun
AU - Chen, Yun Wen
AU - Wu, Wen Luan
AU - Chen, Hong Hwa
N1 - Funding Information:
We thank Dr. Michel Delseny (Laboratory of Plant Genome and Plant Physiology, University of Perpignan, France) for helpful discussions and critical reading of the manuscript. We acknowledge the technical services provided by the Sequencing Core Facility at the National Yang-Ming University Genome Research Center (YMGC, Taipei, Taiwan). The Sequencing Core Facility is supported by the National Research Program for Genomic Medicine (NRPGM) of the National Science Council, Taiwan. We also acknowledge the technical services provided by the Arizona Genomics Institute (AGI) DNA Sequencing Center and the AGI Physical Mapping Center, Arizona, USA. This work was supported by grants 98-2321-B-006-004-MY3 from National Science Council, Taiwan, and 98AS-1.2.1-ST-a4 and 100AS-1.2.2-ST-a2 from Council of Agriculture, Taiwan.
PY - 2011/1/6
Y1 - 2011/1/6
N2 - Background: Phalaenopsis orchids are popular floral crops, and development of new cultivars is economically important to floricultural industries worldwide. Analysis of orchid genes could facilitate orchid improvement. Bacterial artificial chromosome (BAC) end sequences (BESs) can provide the first glimpses into the sequence composition of a novel genome and can yield molecular markers for use in genetic mapping and breeding.Results: We used two BAC libraries (constructed using the BamHI and HindIII restriction enzymes) of Phalaenopsis equestris to generate pair-end sequences from 2,920 BAC clones (71.4% and 28.6% from the BamHI and HindIII libraries, respectively), at a success rate of 95.7%. A total of 5,535 BESs were generated, representing 4.5 Mb, or about 0.3% of the Phalaenopsis genome. The trimmed sequences ranged from 123 to 1,397 base pairs (bp) in size, with an average edited read length of 821 bp. When these BESs were subjected to sequence homology searches, it was found that 641 (11.6%) were predicted to represent protein-encoding regions, whereas 1,272 (23.0%) contained repetitive DNA. Most of the repetitive DNA sequences were gypsy- and copia-like retrotransposons (41.9% and 12.8%, respectively), whereas only 10.8% were DNA transposons. Further, 950 potential simple sequence repeats (SSRs) were discovered. Dinucleotides were the most abundant repeat motifs; AT/TA dimer repeats were the most frequent SSRs, representing 253 (26.6%) of all identified SSRs. Microsynteny analysis revealed that more BESs mapped to the whole-genome sequences of poplar than to those of grape or Arabidopsis, and even fewer mapped to the rice genome. This work will facilitate analysis of the Phalaenopsis genome, and will help clarify similarities and differences in genome composition between orchids and other plant species.Conclusion: Using BES analysis, we obtained an overview of the Phalaenopsis genome in terms of gene abundance, the presence of repetitive DNA and SSR markers, and the extent of microsynteny with other plant species. This work provides a basis for future physical mapping of the Phalaenopsis genome and advances our knowledge thereof.
AB - Background: Phalaenopsis orchids are popular floral crops, and development of new cultivars is economically important to floricultural industries worldwide. Analysis of orchid genes could facilitate orchid improvement. Bacterial artificial chromosome (BAC) end sequences (BESs) can provide the first glimpses into the sequence composition of a novel genome and can yield molecular markers for use in genetic mapping and breeding.Results: We used two BAC libraries (constructed using the BamHI and HindIII restriction enzymes) of Phalaenopsis equestris to generate pair-end sequences from 2,920 BAC clones (71.4% and 28.6% from the BamHI and HindIII libraries, respectively), at a success rate of 95.7%. A total of 5,535 BESs were generated, representing 4.5 Mb, or about 0.3% of the Phalaenopsis genome. The trimmed sequences ranged from 123 to 1,397 base pairs (bp) in size, with an average edited read length of 821 bp. When these BESs were subjected to sequence homology searches, it was found that 641 (11.6%) were predicted to represent protein-encoding regions, whereas 1,272 (23.0%) contained repetitive DNA. Most of the repetitive DNA sequences were gypsy- and copia-like retrotransposons (41.9% and 12.8%, respectively), whereas only 10.8% were DNA transposons. Further, 950 potential simple sequence repeats (SSRs) were discovered. Dinucleotides were the most abundant repeat motifs; AT/TA dimer repeats were the most frequent SSRs, representing 253 (26.6%) of all identified SSRs. Microsynteny analysis revealed that more BESs mapped to the whole-genome sequences of poplar than to those of grape or Arabidopsis, and even fewer mapped to the rice genome. This work will facilitate analysis of the Phalaenopsis genome, and will help clarify similarities and differences in genome composition between orchids and other plant species.Conclusion: Using BES analysis, we obtained an overview of the Phalaenopsis genome in terms of gene abundance, the presence of repetitive DNA and SSR markers, and the extent of microsynteny with other plant species. This work provides a basis for future physical mapping of the Phalaenopsis genome and advances our knowledge thereof.
UR - http://www.scopus.com/inward/record.url?scp=78650907887&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78650907887&partnerID=8YFLogxK
U2 - 10.1186/1471-2229-11-3
DO - 10.1186/1471-2229-11-3
M3 - Article
C2 - 21208460
AN - SCOPUS:78650907887
SN - 1471-2229
VL - 11
JO - BMC Plant Biology
JF - BMC Plant Biology
M1 - 3
ER -