TY - JOUR
T1 - DBD2BS
T2 - Connecting a DNA-binding protein with its binding sites
AU - Chien, Ting Ying
AU - Lin, Chih Kang
AU - Lin, Chih Wei
AU - Weng, Yi Zhong
AU - Chen, Chien Yu
AU - Chang, Darby Tien Hao
N1 - Funding Information:
National Science Council of Taiwan [NSC 100-2627-B-002-002] and the Center for Systems Biology, National Taiwan University. Funding for open access charge: National Science Council of Taiwan.
Funding Information:
The authors would like to thank National Science Council of Taiwan and the Center for Systems Biology, National Taiwan University for the financial support.
PY - 2012/7
Y1 - 2012/7
N2 - By binding to short and highly conserved DNA sequences in genomes, DNA-binding proteins initiate, enhance or repress biological processes. Accurately identifying such binding sites, often represented by position weight matrices (PWMs), is an important step in understanding the control mechanisms of cells. When given coordinates of a DNA-binding domain (DBD) bound with DNA, a potential function can be used to estimate the change of binding affinity after base substitutions, where the changes can be summarized as a PWM. This technique provides an effective alternative when the chromatin immunoprecipitation data are unavailable for PWM inference. To facilitate the procedure of predicting PWMs based on protein-DNA complexes or even structures of the unbound state, the web server, DBD2BS, is presented in this study. The DBD2BS uses an atom-level knowledge-based potential function to predict PWMs characterizing the sequences to which the query DBD structure can bind. For unbound queries, a list of 1066 DBD-DNA complexes (including 1813 protein chains) is compiled for use as templates for synthesizing bound structures. The DBD2BS provides users with an easy-to-use interface for visualizing the PWMs predicted based on different templates and the spatial relationships of the query protein, the DBDs and the DNAs. The DBD2BS is the first attempt to predict PWMs of DBDs from unbound structures rather than from bound ones. This approach increases the number of existing protein structures that can be exploited when analyzing protein-DNA interactions. In a recent study, the authors showed that the kernel adopted by the DBD2BS can generate PWMs consistent with those obtained from the experimental data. The use of DBD2BS to predict PWMs can be incorporated with sequence-based methods to discover binding sites in genome-wide studies. Available at: http://dbd2bs.csie.ntu.edu.tw/, http://dbd2bs.csbb.ntu.edu.tw/, and http://dbd2bs.ee.ncku.edu.tw.
UR - http://www.scopus.com/inward/record.url?scp=84864443165&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84864443165&partnerID=8YFLogxK
U2 - 10.1093/nar/gks564
DO - 10.1093/nar/gks564
M3 - Article
C2 - 22693214
AN - SCOPUS:84864443165
SN - 0305-1048
VL - 40
SP - W173-W179
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - W1
ER -