TY - JOUR
T1 - Prediction of leucine-rich nuclear export signal containing proteins with NESsential
AU - Fu, Szu Chin
AU - Imai, Kenichiro
AU - Horton, Paul
N1 - Funding Information:
Interchange association Japan (IAJ; to S.C.F.); and a research fellowship of the Japan Society for the Promotion of Science (JSPS; to K.I.). Funding for open access charge: research institute (AIST, Computational Biology Research Center, Tokyo, Japan).
PY - 2011/9
Y1 - 2011/9
N2 - The classical nuclear export signal (NES), also known as the leucine-rich NES, is a protein localization signal often involved in important processes such as signal transduction and cell cycle regulation. Although 15 years has passed since its discovery, limited structural information and high sequence diversity have hampered understanding of the NES. Several consensus sequences have been proposed to describe it, but they suffer from poor predictive power. On the other hand, the NetNES server provides the only computational method currently available. Although these two methods have been widely used to attempt to find the correct NES position within potential NES-containing proteins, their performance has not yet been evaluated on the basic task of identifying NES-containing proteins. We propose a new predictor, NESsential, which uses sequence derived meta-features, such as predicted disorder and solvent accessibility, in addition to primary sequence. We demonstrate that it can identify promising NES-containing candidate proteins (albeit at low coverage), but other methods cannot. We also quantitatively demonstrate that predicted disorder is a useful feature for prediction and investigate the different features of (predicted) ordered versus disordered NES's. Finally, we list 70 recently discovered NES-containing proteins, doubling the number available to the community.
AB - The classical nuclear export signal (NES), also known as the leucine-rich NES, is a protein localization signal often involved in important processes such as signal transduction and cell cycle regulation. Although 15 years has passed since its discovery, limited structural information and high sequence diversity have hampered understanding of the NES. Several consensus sequences have been proposed to describe it, but they suffer from poor predictive power. On the other hand, the NetNES server provides the only computational method currently available. Although these two methods have been widely used to attempt to find the correct NES position within potential NES-containing proteins, their performance has not yet been evaluated on the basic task of identifying NES-containing proteins. We propose a new predictor, NESsential, which uses sequence derived meta-features, such as predicted disorder and solvent accessibility, in addition to primary sequence. We demonstrate that it can identify promising NES-containing candidate proteins (albeit at low coverage), but other methods cannot. We also quantitatively demonstrate that predicted disorder is a useful feature for prediction and investigate the different features of (predicted) ordered versus disordered NES's. Finally, we list 70 recently discovered NES-containing proteins, doubling the number available to the community.
UR - http://www.scopus.com/inward/record.url?scp=80052492888&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80052492888&partnerID=8YFLogxK
U2 - 10.1093/nar/gkr493
DO - 10.1093/nar/gkr493
M3 - Article
C2 - 21705415
AN - SCOPUS:80052492888
SN - 0305-1048
VL - 39
SP - e111
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 16
ER -