TY - JOUR
T1 - ThermalProGAN
T2 - A sequence-based thermally stable protein generator trained using unpaired data
AU - Huang, Hui Ling
AU - Weng, Chong Heng
AU - Nordling, Torbjorn E.M.
AU - Liou, Yi Fan
N1 - Publisher Copyright:
© 2023 World Scientific Publishing Europe Ltd.
PY - 2023/2/1
Y1 - 2023/2/1
N2 - Motivation: The synthesis of proteins with novel desired properties is challenging but sought after by the industry and academia. The dominating approach is based on trial-and-error inducing point mutations, assisted by structural information or predictive models built with paired data that are difficult to collect. This study proposes a sequence-based unpaired-sample of novel protein inventor (SUNI) to build ThermalProGAN for generating thermally stable proteins based on sequence information. Results: The ThermalProGAN can strongly mutate the input sequence with a median number of 32 residues. A known normal protein, 1RG0, was used to generate a thermally stable form by mutating 51 residues. After superimposing the two structures, high similarity is shown, indicating that the basic function would be conserved. Eighty four molecular dynamics simulation results of 1RG0 and the COVID-19 vaccine candidates with a total simulation time of 840ns indicate that the thermal stability increased. Conclusion: This proof of concept demonstrated that transfer of a desired protein property from one set of proteins is feasible. Availability and implementation: The source code of ThermalProGAN can be freely accessed at https://github.com/markliou/ThermalProGAN/ with an MIT license. The website is https://thermalprogan.markliou.tw:433. Supplementary information: Supplementary data are available on Github.
AB - Motivation: The synthesis of proteins with novel desired properties is challenging but sought after by the industry and academia. The dominating approach is based on trial-and-error inducing point mutations, assisted by structural information or predictive models built with paired data that are difficult to collect. This study proposes a sequence-based unpaired-sample of novel protein inventor (SUNI) to build ThermalProGAN for generating thermally stable proteins based on sequence information. Results: The ThermalProGAN can strongly mutate the input sequence with a median number of 32 residues. A known normal protein, 1RG0, was used to generate a thermally stable form by mutating 51 residues. After superimposing the two structures, high similarity is shown, indicating that the basic function would be conserved. Eighty four molecular dynamics simulation results of 1RG0 and the COVID-19 vaccine candidates with a total simulation time of 840ns indicate that the thermal stability increased. Conclusion: This proof of concept demonstrated that transfer of a desired protein property from one set of proteins is feasible. Availability and implementation: The source code of ThermalProGAN can be freely accessed at https://github.com/markliou/ThermalProGAN/ with an MIT license. The website is https://thermalprogan.markliou.tw:433. Supplementary information: Supplementary data are available on Github.
UR - http://www.scopus.com/inward/record.url?scp=85152173563&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85152173563&partnerID=8YFLogxK
U2 - 10.1142/S0219720023500087
DO - 10.1142/S0219720023500087
M3 - Article
C2 - 36999645
AN - SCOPUS:85152173563
SN - 0219-7200
VL - 21
JO - Journal of Bioinformatics and Computational Biology
JF - Journal of Bioinformatics and Computational Biology
IS - 1
M1 - 2350008
ER -