ThermalProGAN: A sequence-based thermally stable protein generator trained using unpaired data

Hui Ling Huang, Chong Heng Weng, Torbjorn E.M. Nordling, Yi Fan Liou

Research output: Contribution to journalArticlepeer-review

Abstract

Motivation: The synthesis of proteins with novel desired properties is challenging but sought after by the industry and academia. The dominating approach is based on trial-and-error inducing point mutations, assisted by structural information or predictive models built with paired data that are difficult to collect. This study proposes a sequence-based unpaired-sample of novel protein inventor (SUNI) to build ThermalProGAN for generating thermally stable proteins based on sequence information. Results: The ThermalProGAN can strongly mutate the input sequence with a median number of 32 residues. A known normal protein, 1RG0, was used to generate a thermally stable form by mutating 51 residues. After superimposing the two structures, high similarity is shown, indicating that the basic function would be conserved. Eighty four molecular dynamics simulation results of 1RG0 and the COVID-19 vaccine candidates with a total simulation time of 840ns indicate that the thermal stability increased. Conclusion: This proof of concept demonstrated that transfer of a desired protein property from one set of proteins is feasible. Availability and implementation: The source code of ThermalProGAN can be freely accessed at https://github.com/markliou/ThermalProGAN/ with an MIT license. The website is https://thermalprogan.markliou.tw:433. Supplementary information: Supplementary data are available on Github.

Original languageEnglish
Article number2350008
JournalJournal of Bioinformatics and Computational Biology
Volume21
Issue number1
DOIs
Publication statusPublished - 2023 Feb 1

All Science Journal Classification (ASJC) codes

  • Biochemistry
  • Molecular Biology
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'ThermalProGAN: A sequence-based thermally stable protein generator trained using unpaired data'. Together they form a unique fingerprint.

Cite this