Discovering design principles of collagen molecular stability using a genetic algorithm, deep learning, and experimental validation

Eesha Khare, Chi Hua Yu, Constancio Gonzalez Obeso, Mario Milazzo, David L. Kaplan, Markus J. Buehler

研究成果: Article同行評審

17 引文 斯高帕斯(Scopus)

摘要

Collagen is the most abundant structural protein in humans, providing crucial mechanical properties, including high strength and toughness, in tissues. Collagen-based biomaterials are, therefore, used for tissue repair and regeneration. Utilizing collagen effectively during materials processing ex vivo and subsequent function in vivo requires stability over wide temperature ranges to avoid denaturation and loss of structure, measured as melting temperature (Tm). Although significant research has been conducted on understanding how collagen primary amino acid sequences correspond to Tm values, a robust framework to facilitate the design of collagen sequences with specific Tm remains a challenge. Here, we develop a general model using a genetic algorithm within a deep learning framework to design collagen sequences with specific Tm values. We report 1,000 de novo collagen sequences, and we show that we can efficiently use this model to generate collagen sequences and verify their Tm values using both experimental and computational methods. We find that the model accurately predicts Tm values within a few degrees centigrade. Further, using this model, we conduct a high-throughput study to identify the most frequently occurring collagen triplets that can be directly incorporated into collagen. We further discovered that the number of hydrogen bonds within collagen calculated with molecular dynamics (MD) is directly correlated to the experimental measurement of triple-helical quality. Ultimately, we see this work as a critical step to helping researchers develop collagen sequences with specific Tm values for intended materials manufacturing methods and biomedical applications, realizing a mechanistic materials by design paradigm.

原文English
文章編號e2209524119
期刊Proceedings of the National Academy of Sciences of the United States of America
119
發行號40
DOIs
出版狀態Published - 2022 10月 4

All Science Journal Classification (ASJC) codes

  • 多學科

指紋

深入研究「Discovering design principles of collagen molecular stability using a genetic algorithm, deep learning, and experimental validation」主題。共同形成了獨特的指紋。

引用此