TY - JOUR
T1 - Mechanical property prediction of random copolymers using uncertainty-based active learning
AU - Chang, Wei Che
AU - Tsai, Zong Yun
AU - Chen, Chin Wen
AU - Yu, Chi Hua
AU - Chen, Chuin Shan
N1 - Publisher Copyright:
© 2024 Elsevier B.V.
PY - 2025/1/31
Y1 - 2025/1/31
N2 - The copolymer, a widely used material in our daily lives, presents a significant challenge in targeted sequence design. While recent advancements in computational simulation and data science offer a promising avenue for addressing this complex issue, challenges persist in labeled data scarcity. In this study, we introduce an uncertainty-based active learning framework for predicting the properties of random copolymers. We found that the active learning strategy allowed for labeling only 40 data points within the design space of 1550 data points, drastically reducing the labeling efforts by 97%. Most data selected by active learning were positioned on the design space's periphery, transforming the learning task into an interpolation problem. Through integrating active learning and molecular dynamics, we successfully overcame the combinatorial explosion problem in copolymer sequence design, streamlining the data labeling process and culminating in a highly accurate model. This research demonstrates data science's potential in polymer design, especially when facing data scarcity.
AB - The copolymer, a widely used material in our daily lives, presents a significant challenge in targeted sequence design. While recent advancements in computational simulation and data science offer a promising avenue for addressing this complex issue, challenges persist in labeled data scarcity. In this study, we introduce an uncertainty-based active learning framework for predicting the properties of random copolymers. We found that the active learning strategy allowed for labeling only 40 data points within the design space of 1550 data points, drastically reducing the labeling efforts by 97%. Most data selected by active learning were positioned on the design space's periphery, transforming the learning task into an interpolation problem. Through integrating active learning and molecular dynamics, we successfully overcame the combinatorial explosion problem in copolymer sequence design, streamlining the data labeling process and culminating in a highly accurate model. This research demonstrates data science's potential in polymer design, especially when facing data scarcity.
UR - http://www.scopus.com/inward/record.url?scp=85208236657&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85208236657&partnerID=8YFLogxK
U2 - 10.1016/j.commatsci.2024.113489
DO - 10.1016/j.commatsci.2024.113489
M3 - Article
AN - SCOPUS:85208236657
SN - 0927-0256
VL - 247
JO - Computational Materials Science
JF - Computational Materials Science
M1 - 113489
ER -