Property Evaluation of Polymer Units in Asphalt Binder (II): Prediction of Properties Using Molecular Dynamics and Machine Learning
This study proposes a methodology for predicting the physical properties such as the density of polymer composites, including asphalt binders, and evaluates its feasibility by identifying the quantitative relationship between the structure and properties of individual polymers. To this end, features are constructed using molecular dynamics (MD) simulation results and descriptor calculation tools. This study investigates the changes in the calculated density depending on the characteristics of the training dataset and analyzes the feature characteristics across datasets to identify key features. In this study, 2,415 hydrocarbon and binder-derived polymer molecules were analyzed using MD simulations and 2,790 chemical descriptors generated using alvaDesc. The features were pre-processed using correlation filtering, PCA, and recursive feature elimination. The XGBoost models were trained using k-fold cross-validation and Optuna optimization. SHAP analysis was used to interpret feature contributions. The variables influencing the density prediction differed between the hydrocarbon and binder groups. However, the hydrogen atom count (H), van der Waals energy, and descriptors such as SpMAD_EA_LboR consistently had a strong impact. The trained models achieved high accuracy (R² > 0.99) across different datasets, and the SHAP results revealed that the edge adjacency, topological, and 3D geometrical descriptors were critical. In terms of predictive accuracy and interpretability, the integrated MDQSPR framework demonstrated high reliability for estimating the properties of individual binder polymers. This approach contributed to a molecular-level understanding and facilitated the development of ecofriendly and efficient modifiers for asphalt binders.