Classification enhanced machine learning model for energetic stability of binary compounds

dc.contributor.authorLiu, Y. K.
dc.contributor.authorLiu, Z. R.
dc.contributor.authorXu, T. F.
dc.contributor.authorLegut, Dominik
dc.contributor.authorYin, X.
dc.contributor.authorZhang, R. F.
dc.date.accessioned2026-04-29T11:35:47Z
dc.date.available2026-04-29T11:35:47Z
dc.date.issued2024
dc.description.abstractAs contemporary computational technologies and machine learning methodologies rapidly evolve, machine learning (ML) models for predicting formation enthalpies of materials exhibited convincible numerical precision and remarkable predictive efficiency, thus establishing a solid foundation for materials thermodynamic design. Despite achieving numerically high global probability accuracy, current ML models for formation enthalpy nonetheless exhibit suboptimal local accuracy within specific physical domain, which can be attributed to the misalignment between the physical constraints of chemical bonds and the critical descriptors capturing classspecific traits. Herein, we propose a novel approach to improve the local precision of the ML model for predicting formation enthalpy by utilizing Miedema theory-based classification, which segments data into distinct categories according to the electronegativity difference, electron density discontinuity and atomic size difference. Utilizing ML algorithms to build surrogate models guided by the classification strategy significantly improves the local predictive accuracy of formation enthalpy for specific binary compounds, significantly raising the R2 value from 0.4-0.9 to 0.8-0.9 compared to an unclassified method. Furthermore, feature importance analysis demonstrates that the pivotal factors for each category vary in some manner, highlighting the insufficiency of a sole ML model in classifying large-dimensional data, which can be addressed by adopting a physicsinformed classification strategy. Our results suggest that employing physical-informed classification scheme into ML equips the models with broad applicability and local accuracy, which also shed light for other material properties predication.
dc.description.firstpageart. no. 113277
dc.description.sourceWeb of Science
dc.description.volume244
dc.identifier.citationComputational Materials Science. 2024, vol. 244, art. no. 113277.
dc.identifier.doi10.1016/j.commatsci.2024.113277
dc.identifier.issn0927-0256
dc.identifier.issn1879-0801
dc.identifier.urihttp://hdl.handle.net/10084/158525
dc.identifier.wos001291170000001
dc.language.isoen
dc.publisherElsevier
dc.relation.ispartofseriesComputational Materials Science
dc.relation.urihttps://doi.org/10.1016/j.commatsci.2024.113277
dc.rights© 2024 Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
dc.subjectformation enthalpy
dc.subjectbinary compounds
dc.subjectMiedema theory
dc.subjectmachine learning
dc.titleClassification enhanced machine learning model for energetic stability of binary compounds
dc.typearticle
dc.type.statusPeer-reviewed
dc.type.versionpublishedVersion

Files

License bundle

Now showing 1 - 1 out of 1 results
Loading...
Thumbnail Image
Name:
license.txt
Size:
718 B
Format:
Item-specific license agreed upon to submission
Description: