Size‐Extensive Molecular Machine Learning with Global Representations †

Abstract: Machine learning (ML) models are increasingly used in combination with electronic structure calculations to predict molecular properties at a much lower computational cost in high‐throughput settings. Such ML models require representations that encode the molecular structure, which are generally designed to respect the symmetries and invariances of the target property. However, size‐extensivity is usually not guaranteed for so‐called global representations. In this contribution, we show how extensivity can be built into global ML models using, e. g., the Many‐Body Tensor Representation. Properties of extensive and non‐extensive models for the atomization energy are systematically explored by training on small molecules and testing on small, medium and large molecules. Our results show that non‐extensive models are only useful in the size‐range of their training set, whereas extensive models provide reasonable predictions across large size differences. Remaining sources of error for extensive models are discussed.

Location
Deutsche Nationalbibliothek Frankfurt am Main
Extent
Online-Ressource
Language
Englisch

Bibliographic citation
Size‐Extensive Molecular Machine Learning with Global Representations † ; volume:2 ; number:4 ; year:2020 ; extent:7
ChemSystemsChem ; 2, Heft 4 (2020) (gesamt 7)

Creator
Jung, Hyunwook
Stocker, Sina
Kunkel, Christian
Oberhofer, Harald
Han, Byungchan
Reuter, Karsten
Margraf, Johannes T.

DOI
10.1002/syst.201900052
URN
urn:nbn:de:101:1-2022062910340301146108
Rights
Open Access; Der Zugriff auf das Objekt ist unbeschränkt möglich.
Last update
15.08.2025, 7:31 AM CEST

Data provider

This object is provided by:
Deutsche Nationalbibliothek. If you have any questions about the object, please contact the data provider.

Associated

Other Objects (12)