van Gerwen, Puck and Fabrizio, Alberto and Wodrich, Matthew D and Corminboeuf, Clemence (2022) Physics-based representations for machine learning properties of chemical reactions. Machine Learning: Science and Technology, 3 (4). 045005. ISSN 2632-2153
van_Gerwen_2022_Mach._Learn.__Sci._Technol._3_045005.pdf - Published Version
Download (1MB)
Abstract
Physics-based representations constructed using only atomic positions and nuclear charges (also known as quantum machine learning, QML) allow for the reliable and efficient inference of molecular properties from training data. Chemistry is a science rooted in chemical reactions, naturally involving multiple molecular species. Here, we extend QML's capabilities to include the prediction of reaction properties by defining reaction representations: representations taking as input multiple molecules participating in a reaction, each represented by their corresponding atomic charges and three-dimensional coordinates. Several reaction representations are constructed from established molecular ones and benchmarked on four datasets representative of thermodynamic or kinetic reaction properties. One of these, the Hydroform-22-TS dataset (2350 energy barriers), is introduced as part of this work. The relevant ingredients for a high-performing reaction representation are extracted and used to construct the Bond-Based Reaction Representation ($B^2R^2$) for the prediction of quantum-chemical properties of chemical reactions. Finally, variations of $B^2R^2$ with varying representation size vs. performance are provided.
Item Type: | Article |
---|---|
Subjects: | European Scholar > Multidisciplinary |
Depositing User: | Managing Editor |
Date Deposited: | 07 Jul 2023 03:33 |
Last Modified: | 10 Oct 2023 05:30 |
URI: | http://article.publish4promo.com/id/eprint/2099 |