Pernot, Pascal and Huang, Bing and Savin, Andreas (2020) Impact of non-normal error distributions on the benchmarking and ranking of quantum machine learning models. Machine Learning: Science and Technology, 1 (3). 035011. ISSN 2632-2153
Pernot_2020_Mach._Learn.__Sci._Technol._1_035011.pdf - Published Version
Download (2MB)
Abstract
Quantum machine learning models have been gaining significant traction within atomistic simulation communities. Conventionally, relative model performances are being assessed and compared using learning curves (prediction error vs. training set size). This article illustrates the limitations of using the Mean Absolute Error (MAE) for benchmarking, which is particularly relevant in the case of non-normal error distributions. We analyze more specifically the prediction error distribution of the kernel ridge regression with SLATM representation and L2 distance metric (KRR-SLATM-L2) for effective atomization energies of QM7b molecules calculated at the level of theory CCSD(T)/cc-pVDZ. Error distributions of HF and MP2 at the same basis set referenced to CCSD(T) values were also assessed and compared to the KRR model. We show that the true performance of the KRR-SLATM-L2 method over the QM7b dataset is poorly assessed by the Mean Absolute Error, and can be notably improved after adaptation of the learning set.
Item Type: | Article |
---|---|
Subjects: | European Scholar > Multidisciplinary |
Depositing User: | Managing Editor |
Date Deposited: | 30 Jun 2023 04:25 |
Last Modified: | 20 Oct 2023 04:06 |
URI: | http://article.publish4promo.com/id/eprint/2069 |