open access publication

Article, 2022

Quantum Machine Learning Approach for Studying Atmospheric Cluster Formation

Environmental Science & Technology Letters, ISSN 2328-8930, Volume 9, 3, Pages 239-244, 10.1021/acs.estlett.1c00997

Contributors

Kubečka, Jakub 0000-0002-8002-0911 [1] Christensen, Anders S. [2] Rasmussen, Freja Rydahl [1] Elm, Jonas 0000-0003-3736-4329 (Corresponding author) [1]

Affiliations

  1. [1] Aarhus University
  2. [NORA names: AU Aarhus University; University; Denmark; Europe, EU; Nordic; OECD];
  3. [2] Quantum Consulting by Christensen, Margarethenstrasse 71, 4053, Basel, Switzerland
  4. [NORA names: Switzerland; Europe, Non-EU; OECD]

Abstract

Quantum chemical (QC) calculations can yield direct insight into an atmospheric cluster formation mechanism and cluster formation rates. However, such calculations are extremely computationally demanding as more than millions of cluster configurations might exist and need to be computed. We present an efficient approach to produce high quality QC data sets for applications in cluster formation studies and how to train an accurate quantum machine learning model on the generated data. Using the two-component sulfuric acidwater system as a proof of concept, we demonstrate that a kernel ridge regression machine learning model with Δ-learning can be trained to accurately predict the binding energies of cluster equilibrium configurations with mean absolute errors below 0.5 kcal mol–1. Additionally, we enlarge the training data set with nonequilibrium configurations and show the possibility of predicting the binding energies of new structures of clusters several molecules larger than those in the training set. Applying the trained machine learning model leads to a drastic reduction in the number of relevant clusters that need to be explicitly evaluated by QC methods. The presented approach is directly transferable to clusters of arbitrary composition and will lead to faster and more efficient exploration of the configurational space of new cluster systems.

Keywords

D-Learning, QC data, QC method, applications, approach, arbitrary composition, atmosphere, binding, binding energy, calculations, chemical, cluster configurations, cluster formation, cluster formation mechanism, cluster formation rate, cluster system, clusters, composition, concept, configuration, configuration space, data, drastic reduction, efficient approach, efficient exploration, energy, equilibrium configuration, exploration, formation, formation mechanism, formation rate, formative study, kernel, learning approach, learning models, machine learning approach, machine learning models, mechanism, method, model, molecules, nonequilibrium configurations, quality, quantum, quantum chemical, quantum machine learning approach, quantum machine learning models, rate, reduction, regression machine learning model, relevant clusters, sets, space, structure, study, system, train machine learning models, training, training data, training set

Funders

  • Danish Agency for Science and Higher Education

Data Provider: Digital Science