open access publication

Article, 2023

Application of interpretable group-embedded graph neural networks for pure compound properties

Computers & Chemical Engineering, ISSN 0098-1354, 1873-4375, Volume 176, Page 108291, 10.1016/j.compchemeng.2023.108291

Contributors

Aouichaoui, Adem R N 0000-0002-3297-6054 [1] Fan, Fan [1] Abildskov, Jens 0000-0003-1187-8778 [1] Sin, Gtirkan 0000-0003-0513-4502 (Corresponding author) [1]

Affiliations

  1. [1] Technical University of Denmark
  2. [NORA names: DTU Technical University of Denmark; University; Denmark; Europe, EU; Nordic; OECD]

Abstract

The ability to evaluate pure compound properties of various molecular species is an important prerequisite for process simulation in general and in particular for computer-aided molecular design (CAMD). Current techniques rely on group-contribution (GC) methods, which suffer from many drawbacks mainly the absence of contributions for specific groups. To overcome this challenge, in this work, we extended the range of interpretable graph neural network (GNN) models for describing a wide range of pure component properties. The new model library contains 30 different properties ranging from thermophysical, safety-related, and environmental properties. All of these have been modeled with a suitable level of accuracy for compound screening purposes compared to current GC models used within CAMD applications. Moreover, the developed models have been subjected to a series of sanity checks using logical and thermodynamic constraints. Results show the importance of evaluating the model across a range of properties to establish their thermodynamic consistency.

Keywords

Current techniques, GC model, absence, absence of contributions, accuracy, applications, checking, component properties, components, compound properties, compounds, computer-aided molecular design, consistency, constraints, contribution, design, drawbacks, environmental properties, graph, graph neural networks, group, group contributions, level of accuracy, levels, library, model, model library, molecular design, molecular species, network, neural network, process, process simulation, properties, purposes, range, results, safety-related, sanity, sanity checks, screening purposes, simulation, species, technique, thermodynamic consistency, thermodynamic constraints

Data Provider: Digital Science