Preprint, 2024

Deep Generative Model for the Dual-Objective Inverse Design of Metal Complexes

ChemRxiv, ISSN 2573-2293, 10.26434/chemrxiv-2024-mzs7b

Contributors

Strandgaard, Magnus [1] Linjordet, Trond [2] Kneiding, Hannes

0000-0002-6799-3129 [2] Burnage, Arron L

0000-0001-7136-2402 [2] Nova, Ainara

0000-0003-3368-7702 [2] Jensen, Jan Halborg

0000-0002-1465-1010 [1] Balcells, David

0000-0002-3389-0543 [2]

Affiliations

[1] University of Copenhagen

[NORA names:

KU University of Copenhagen

University

[2] University of Oslo

[NORA names:

Abstract

Deep generative models yielding transition metal complexes (TMCs) remain scarce despite the key role of these compounds in industrial catalytic processes, anticancer therapies, and energy transformations. Compared to drug discovery within the organic molecular space, TMCs pose further challenges including the encoding of chemical bonds of higher complexity and the optimization of multiple properties, in a context in which synthesizability is affected by additional, complex factors. In this work, we developed a junction tree variational autoencoder (JT-VAE) model for the generation of metal ligands. After implementing a SMILES-based encoding of the metal–ligand bonds, the model was trained with the tmQMg-L ligand library, allowing for the random generation of thousands of monodentate and bidentate ligands with full validity and high novelty. The generated ligands were labeled with two target properties of the associated [IrL4]+ and [IrL2]+ homoleptic TMCs; namely the HOMO-LUMO gap (ϵ) and the metal charge (qIr), both computed at a DFT level. This data was used to implement a conditional JT-VAE model generating ligands from a prompt, with the single or dual objective of optimizing either one or both properties in Y = (ϵ, qIr). Conditional ligand generation was able to navigate both central and extreme regions of this bidimensional property space, allowing for chemical interpretation based on the step-wise analysis of the decoded optimization trajectories.

Keywords

DFT, DFT level, Deep, HOMO-LUMO gap, analysis, anticancer, anticancer therapy, autoencoder, bidentate ligand, bonds, catalytic process, charge, chemical interpretation, complex, complex factors, compounds, context, data, deep generative models, design of metal complexes, discovery, drug, drug discovery, encoding, energy, energy transformation, factors, gap, generation, generation ligands, generative model, higher complexity, homoleptic transition metal complexes, industrial catalytic processes, interpretation, junction, levels, library, ligand, ligand generation, ligand libraries, metal, metal charge, metal complexes, metal ligands, metal-ligand bonds, model, molecular space, monodentate, multiple properties, novelty, objective, optimal trajectory, optimization, process, prompts, properties, property space, random generation, region, space, synthesizability, target, target properties, therapy, trajectory, transformation, transition, transition metal complexes, validity, variational autoencoder

Deep Generative Model for the Dual-Objective Inverse Design of Metal Complexes

Contributors

Affiliations

Abstract

Keywords

Data Provider: Digital Science

LINKS
-

Matching Records in NORA

SUBJECTS
+

DK Main Research Area

UN SDG Classification

OECD Classification

AU/NZ FOR Classification

METRICS
+

Citation Metrics

Attention Metrics

Attention Metrics

DK Open Access Indicator

Contributors

Affiliations

Abstract

Keywords

Data Provider: Digital Science

LINKS-

Matching Records in NORA

SUBJECTS+

DK Main Research Area

UN SDG Classification

OECD Classification

AU/NZ FOR Classification

METRICS+

Citation Metrics

Attention Metrics

Attention Metrics

DK Open Access Indicator

Matching Records in NORA

DK Open Access Indicator

DK Green Classification

LINKS
-

SUBJECTS
+

METRICS
+