Article, 2024

Integration of the Butina algorithm and ensemble learning strategies for the advancement of a pharmacophore ligand-based model: an in silico investigation of apelin agonists

Frontiers in Chemistry, ISSN 2296-2646, Volume 12, Page 1382319, 10.3389/fchem.2024.1382319

Contributors

Tran, Xuan-Truc Dinh [1] Phan, Tieu-Long

0000-0002-3532-2064 [2] [3] To, Van-Thinh

0000-0002-7640-0807 [1] Tran, Ngoc-Vi Nguyen

0000-0002-1724-9457 [4] Nguyen, Nhu-Ngoc Song [1] Nguyen, Dong-Nghi Hoang

0000-0002-9701-1800 [1] Tran, Ngoc-Tam Nguyen

0009-0004-3893-0497 [1] Truong, Tuyen Ngoc

0000-0002-0952-1633 (Corresponding author) [1]

Affiliations

[1] Ho Chi Minh City Medicine and Pharmacy University

[NORA names:

[2] Leipzig University

[NORA names:

[3] University of Southern Denmark

[NORA names:

SDU University of Southern Denmark

University

[4] Uppsala University

[NORA names:

Abstract

Introduction: 3D pharmacophore models describe the ligand's chemical interactions in their bioactive conformation. They offer a simple but sophisticated approach to decipher the chemically encoded ligand information, making them a valuable tool in drug design. Methods: Our research summarized the key studies for applying 3D pharmacophore models in virtual screening for 6,944 compounds of APJ receptor agonists. Recent advances in clustering algorithms and ensemble methods have enabled classical pharmacophore modeling to evolve into more flexible and knowledge-driven techniques. Butina clustering categorizes molecules based on their structural similarity (indicated by the Tanimoto coefficient) to create a structurally diverse training dataset. The learning method combines various individual pharmacophore models into a set of pharmacophore models for pharmacophore space optimization in virtual screening. Results: This approach was evaluated on Apelin datasets and afforded good screening performance, as proven by Receiver Operating Characteristic (AUC score of 0.994 ± 0.007), enrichment factor of (EF1% of 50.07 ± 0.211), Güner-Henry score of 0.956 ± 0.015, and F-measure of 0.911 ± 0.031. Discussion: Although one of the high-scoring models achieved statistically superior results in each dataset (AUC of 0.82; an EF1% of 19.466; GH of 0.131 and F1-score of 0.071), the ensemble learning method including voting and stacking method balanced the shortcomings of each model and passed with close performance measures.

Keywords

Butinae, Discussion:</b> , F-measure, advances, agonists, algorithm, apelin, bioactive conformation, chemical, chemical interaction, clustering algorithm, clusters, compounds, conformation, dataset, design, diverse training dataset, drug, drug design, ensemble learning method, ensemble learning strategy, ensemble methods, in silico investigation, information, integration, interaction, knowledge-driven techniques, learning, learning methods, learning strategies, ligand, ligand information, ligand-based models, measurements, method, model, molecules, optimization, performance, performance measures, pharmacophore, pharmacophore model, receiver, receptor agonists, research, results, scores, screening, screening performance, shortcomings, similarity, space optimization, stack, stacking method, strategies, structural similarity, structure, study, technique, training dataset, virtual screening, voting

Integration of the Butina algorithm and ensemble learning strategies for the advancement of a pharmacophore ligand-based model: an in silico investigation of apelin agonists

Contributors

Affiliations

Abstract

Keywords

Data Provider: Digital Science

LINKS
-

Matching Records in NORA

SUBJECTS
+

DK Main Research Area

UN SDG Classification

OECD Classification

AU/NZ FOR Classification

METRICS
+

Citation Metrics

Attention Metrics

Attention Metrics

DK Open Access Indicator

Contributors

Affiliations

Abstract

Keywords

Data Provider: Digital Science

LINKS-

Matching Records in NORA

SUBJECTS+

DK Main Research Area

UN SDG Classification

OECD Classification

AU/NZ FOR Classification

METRICS+

Citation Metrics

Attention Metrics

Attention Metrics

DK Open Access Indicator

Matching Records in NORA

DK Open Access Indicator

DK Green Classification

LINKS
-

SUBJECTS
+

METRICS
+