Innovative Virtual Screening of PD-L1 Inhibitors: The Synergy of Molecular Similarity, Neural Networks, and GNINA Docking

Van-Thinh To; Tieu-Long Phan; Bao-Vy Ngoc Doan; Phuoc-Chung Van Nguyen; Dong-Nghi Hoang Nguyen; Quang-Huy Nguyen Le; Hoang-Huy Nguyen; The-Chuong Trinh; Tuyen Ngoc Truong

doi:10.26434/chemrxiv-2024-zf1k8

Theoretical and Computational Chemistry

09 January 2024, Version 1

Working Paper

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Immune checkpoint inhibitors have attracted significant attention in oncology research in recent years. Numerous studies have shown that inhibitors targeting Programmed Death-Ligand 1 (PD-L1) play a pivotal role in countering the mechanisms by which cancer cells evade the immune system. This study aimed to develop an integrated screening model for PD-L1 inhibitors that combines an Artificial Neural Network (ANN), Molecular Similarity (MS) assessment, and molecular docking with GNINA 1.0. A database of 2044 substances with known PD-L1 inhibitory activity was compiled from Google Patents and used to support the molecular similarity evaluation and to train the machine learning model. For retrospective validation of the docking procedure, the human PD-L1 protein (Protein Data Bank (PDB) ID: 5N2F) was employed as a control. In the screening phase, 15,235 compounds from the DrugBank database passed through a series of steps: medicinal chemistry filters first, followed by MS assessment, the ANN model, and finally molecular docking with GNINA 1.0. Decoy generation yielded promising outcomes, with an AUC-ROC 1NN value of 0.52 and Doppelganger scores with a mean of 0.24 and a maximum of 0.346, indicating a high resemblance of the decoys to the active set. For MS, AVALON emerged as the most effective fingerprint for similarity searching, with an Enrichment Factor at 1% (EF 1%) of 10.96%, an AUC-ROC of 0.963, and an optimal similarity threshold of 0.32. The ANN model performed best in cross-validation, achieving an average precision of 0.863 ± 0.032 and an F1 score of 0.745 ± 0.039 and outperforming both the Support Vector Classifier (SVC) and Random Forest (RF) models, although not significantly. In external validation, the ANN model remained the best performer, with an average precision of 0.851 and an F1 score of 0.790.

GNINA 1.0, used for molecular docking, was validated through redocking and retrospective control, achieving an AUC of 0.975 with a critical cnn_pose_score threshold of 0.73. Of the initial 15,235 compounds, 128 were shortlisted by the MS and ANN models. Further screening with GNINA 1.0 identified 22 potential candidates, among which (3S)-1-(4-acetylphenyl)-5-oxopyrrolidine-3-carboxylic acid emerged as the most promising, with a cnn_pose_score of 0.79, a PD-L1 inhibitory probability of 70.5%, and a Tanimoto coefficient of 0.35.
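The quantities reported in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code (that is in the linked repository); it only shows, under stated assumptions, how a Tanimoto coefficient, an enrichment factor, and a threshold-based screening cascade are commonly computed. The similarity threshold (0.32) and the cnn_pose_score threshold (0.73) come from the abstract. The ANN probability cutoff of 0.5, the `Candidate` class, the function names, and the use of `>=` comparisons are assumptions made for illustration, and the medicinal chemistry filters are omitted.

```python
from __future__ import annotations

from dataclasses import dataclass

# Thresholds quoted in the abstract.
SIMILARITY_THRESHOLD = 0.32  # optimal Tanimoto threshold reported for the Avalon fingerprint
POSE_SCORE_THRESHOLD = 0.73  # critical GNINA cnn_pose_score reported for docking
# Assumption: the abstract gives no ANN probability cutoff; 0.5 is a placeholder.
ANN_PROBABILITY_THRESHOLD = 0.5


def tanimoto(bits_a: set[int], bits_b: set[int]) -> float:
    """Tanimoto coefficient of two fingerprints given as sets of on-bit indices."""
    union = len(bits_a | bits_b)
    return len(bits_a & bits_b) / union if union else 0.0


def enrichment_factor(scores: list[float], labels: list[int], fraction: float = 0.01) -> float:
    """Hit rate in the top-ranked fraction divided by the hit rate in the whole set."""
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    n_top = max(1, round(len(ranked) * fraction))
    total_hits = sum(labels)
    if total_hits == 0:
        return 0.0
    hits_top = sum(label for _, label in ranked[:n_top])
    return (hits_top / n_top) / (total_hits / len(ranked))


@dataclass
class Candidate:
    """Hypothetical container for the per-compound scores of the three stages."""
    name: str
    max_similarity: float   # highest Tanimoto coefficient to any known active
    ann_probability: float  # predicted probability of PD-L1 inhibition
    cnn_pose_score: float   # GNINA CNN pose score of the docked pose


def passes_cascade(c: Candidate) -> bool:
    """Apply the similarity, ANN, and docking thresholds in sequence."""
    return (
        c.max_similarity >= SIMILARITY_THRESHOLD
        and c.ann_probability >= ANN_PROBABILITY_THRESHOLD
        and c.cnn_pose_score >= POSE_SCORE_THRESHOLD
    )


# Scores reported in the abstract for the most promising compound.
top_hit = Candidate("top_hit", max_similarity=0.35, ann_probability=0.705, cnn_pose_score=0.79)
```

With the values reported for the top-ranked compound (Tanimoto coefficient 0.35, inhibitory probability 70.5%, cnn_pose_score 0.79), `passes_cascade(top_hit)` is true under these assumed cutoffs. In a real pipeline the fingerprints would come from a cheminformatics toolkit and the pose scores from GNINA output rather than from hand-entered values.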

Supplementary weblinks

PD1-PDL1 GitHub link

The comprehensive source code and research notebook for our study on PD1-PDL1 can be accessed through the following GitHub link. This repository encompasses all the essential resources and materials associated with our research, fostering transparency, reproducibility, and collaboration within the scientific community.

License

The content is available under CC BY NC 4.0

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content
