Artificial intelligence-enabled virtual screening of ultra-large chemical libraries with deep docking
© 2022. The Author(s), under exclusive licence to Springer Nature Limited..
With the recent explosion of chemical libraries beyond a billion molecules, more efficient virtual screening approaches are needed. The Deep Docking (DD) platform enables up to 100-fold acceleration of structure-based virtual screening by docking only a subset of a chemical library, iteratively synchronized with a ligand-based prediction of the remaining docking scores. This method results in hundreds- to thousands-fold virtual hit enrichment (without significant loss of potential drug candidates) and hence enables the screening of billion molecule-sized chemical libraries without using extraordinary computational resources. Herein, we present and discuss the generalized DD protocol that has been proven successful in various computer-aided drug discovery (CADD) campaigns and can be applied in conjunction with any conventional docking program. The protocol encompasses eight consecutive stages: molecular library preparation, receptor preparation, random sampling of a library, ligand preparation, molecular docking, model training, model inference and the residual docking. The standard DD workflow enables iterative application of stages 3-7 with continuous augmentation of the training set, and the number of such iterations can be adjusted by the user. A predefined recall value allows for control of the percentage of top-scoring molecules that are retained by DD and can be adjusted to control the library size reduction. The procedure takes 1-2 weeks (depending on the available resources) and can be completely automated on computing clusters managed by job schedulers. This open-source protocol, at https://github.com/jamesgleave/DD_protocol , can be readily deployed by CADD researchers and can significantly accelerate the effective exploration of ultra-large portions of a chemical space.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2022 |
---|---|
Erschienen: |
2022 |
Enthalten in: |
Zur Gesamtaufnahme - volume:17 |
---|---|
Enthalten in: |
Nature protocols - 17(2022), 3 vom: 01. März, Seite 672-697 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Gentile, Francesco [VerfasserIn] |
---|
Links: |
---|
Themen: |
Journal Article |
---|
Anmerkungen: |
Date Completed 07.04.2022 Date Revised 28.03.2024 published: Print-Electronic Citation Status MEDLINE |
---|
doi: |
10.1038/s41596-021-00659-2 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM336543905 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | NLM336543905 | ||
003 | DE-627 | ||
005 | 20240328235728.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231225s2022 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1038/s41596-021-00659-2 |2 doi | |
028 | 5 | 2 | |a pubmed24n1353.xml |
035 | |a (DE-627)NLM336543905 | ||
035 | |a (NLM)35121854 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Gentile, Francesco |e verfasserin |4 aut | |
245 | 1 | 0 | |a Artificial intelligence-enabled virtual screening of ultra-large chemical libraries with deep docking |
264 | 1 | |c 2022 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 07.04.2022 | ||
500 | |a Date Revised 28.03.2024 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © 2022. The Author(s), under exclusive licence to Springer Nature Limited. | ||
520 | |a With the recent explosion of chemical libraries beyond a billion molecules, more efficient virtual screening approaches are needed. The Deep Docking (DD) platform enables up to 100-fold acceleration of structure-based virtual screening by docking only a subset of a chemical library, iteratively synchronized with a ligand-based prediction of the remaining docking scores. This method results in hundreds- to thousands-fold virtual hit enrichment (without significant loss of potential drug candidates) and hence enables the screening of billion molecule-sized chemical libraries without using extraordinary computational resources. Herein, we present and discuss the generalized DD protocol that has been proven successful in various computer-aided drug discovery (CADD) campaigns and can be applied in conjunction with any conventional docking program. The protocol encompasses eight consecutive stages: molecular library preparation, receptor preparation, random sampling of a library, ligand preparation, molecular docking, model training, model inference and the residual docking. The standard DD workflow enables iterative application of stages 3-7 with continuous augmentation of the training set, and the number of such iterations can be adjusted by the user. A predefined recall value allows for control of the percentage of top-scoring molecules that are retained by DD and can be adjusted to control the library size reduction. The procedure takes 1-2 weeks (depending on the available resources) and can be completely automated on computing clusters managed by job schedulers. This open-source protocol, at https://github.com/jamesgleave/DD_protocol , can be readily deployed by CADD researchers and can significantly accelerate the effective exploration of ultra-large portions of a chemical space | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Research Support, Non-U.S. Gov't | |
650 | 4 | |a Review | |
650 | 7 | |a Ligands |2 NLM | |
650 | 7 | |a Small Molecule Libraries |2 NLM | |
700 | 1 | |a Yaacoub, Jean Charle |e verfasserin |4 aut | |
700 | 1 | |a Gleave, James |e verfasserin |4 aut | |
700 | 1 | |a Fernandez, Michael |e verfasserin |4 aut | |
700 | 1 | |a Ton, Anh-Tien |e verfasserin |4 aut | |
700 | 1 | |a Ban, Fuqiang |e verfasserin |4 aut | |
700 | 1 | |a Stern, Abraham |e verfasserin |4 aut | |
700 | 1 | |a Cherkasov, Artem |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Nature protocols |d 2006 |g 17(2022), 3 vom: 01. März, Seite 672-697 |w (DE-627)NLM167398601 |x 1750-2799 |7 nnns |
773 | 1 | 8 | |g volume:17 |g year:2022 |g number:3 |g day:01 |g month:03 |g pages:672-697 |
856 | 4 | 0 | |u http://dx.doi.org/10.1038/s41596-021-00659-2 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 17 |j 2022 |e 3 |b 01 |c 03 |h 672-697 |