Practical Applications of Deep Learning To Impute Heterogeneous Drug Discovery Data

Contemporary deep learning approaches still struggle to bring a useful improvement in the field of drug discovery because of the challenges of sparse, noisy, and heterogeneous data that are typically encountered in this context. We use a state-of-the-art deep learning method, Alchemite, to impute data from drug discovery projects, including multitarget biochemical activities, phenotypic activities in cell-based assays, and a variety of absorption, distribution, metabolism, and excretion (ADME) endpoints. The resulting model gives excellent predictions for activity and ADME endpoints, offering an average increase in R2 of 0.22 versus quantitative structure-activity relationship methods. The model accuracy is robust to combining data across uncorrelated endpoints and projects with different chemical spaces, enabling a single model to be trained for all compounds and endpoints. We demonstrate improvements in accuracy on the latest chemistry and data when updating models with new data as an ongoing medicinal chemistry project progresses.

Medienart:

E-Artikel

Erscheinungsjahr:

2020

Erschienen:

2020

Enthalten in:

Zur Gesamtaufnahme - volume:60

Enthalten in:

Journal of chemical information and modeling - 60(2020), 6 vom: 22. Juni, Seite 2848-2857

Sprache:

Englisch

Beteiligte Personen:

Irwin, Benedict W J [VerfasserIn]
Levell, Julian R [VerfasserIn]
Whitehead, Thomas M [VerfasserIn]
Segall, Matthew D [VerfasserIn]
Conduit, Gareth J [VerfasserIn]

Links:

Volltext

Themen:

Journal Article

Anmerkungen:

Date Completed 17.06.2021

Date Revised 17.06.2021

published: Print-Electronic

Citation Status MEDLINE

doi:

10.1021/acs.jcim.0c00443

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM310589487