ADMET Predictability at Boehringer Ingelheim : State-of-the-Art, and Do Bigger Datasets or Algorithms Make a Difference?
© 2021 Wiley-VCH GmbH..
Computational methods assisting drug discovery and development are routine in the pharmaceutical industry. Digital recording of ADMET assays has provided a rich source of data for development of predictive models. Despite the accumulation of data and the public availability of advanced modeling algorithms, the utility of prediction in ADMET research is not clear. Here, we present a critical evaluation of the relationships between data volume, modeling algorithm, chemical representation and grouping, and temporal aspect (time sequence of assays) using an in-house ADMET database. We find no large difference in prediction algorithms nor any systemic and substantial gain from increasingly large datasets. Temporal-based data enlargement led to performance improvement in only in a limited number of assays, and with fractional improvement at best. Assays that are well-, intermediately-, or poorly-suited for ADMET predictions and reasons for such behavior are systematically identified, generating realistic expectations for areas in which computational models can be used to guide decision making in molecular design and development.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2022 |
---|---|
Erschienen: |
2022 |
Enthalten in: |
Zur Gesamtaufnahme - volume:41 |
---|---|
Enthalten in: |
Molecular informatics - 41(2022), 2 vom: 02. Feb., Seite e2100113 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Aleksić, Stevan [VerfasserIn] |
---|
Links: |
---|
Themen: |
ADMET modelling |
---|
Anmerkungen: |
Date Completed 02.05.2022 Date Revised 02.05.2022 published: Print-Electronic Citation Status MEDLINE |
---|
doi: |
10.1002/minf.202100113 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM330159208 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM330159208 | ||
003 | DE-627 | ||
005 | 20231225210704.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231225s2022 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1002/minf.202100113 |2 doi | |
028 | 5 | 2 | |a pubmed24n1100.xml |
035 | |a (DE-627)NLM330159208 | ||
035 | |a (NLM)34473408 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Aleksić, Stevan |e verfasserin |4 aut | |
245 | 1 | 0 | |a ADMET Predictability at Boehringer Ingelheim |b State-of-the-Art, and Do Bigger Datasets or Algorithms Make a Difference? |
264 | 1 | |c 2022 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 02.05.2022 | ||
500 | |a Date Revised 02.05.2022 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © 2021 Wiley-VCH GmbH. | ||
520 | |a Computational methods assisting drug discovery and development are routine in the pharmaceutical industry. Digital recording of ADMET assays has provided a rich source of data for development of predictive models. Despite the accumulation of data and the public availability of advanced modeling algorithms, the utility of prediction in ADMET research is not clear. Here, we present a critical evaluation of the relationships between data volume, modeling algorithm, chemical representation and grouping, and temporal aspect (time sequence of assays) using an in-house ADMET database. We find no large difference in prediction algorithms nor any systemic and substantial gain from increasingly large datasets. Temporal-based data enlargement led to performance improvement in only in a limited number of assays, and with fractional improvement at best. Assays that are well-, intermediately-, or poorly-suited for ADMET predictions and reasons for such behavior are systematically identified, generating realistic expectations for areas in which computational models can be used to guide decision making in molecular design and development | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a ADMET modelling | |
650 | 4 | |a algorithm comparison | |
650 | 4 | |a chemical representation | |
650 | 4 | |a congeneric series | |
650 | 4 | |a machine learning | |
700 | 1 | |a Seeliger, Daniel |e verfasserin |4 aut | |
700 | 1 | |a Brown, J B |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Molecular informatics |d 2010 |g 41(2022), 2 vom: 02. Feb., Seite e2100113 |w (DE-627)NLM209791799 |x 1868-1751 |7 nnns |
773 | 1 | 8 | |g volume:41 |g year:2022 |g number:2 |g day:02 |g month:02 |g pages:e2100113 |
856 | 4 | 0 | |u http://dx.doi.org/10.1002/minf.202100113 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 41 |j 2022 |e 2 |b 02 |c 02 |h e2100113 |