ADMET Predictability at Boehringer Ingelheim : State‐of‐the‐Art, and Do Bigger Datasets or Algorithms Make a Difference?
Abstract Computational methods assisting drug discovery and development are routine in the pharmaceutical industry. Digital recording of ADMET assays has provided a rich source of data for development of predictive models. Despite the accumulation of data and the public availability of advanced modeling algorithms, the utility of prediction in ADMET research is not clear. Here, we present a critical evaluation of the relationships between data volume, modeling algorithm, chemical representation and grouping, and temporal aspect (time sequence of assays) using an in‐house ADMET database. We find no large difference in prediction algorithms nor any systemic and substantial gain from increasingly large datasets. Temporal‐based data enlargement led to performance improvement in only in a limited number of assays, and with fractional improvement at best. Assays that are well‐, intermediately‐, or poorly‐suited for ADMET predictions and reasons for such behavior are systematically identified, generating realistic expectations for areas in which computational models can be used to guide decision making in molecular design and development..
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2022 |
---|---|
Erschienen: |
2022 |
Enthalten in: |
Zur Gesamtaufnahme - volume:41 |
---|---|
Enthalten in: |
Molecular Informatics - 41(2022), 2 |
Beteiligte Personen: |
Aleksić, Stevan [VerfasserIn] |
---|
BKL: |
---|
Anmerkungen: |
© 2022 Wiley‐VCH GmbH |
---|
Umfang: |
16 |
---|
doi: |
10.1002/minf.202100113 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
WLY010097325 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | WLY010097325 | ||
003 | DE-627 | ||
005 | 20230307193451.0 | ||
007 | cr uuu---uuuuu | ||
008 | 230218s2022 xx |||||o 00| ||und c | ||
024 | 7 | |a 10.1002/minf.202100113 |2 doi | |
028 | 5 | 2 | |a MINF_MINF202100113.xml |
035 | |a (DE-627)WLY010097325 | ||
035 | |a (WILEY)MINF202100113 | ||
040 | |a DE-627 |b ger |c DE-627 |e rda | ||
082 | 0 | 4 | |a 530 |q ASE |
082 | 0 | 4 | |a 530 |q ASE |
084 | |a 30.00 |2 bkl | ||
084 | |a 33.00 |2 bkl | ||
084 | |a 44.40 |2 bkl | ||
100 | 1 | |a Aleksić, Stevan |e verfasserin |4 aut | |
245 | 1 | 0 | |a ADMET Predictability at Boehringer Ingelheim |b State‐of‐the‐Art, and Do Bigger Datasets or Algorithms Make a Difference? |
264 | 1 | |c 2022 | |
300 | |a 16 | ||
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
500 | |a © 2022 Wiley‐VCH GmbH | ||
520 | |a Abstract Computational methods assisting drug discovery and development are routine in the pharmaceutical industry. Digital recording of ADMET assays has provided a rich source of data for development of predictive models. Despite the accumulation of data and the public availability of advanced modeling algorithms, the utility of prediction in ADMET research is not clear. Here, we present a critical evaluation of the relationships between data volume, modeling algorithm, chemical representation and grouping, and temporal aspect (time sequence of assays) using an in‐house ADMET database. We find no large difference in prediction algorithms nor any systemic and substantial gain from increasingly large datasets. Temporal‐based data enlargement led to performance improvement in only in a limited number of assays, and with fractional improvement at best. Assays that are well‐, intermediately‐, or poorly‐suited for ADMET predictions and reasons for such behavior are systematically identified, generating realistic expectations for areas in which computational models can be used to guide decision making in molecular design and development. | ||
700 | 1 | |a Seeliger, Daniel |4 aut | |
700 | 1 | |a Brown, J. B. |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Molecular Informatics |g 41(2022), 2 |w (DE-627)WLY010095268 |x 18681751 |7 nnns |
773 | 1 | 8 | |g volume:41 |g year:2022 |g number:2 |g extent:16 |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_WLY | ||
912 | |a SSG-OPC-PHA | ||
912 | |a SSG-OPC-ASE | ||
936 | b | k | |a 30.00 |q ASE |
936 | b | k | |a 33.00 |q ASE |
936 | b | k | |a 44.40 |q ASE |
951 | |a AR | ||
952 | |d 41 |j 2022 |e 2 |g 16 |