Outline and background for the EU-OS solubility prediction challenge
Copyright © 2024. Published by Elsevier Inc..
In June 2022, EU-OS came to the decision to make public a solubility data set of 100+K compounds obtained from several of the EU-OS proprietary screening compound collections. Leveraging on the interest of SLAS for screening scientific development it was decided to launch a joint EUOS-SLAS competition within the chemoinformatics and machine learning (ML) communities. The competition was open to real world computation experts, for the best, most predictive, classification model of compound solubility. The aim of the competition was multiple: from a practical side, the winning model should then serve as a cornerstone for future solubility predictions having used the largest training set so far publicly available. From a higher project perspective, the intent was to focus the energies and experiences, even if professionally not precisely coming from Pharma R&D; to address the issue of how to predict compound solubility. Here we report how the competition was ideated and the practical aspects of conducting it within the Kaggle framework, leveraging of the versatility and the open-source nature of this data science platform. Consideration on results and challenges encountered have been also examined.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2024 |
---|---|
Erschienen: |
2024 |
Enthalten in: |
Zur Gesamtaufnahme - volume:29 |
---|---|
Enthalten in: |
SLAS discovery : advancing life sciences R & D - 29(2024), 4 vom: 20. März, Seite 100155 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Wang, Wenyu [VerfasserIn] |
---|
Links: |
---|
Themen: |
---|
Anmerkungen: |
Date Revised 16.04.2024 published: Print-Electronic Citation Status Publisher |
---|
doi: |
10.1016/j.slasd.2024.100155 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM370085280 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | NLM370085280 | ||
003 | DE-627 | ||
005 | 20240417232725.0 | ||
007 | cr uuu---uuuuu | ||
008 | 240324s2024 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1016/j.slasd.2024.100155 |2 doi | |
028 | 5 | 2 | |a pubmed24n1378.xml |
035 | |a (DE-627)NLM370085280 | ||
035 | |a (NLM)38518955 | ||
035 | |a (PII)S2472-5552(24)00017-0 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Wang, Wenyu |e verfasserin |4 aut | |
245 | 1 | 0 | |a Outline and background for the EU-OS solubility prediction challenge |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Revised 16.04.2024 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status Publisher | ||
520 | |a Copyright © 2024. Published by Elsevier Inc. | ||
520 | |a In June 2022, EU-OS came to the decision to make public a solubility data set of 100+K compounds obtained from several of the EU-OS proprietary screening compound collections. Leveraging on the interest of SLAS for screening scientific development it was decided to launch a joint EUOS-SLAS competition within the chemoinformatics and machine learning (ML) communities. The competition was open to real world computation experts, for the best, most predictive, classification model of compound solubility. The aim of the competition was multiple: from a practical side, the winning model should then serve as a cornerstone for future solubility predictions having used the largest training set so far publicly available. From a higher project perspective, the intent was to focus the energies and experiences, even if professionally not precisely coming from Pharma R&D; to address the issue of how to predict compound solubility. Here we report how the competition was ideated and the practical aspects of conducting it within the Kaggle framework, leveraging of the versatility and the open-source nature of this data science platform. Consideration on results and challenges encountered have been also examined | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Review | |
700 | 1 | |a Tang, Jing |e verfasserin |4 aut | |
700 | 1 | |a Zaliani, Andrea |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t SLAS discovery : advancing life sciences R & D |d 2017 |g 29(2024), 4 vom: 20. März, Seite 100155 |w (DE-627)NLM258579609 |x 2472-5560 |7 nnns |
773 | 1 | 8 | |g volume:29 |g year:2024 |g number:4 |g day:20 |g month:03 |g pages:100155 |
856 | 4 | 0 | |u http://dx.doi.org/10.1016/j.slasd.2024.100155 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 29 |j 2024 |e 4 |b 20 |c 03 |h 100155 |