Machine learning and molecular simulation-based protocols to identify novel potential inhibitors for reverse transcriptase against HIV infections
Acquired immunodeficiency syndrome (AIDS) is a potentially fatal condition affecting the human immune system, which is attributed to the human immunodeficiency virus (HIV). The suppression of reverse transcriptase activity is a promising and feasible strategy for the therapeutic management of AIDS. In this study, we employed machine learning algorithms, such as support vector machines (SVM), k-nearest neighbor (k-NN), random forest (RF), and Gaussian naive base (GNB), which are fast and effective tools commonly used in drug design. For model training, we initially obtained a dataset of 5,159 compounds from BindingDB. The models were assessed using tenfold cross-validation to ensure their accuracy and reliability. Among these compounds, 1,645 compounds were labeled as active, having an IC50 below 0.49 µM, while 3,514 compounds were labeled "inactive against reverse transcriptase. Random forest achieved 86% accuracy on the train and test set among the different machine learning algorithms. Random forest model was then applied to an external ZINC dataset. Subsequently, only three hits-ZINC1359750464, ZINC1435357562, and ZINC1545719422-were selected based on the Lipinski Rule, docking score, and good interaction. The stability of these molecules was further evaluated by deploying molecular dynamics simulation and MM/GBSA, which were found to be -38.6013 ± 0.1103 kcal/mol for the Zidovudine/RT complex, -59.1761 ± 2.2926 kcal/mol for the ZINC1359750464/RT complex, -47.6292 ± 2.4206 kcal/mol for the ZINC1435357562/RT complex, and -50.7334 ± 2.5713 kcal/mol for the ZINC1545719422/RT complex.Communicated by Ramaswamy H. Sarma.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2024 |
---|---|
Erschienen: |
2024 |
Enthalten in: |
Zur Gesamtaufnahme - year:2024 |
---|---|
Enthalten in: |
Journal of biomolecular structure & dynamics - (2024) vom: 20. Feb., Seite 1-14 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Shahab, Muhammad [VerfasserIn] |
---|
Links: |
---|
Themen: |
Docking |
---|
Anmerkungen: |
Date Revised 21.02.2024 published: Print-Electronic Citation Status Publisher |
---|
doi: |
10.1080/07391102.2024.2319112 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM36869318X |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM36869318X | ||
003 | DE-627 | ||
005 | 20240222092354.0 | ||
007 | cr uuu---uuuuu | ||
008 | 240222s2024 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1080/07391102.2024.2319112 |2 doi | |
028 | 5 | 2 | |a pubmed24n1301.xml |
035 | |a (DE-627)NLM36869318X | ||
035 | |a (NLM)38379294 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Shahab, Muhammad |e verfasserin |4 aut | |
245 | 1 | 0 | |a Machine learning and molecular simulation-based protocols to identify novel potential inhibitors for reverse transcriptase against HIV infections |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Revised 21.02.2024 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status Publisher | ||
520 | |a Acquired immunodeficiency syndrome (AIDS) is a potentially fatal condition affecting the human immune system, which is attributed to the human immunodeficiency virus (HIV). The suppression of reverse transcriptase activity is a promising and feasible strategy for the therapeutic management of AIDS. In this study, we employed machine learning algorithms, such as support vector machines (SVM), k-nearest neighbor (k-NN), random forest (RF), and Gaussian naive base (GNB), which are fast and effective tools commonly used in drug design. For model training, we initially obtained a dataset of 5,159 compounds from BindingDB. The models were assessed using tenfold cross-validation to ensure their accuracy and reliability. Among these compounds, 1,645 compounds were labeled as active, having an IC50 below 0.49 µM, while 3,514 compounds were labeled "inactive against reverse transcriptase. Random forest achieved 86% accuracy on the train and test set among the different machine learning algorithms. Random forest model was then applied to an external ZINC dataset. Subsequently, only three hits-ZINC1359750464, ZINC1435357562, and ZINC1545719422-were selected based on the Lipinski Rule, docking score, and good interaction. The stability of these molecules was further evaluated by deploying molecular dynamics simulation and MM/GBSA, which were found to be -38.6013 ± 0.1103 kcal/mol for the Zidovudine/RT complex, -59.1761 ± 2.2926 kcal/mol for the ZINC1359750464/RT complex, -47.6292 ± 2.4206 kcal/mol for the ZINC1435357562/RT complex, and -50.7334 ± 2.5713 kcal/mol for the ZINC1545719422/RT complex.Communicated by Ramaswamy H. Sarma | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a MD simulation | |
650 | 4 | |a Machine learning | |
650 | 4 | |a docking | |
650 | 4 | |a reverse transcriptase | |
650 | 4 | |a virtual screening | |
700 | 1 | |a Zheng, Guojun |e verfasserin |4 aut | |
700 | 1 | |a Bin Jardan, Yousef A |e verfasserin |4 aut | |
700 | 1 | |a Bourhia, Mohammed |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Journal of biomolecular structure & dynamics |d 1985 |g (2024) vom: 20. Feb., Seite 1-14 |w (DE-627)NLM012639974 |x 1538-0254 |7 nnns |
773 | 1 | 8 | |g year:2024 |g day:20 |g month:02 |g pages:1-14 |
856 | 4 | 0 | |u http://dx.doi.org/10.1080/07391102.2024.2319112 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |j 2024 |b 20 |c 02 |h 1-14 |