Multiple-model machine learning identifies potential functional genes in dilated cardiomyopathy
Copyright © 2023 Zhang, Lin, Wang, Han, Zhang, Gao, Li, Zhang, Zhou, Yu and Fu..
Introduction: Machine learning (ML) has gained intensive popularity in various fields, such as disease diagnosis in healthcare. However, it has limitation for single algorithm to explore the diagnosing value of dilated cardiomyopathy (DCM). We aim to develop a novel overall normalized sum weight of multiple-model MLs to assess the diagnosing value in DCM.
Methods: Gene expression data were selected from previously published databases (six sets of eligible microarrays, 386 samples) with eligible criteria. Two sets of microarrays were used as training; the others were studied in the testing sets (ratio 5:1). Totally, we identified 20 differently expressed genes (DEGs) between DCM and control individuals (7 upregulated and 13 down-regulated).
Results: We developed six classification ML methods to identify potential candidate genes based on their overall weights. Three genes, serine proteinase inhibitor A3 (SERPINA3), frizzled-related proteins (FRPs) 3 (FRZB), and ficolin 3 (FCN3) were finally identified as the receiver operating characteristic (ROC). Interestingly, we found all three genes correlated considerably with plasma cells. Importantly, not only in training sets but also testing sets, the areas under the curve (AUCs) for SERPINA3, FRZB, and FCN3 were greater than 0.88. The ROC of SERPINA3 was significantly high (0.940 in training and 0.918 in testing sets), indicating it is a potentially functional gene in DCM. Especially, the plasma levels in DCM patients of SERPINA3, FCN, and FRZB were significant compared with healthy control.
Discussion: SERPINA3, FRZB, and FCN3 might be potential diagnosis targets for DCM, Further verification work could be implemented.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2022 |
---|---|
Erschienen: |
2022 |
Enthalten in: |
Zur Gesamtaufnahme - volume:9 |
---|---|
Enthalten in: |
Frontiers in cardiovascular medicine - 9(2022) vom: 16., Seite 1044443 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Zhang, Lin [VerfasserIn] |
---|
Links: |
---|
Themen: |
Diagnosis value |
---|
Anmerkungen: |
Date Revised 02.02.2023 published: Electronic-eCollection Citation Status PubMed-not-MEDLINE |
---|
doi: |
10.3389/fcvm.2022.1044443 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM352226471 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM352226471 | ||
003 | DE-627 | ||
005 | 20231226053232.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2022 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.3389/fcvm.2022.1044443 |2 doi | |
028 | 5 | 2 | |a pubmed24n1174.xml |
035 | |a (DE-627)NLM352226471 | ||
035 | |a (NLM)36712235 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Zhang, Lin |e verfasserin |4 aut | |
245 | 1 | 0 | |a Multiple-model machine learning identifies potential functional genes in dilated cardiomyopathy |
264 | 1 | |c 2022 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Revised 02.02.2023 | ||
500 | |a published: Electronic-eCollection | ||
500 | |a Citation Status PubMed-not-MEDLINE | ||
520 | |a Copyright © 2023 Zhang, Lin, Wang, Han, Zhang, Gao, Li, Zhang, Zhou, Yu and Fu. | ||
520 | |a Introduction: Machine learning (ML) has gained intensive popularity in various fields, such as disease diagnosis in healthcare. However, it has limitation for single algorithm to explore the diagnosing value of dilated cardiomyopathy (DCM). We aim to develop a novel overall normalized sum weight of multiple-model MLs to assess the diagnosing value in DCM | ||
520 | |a Methods: Gene expression data were selected from previously published databases (six sets of eligible microarrays, 386 samples) with eligible criteria. Two sets of microarrays were used as training; the others were studied in the testing sets (ratio 5:1). Totally, we identified 20 differently expressed genes (DEGs) between DCM and control individuals (7 upregulated and 13 down-regulated) | ||
520 | |a Results: We developed six classification ML methods to identify potential candidate genes based on their overall weights. Three genes, serine proteinase inhibitor A3 (SERPINA3), frizzled-related proteins (FRPs) 3 (FRZB), and ficolin 3 (FCN3) were finally identified as the receiver operating characteristic (ROC). Interestingly, we found all three genes correlated considerably with plasma cells. Importantly, not only in training sets but also testing sets, the areas under the curve (AUCs) for SERPINA3, FRZB, and FCN3 were greater than 0.88. The ROC of SERPINA3 was significantly high (0.940 in training and 0.918 in testing sets), indicating it is a potentially functional gene in DCM. Especially, the plasma levels in DCM patients of SERPINA3, FCN, and FRZB were significant compared with healthy control | ||
520 | |a Discussion: SERPINA3, FRZB, and FCN3 might be potential diagnosis targets for DCM, Further verification work could be implemented | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a FCN3 | |
650 | 4 | |a FRZB | |
650 | 4 | |a SERPINA3 | |
650 | 4 | |a diagnosis value | |
650 | 4 | |a dilated cardiomyopathy | |
650 | 4 | |a machine learning | |
700 | 1 | |a Lin, Yexiang |e verfasserin |4 aut | |
700 | 1 | |a Wang, Kaiyue |e verfasserin |4 aut | |
700 | 1 | |a Han, Lifeng |e verfasserin |4 aut | |
700 | 1 | |a Zhang, Xue |e verfasserin |4 aut | |
700 | 1 | |a Gao, Xiumei |e verfasserin |4 aut | |
700 | 1 | |a Li, Zheng |e verfasserin |4 aut | |
700 | 1 | |a Zhang, Houliang |e verfasserin |4 aut | |
700 | 1 | |a Zhou, Jiashun |e verfasserin |4 aut | |
700 | 1 | |a Yu, Heshui |e verfasserin |4 aut | |
700 | 1 | |a Fu, Xuebin |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Frontiers in cardiovascular medicine |d 2014 |g 9(2022) vom: 16., Seite 1044443 |w (DE-627)NLM255534736 |x 2297-055X |7 nnns |
773 | 1 | 8 | |g volume:9 |g year:2022 |g day:16 |g pages:1044443 |
856 | 4 | 0 | |u http://dx.doi.org/10.3389/fcvm.2022.1044443 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 9 |j 2022 |b 16 |h 1044443 |