Novel multiclass classification machine learning approach for the early-stage classification of systemic autoimmune rheumatic diseases
© Author(s) (or their employer(s)) 2024. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ..
OBJECTIVE: Systemic autoimmune rheumatic diseases (SARDs) encompass a diverse group of complex conditions with overlapping clinical features, making accurate diagnosis challenging. This study aims to develop a multiclass machine learning (ML) model for early-stage SARDs classification using accessible laboratory indicators.
METHODS: A total of 925 SARDs patients were included, categorised into SLE, Sjögren's syndrome (SS) and inflammatory myositis (IM). Clinical characteristics and laboratory markers were collected and nine key indicators, including anti-dsDNA, anti-SS-A60, anti-Sm/nRNP, antichromatin, anti-dsDNA (indirect immunofluorescence assay), haemoglobin (Hb), platelet, neutrophil percentage and cytoplasmic patterns (AC-19, AC-20), were selected for model building. Various ML algorithms were used to construct a tripartite classification ML model.
RESULTS: Patients were divided into two cohorts, cohort 1 was used to construct a tripartite classification model. Among models assessed, the random forest (RF) model demonstrated superior performance in distinguishing SLE, IM and SS (with area under curve=0.953, 0.903 and 0.836; accuracy= 0.892, 0.869 and 0.857; sensitivity= 0.890, 0.868 and 0.795; specificity= 0.910, 0.836 and 0.748; positive predictive value=0.922, 0.727 and 0.663; and negative predictive value= 0.854, 0.915 and 0.879). The RF model excelled in classifying SLE (precision=0.930, recall=0.985, F1 score=0.957). For IM and SS, RF model outcomes were (precision=0.793, 0.950; recall=0.920, 0.679; F1 score=0.852, 0.792). Cohort 2 served as an external validation set, achieving an overall accuracy of 87.3%. Individual classification performances for SLE, SS and IM were excellent, with precision, recall and F1 scores specified. SHAP analysis highlighted significant contributions from antibody profiles.
CONCLUSION: This pioneering multiclass ML model, using basic laboratory indicators, enhances clinical feasibility and demonstrates promising potential for SARDs classification. The collaboration of clinical expertise and ML offers a nuanced approach to SARDs classification, with potential for enhanced patient care.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2024 |
---|---|
Erschienen: |
2024 |
Enthalten in: |
Zur Gesamtaufnahme - volume:11 |
---|---|
Enthalten in: |
Lupus science & medicine - 11(2024), 1 vom: 31. Jan. |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Wang, Yun [VerfasserIn] |
---|
Links: |
---|
Themen: |
Antibodies, Antinuclear |
---|
Anmerkungen: |
Date Completed 05.02.2024 Date Revised 05.02.2024 published: Electronic Citation Status MEDLINE |
---|
doi: |
10.1136/lupus-2023-001125 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM36791493X |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | NLM36791493X | ||
003 | DE-627 | ||
005 | 20240205232111.0 | ||
007 | cr uuu---uuuuu | ||
008 | 240202s2024 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1136/lupus-2023-001125 |2 doi | |
028 | 5 | 2 | |a pubmed24n1281.xml |
035 | |a (DE-627)NLM36791493X | ||
035 | |a (NLM)38302133 | ||
035 | |a (PII)e001125 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Wang, Yun |e verfasserin |4 aut | |
245 | 1 | 0 | |a Novel multiclass classification machine learning approach for the early-stage classification of systemic autoimmune rheumatic diseases |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 05.02.2024 | ||
500 | |a Date Revised 05.02.2024 | ||
500 | |a published: Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © Author(s) (or their employer(s)) 2024. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. | ||
520 | |a OBJECTIVE: Systemic autoimmune rheumatic diseases (SARDs) encompass a diverse group of complex conditions with overlapping clinical features, making accurate diagnosis challenging. This study aims to develop a multiclass machine learning (ML) model for early-stage SARDs classification using accessible laboratory indicators | ||
520 | |a METHODS: A total of 925 SARDs patients were included, categorised into SLE, Sjögren's syndrome (SS) and inflammatory myositis (IM). Clinical characteristics and laboratory markers were collected and nine key indicators, including anti-dsDNA, anti-SS-A60, anti-Sm/nRNP, antichromatin, anti-dsDNA (indirect immunofluorescence assay), haemoglobin (Hb), platelet, neutrophil percentage and cytoplasmic patterns (AC-19, AC-20), were selected for model building. Various ML algorithms were used to construct a tripartite classification ML model | ||
520 | |a RESULTS: Patients were divided into two cohorts, cohort 1 was used to construct a tripartite classification model. Among models assessed, the random forest (RF) model demonstrated superior performance in distinguishing SLE, IM and SS (with area under curve=0.953, 0.903 and 0.836; accuracy= 0.892, 0.869 and 0.857; sensitivity= 0.890, 0.868 and 0.795; specificity= 0.910, 0.836 and 0.748; positive predictive value=0.922, 0.727 and 0.663; and negative predictive value= 0.854, 0.915 and 0.879). The RF model excelled in classifying SLE (precision=0.930, recall=0.985, F1 score=0.957). For IM and SS, RF model outcomes were (precision=0.793, 0.950; recall=0.920, 0.679; F1 score=0.852, 0.792). Cohort 2 served as an external validation set, achieving an overall accuracy of 87.3%. Individual classification performances for SLE, SS and IM were excellent, with precision, recall and F1 scores specified. SHAP analysis highlighted significant contributions from antibody profiles | ||
520 | |a CONCLUSION: This pioneering multiclass ML model, using basic laboratory indicators, enhances clinical feasibility and demonstrates promising potential for SARDs classification. The collaboration of clinical expertise and ML offers a nuanced approach to SARDs classification, with potential for enhanced patient care | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Autoimmune Diseases | |
650 | 4 | |a Autoimmunity | |
650 | 4 | |a Lupus Erythematosus, Systemic | |
650 | 7 | |a Antibodies, Antinuclear |2 NLM | |
700 | 1 | |a Wei, Wei |e verfasserin |4 aut | |
700 | 1 | |a Ouyang, Renren |e verfasserin |4 aut | |
700 | 1 | |a Chen, Rujia |e verfasserin |4 aut | |
700 | 1 | |a Wang, Ting |e verfasserin |4 aut | |
700 | 1 | |a Yuan, Xu |e verfasserin |4 aut | |
700 | 1 | |a Wang, Feng |e verfasserin |4 aut | |
700 | 1 | |a Hou, Hongyan |e verfasserin |4 aut | |
700 | 1 | |a Wu, Shiji |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Lupus science & medicine |d 2014 |g 11(2024), 1 vom: 31. Jan. |w (DE-627)NLM243194145 |x 2053-8790 |7 nnns |
773 | 1 | 8 | |g volume:11 |g year:2024 |g number:1 |g day:31 |g month:01 |
856 | 4 | 0 | |u http://dx.doi.org/10.1136/lupus-2023-001125 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 11 |j 2024 |e 1 |b 31 |c 01 |