Development and validation of algorithms to build an electronic health record based cohort of patients with systemic sclerosis
Copyright: © 2023 Tukpah et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited..
OBJECTIVES: To evaluate methods of identifying patients with systemic sclerosis (SSc) using International Classification of Diseases, Tenth Revision (ICD-10) codes (M34*), electronic health record (EHR) databases and organ involvement keywords, that result in a validated cohort comprised of true cases with high disease burden.
METHODS: We retrospectively studied patients in a healthcare system likely to have SSc. Using structured EHR data from January 2016 to June 2021, we identified 955 adult patients with M34* documented 2 or more times during the study period. A random subset of 100 patients was selected to validate the ICD-10 code for its positive predictive value (PPV). The dataset was then divided into a training and validation sets for unstructured text processing (UTP) search algorithms, two of which were created using keywords for Raynaud's syndrome, and esophageal involvement/symptoms.
RESULTS: Among 955 patients, the average age was 60. Most patients (84%) were female; 75% of patients were White, and 5.2% were Black. There were approximately 175 patients per year with the code newly documented, overall 24% had an ICD-10 code for esophageal disease, and 13.4% for pulmonary hypertension. The baseline PPV was 78%, which improved to 84% with UTP, identifying 788 patients likely to have SSc. After the ICD-10 code was placed, 63% of patients had a rheumatology office visit. Patients identified by the UTP search algorithm were more likely to have increased healthcare utilization (ICD-10 codes 4 or more times 84.1% vs 61.7%, p < .001), organ involvement (pulmonary hypertension 12.7% vs 6% p = .011) and medication use (mycophenolate use 28.7% vs 11.4%, p < .001) than those identified by the ICD codes alone.
CONCLUSION: EHRs can be used to identify patients with SSc. Using unstructured text processing keyword searches for SSc clinical manifestations improved the PPV of ICD-10 codes alone and identified a group of patients most likely to have SSc and increased healthcare needs.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2023 |
---|---|
Erschienen: |
2023 |
Enthalten in: |
Zur Gesamtaufnahme - volume:18 |
---|---|
Enthalten in: |
PloS one - 18(2023), 4 vom: 14., Seite e0283775 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Tukpah, Ann-Marcia C [VerfasserIn] |
---|
Links: |
---|
Themen: |
Journal Article |
---|
Anmerkungen: |
Date Completed 17.04.2023 Date Revised 18.04.2023 published: Electronic-eCollection Citation Status MEDLINE |
---|
doi: |
10.1371/journal.pone.0283775 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM355576430 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM355576430 | ||
003 | DE-627 | ||
005 | 20231226064635.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2023 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1371/journal.pone.0283775 |2 doi | |
028 | 5 | 2 | |a pubmed24n1185.xml |
035 | |a (DE-627)NLM355576430 | ||
035 | |a (NLM)37053291 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Tukpah, Ann-Marcia C |e verfasserin |4 aut | |
245 | 1 | 0 | |a Development and validation of algorithms to build an electronic health record based cohort of patients with systemic sclerosis |
264 | 1 | |c 2023 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 17.04.2023 | ||
500 | |a Date Revised 18.04.2023 | ||
500 | |a published: Electronic-eCollection | ||
500 | |a Citation Status MEDLINE | ||
520 | |a Copyright: © 2023 Tukpah et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. | ||
520 | |a OBJECTIVES: To evaluate methods of identifying patients with systemic sclerosis (SSc) using International Classification of Diseases, Tenth Revision (ICD-10) codes (M34*), electronic health record (EHR) databases and organ involvement keywords, that result in a validated cohort comprised of true cases with high disease burden | ||
520 | |a METHODS: We retrospectively studied patients in a healthcare system likely to have SSc. Using structured EHR data from January 2016 to June 2021, we identified 955 adult patients with M34* documented 2 or more times during the study period. A random subset of 100 patients was selected to validate the ICD-10 code for its positive predictive value (PPV). The dataset was then divided into a training and validation sets for unstructured text processing (UTP) search algorithms, two of which were created using keywords for Raynaud's syndrome, and esophageal involvement/symptoms | ||
520 | |a RESULTS: Among 955 patients, the average age was 60. Most patients (84%) were female; 75% of patients were White, and 5.2% were Black. There were approximately 175 patients per year with the code newly documented, overall 24% had an ICD-10 code for esophageal disease, and 13.4% for pulmonary hypertension. The baseline PPV was 78%, which improved to 84% with UTP, identifying 788 patients likely to have SSc. After the ICD-10 code was placed, 63% of patients had a rheumatology office visit. Patients identified by the UTP search algorithm were more likely to have increased healthcare utilization (ICD-10 codes 4 or more times 84.1% vs 61.7%, p < .001), organ involvement (pulmonary hypertension 12.7% vs 6% p = .011) and medication use (mycophenolate use 28.7% vs 11.4%, p < .001) than those identified by the ICD codes alone | ||
520 | |a CONCLUSION: EHRs can be used to identify patients with SSc. Using unstructured text processing keyword searches for SSc clinical manifestations improved the PPV of ICD-10 codes alone and identified a group of patients most likely to have SSc and increased healthcare needs | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Research Support, N.I.H., Extramural | |
650 | 7 | |a Uridine Triphosphate |2 NLM | |
650 | 7 | |a UT0S826Z60 |2 NLM | |
700 | 1 | |a Rose, Jonathan A |e verfasserin |4 aut | |
700 | 1 | |a Seger, Diane L |e verfasserin |4 aut | |
700 | 1 | |a Dellaripa, Paul F |e verfasserin |4 aut | |
700 | 1 | |a Hunninghake, Gary M |e verfasserin |4 aut | |
700 | 1 | |a Bates, David W |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t PloS one |d 2006 |g 18(2023), 4 vom: 14., Seite e0283775 |w (DE-627)NLM167327399 |x 1932-6203 |7 nnns |
773 | 1 | 8 | |g volume:18 |g year:2023 |g number:4 |g day:14 |g pages:e0283775 |
856 | 4 | 0 | |u http://dx.doi.org/10.1371/journal.pone.0283775 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 18 |j 2023 |e 4 |b 14 |h e0283775 |