Develop and Validate a Computable Phenotype for the Identification of Alzheimer’s Disease Patients Using Electronic Health Record Data
ABSTRACT INTRODUCTION Alzheimer’s Disease (AD) are often misclassified in electronic health records (EHRs) when relying solely on diagnostic codes. This study aims to develop a more accurate, computable phenotype (CP) for identifying AD patients by using both structured and unstructured EHR data.METHODS We used EHRs from the University of Florida Health (UF Health) system and created rule-based CPs iteratively through manual chart reviews. The CPs were then validated using data from the University of Texas Health Science Center at Houston (UT Health) and the University of Minnesota (UMN).RESULTS Our best-performing CP is “patient has at least 2 AD diagnoses and AD-related keywords” with an F1-score of 0.817 at UF, and 0.961 and 0.623 at UT Health and UMN, respectively.DISCUSSION We developed and validated rule-based CPs for AD identification with good performance, crucial for studies that aim to use real-world data like EHRs..
Medienart: |
Preprint |
---|
Erscheinungsjahr: |
2024 |
---|---|
Erschienen: |
2024 |
Enthalten in: |
bioRxiv.org - (2024) vom: 08. Feb. Zur Gesamtaufnahme - year:2024 |
---|
Sprache: |
Englisch |
---|
Beteiligte Personen: |
He, Xing [VerfasserIn] |
---|
Links: |
Volltext [kostenfrei] |
---|
Themen: |
---|
doi: |
10.1101/2024.02.06.24302389 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
XBI042419565 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | XBI042419565 | ||
003 | DE-627 | ||
005 | 20240209090405.0 | ||
007 | cr uuu---uuuuu | ||
008 | 240207s2024 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1101/2024.02.06.24302389 |2 doi | |
035 | |a (DE-627)XBI042419565 | ||
035 | |a (biorXiv)10.1101/2024.02.06.24302389 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a He, Xing |e verfasserin |0 (orcid)0000-0003-0290-8058 |4 aut | |
245 | 1 | 0 | |a Develop and Validate a Computable Phenotype for the Identification of Alzheimer’s Disease Patients Using Electronic Health Record Data |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
520 | |a ABSTRACT INTRODUCTION Alzheimer’s Disease (AD) are often misclassified in electronic health records (EHRs) when relying solely on diagnostic codes. This study aims to develop a more accurate, computable phenotype (CP) for identifying AD patients by using both structured and unstructured EHR data.METHODS We used EHRs from the University of Florida Health (UF Health) system and created rule-based CPs iteratively through manual chart reviews. The CPs were then validated using data from the University of Texas Health Science Center at Houston (UT Health) and the University of Minnesota (UMN).RESULTS Our best-performing CP is “patient has at least 2 AD diagnoses and AD-related keywords” with an F1-score of 0.817 at UF, and 0.961 and 0.623 at UT Health and UMN, respectively.DISCUSSION We developed and validated rule-based CPs for AD identification with good performance, crucial for studies that aim to use real-world data like EHRs. | ||
650 | 4 | |a Biology |7 (dpeaa)DE-84 | |
650 | 4 | |a 570 |7 (dpeaa)DE-84 | |
700 | 1 | |a Wei, Ruoqi |4 aut | |
700 | 1 | |a Huang, Yu |4 aut | |
700 | 1 | |a Chen, Zhaoyi |4 aut | |
700 | 1 | |a Lyu, Tianchen |4 aut | |
700 | 1 | |a Bost, Sarah |4 aut | |
700 | 1 | |a Tong, Jiayi |4 aut | |
700 | 1 | |a Li, Lu |4 aut | |
700 | 1 | |a Zhou, Yujia |4 aut | |
700 | 1 | |a Guo, Jingchuan |4 aut | |
700 | 1 | |a Tang, Huilin |4 aut | |
700 | 1 | |a Wang, Fei |4 aut | |
700 | 1 | |a DeKosky, Steven |4 aut | |
700 | 1 | |a Xu, Hua |4 aut | |
700 | 1 | |a Chen, Yong |4 aut | |
700 | 1 | |a Zhang, Rui |0 (orcid)0000-0001-8258-3585 |4 aut | |
700 | 1 | |a Xu, Jie |4 aut | |
700 | 1 | |a Guo, Yi |4 aut | |
700 | 1 | |a Wu, Yonghui |4 aut | |
700 | 1 | |a Bian, Jiang |0 (orcid)0000-0002-2238-5429 |4 aut | |
773 | 0 | 8 | |i Enthalten in |t bioRxiv.org |g (2024) vom: 08. Feb. |
773 | 1 | 8 | |g year:2024 |g day:08 |g month:02 |
856 | 4 | 0 | |u http://dx.doi.org/10.1101/2024.02.06.24302389 |z kostenfrei |3 Volltext |
912 | |a GBV_XBI | ||
951 | |a AR | ||
952 | |j 2024 |b 08 |c 02 |