Publication details - Validation of deep learning natural language processing algorithm for keyword extraction from pathology reports in electronic health records

Validation of deep learning natural language processing algorithm for keyword extraction from pathology reports in electronic health records

Pathology reports contain essential data for both clinical and research purposes. However, extracting meaningful, qualitative data from the original document is difficult because such reports are narrative and complex. Keyword extraction is needed to summarize the informative text in pathology reports and to reduce the considerable time otherwise required. In this study, we employed a deep learning model for natural language processing to extract keywords from pathology reports and presented a supervised keyword extraction algorithm. We considered three types of pathological keywords, namely specimen, procedure, and pathology types. We compared the performance of this algorithm with conventional keyword extraction methods on 3,115 pathology reports that were manually labeled by professional pathologists. Additionally, we applied the algorithm to 36,014 unlabeled pathology reports and analyzed the extracted keywords with biomedical vocabulary sets. The results demonstrated the suitability of our model for practical application in extracting important data from pathology reports.

Media type:	E-article

Year of publication:	2020
Published:	2020

Contained in:	Parent record - volume:10
Contained in:	Scientific reports - 10(2020), 1, 20 Nov., page 20265

Language:	English

Contributors:	Kim, Yoojoong [author]; Lee, Jeong Hyeon [author]; Choi, Sunho [author]; Lee, Jeong Moon [author]; Kim, Jong-Ho [author]; Seok, Junhee [author]; Joo, Hyung Joon [author]

Links:	Full text

Subjects:	Journal Article; Research Support, Non-U.S. Gov't; Validation Study

Notes:	Date Completed 14.04.2021; Date Revised 14.04.2021; published: Electronic; Citation Status MEDLINE

doi:	10.1038/s41598-020-77258-w

PPN (catalog ID):	NLM317863118

Internal format


LEADER	01000caa a22002652 4500
001	NLM317863118
003	DE-627
005	20231226202231.0
007	cr uuu---uuuuu
008	231225s2020 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.1038/s41598-020-77258-w \|2 doi
028	5	2	\|a pubmed24n1059.xml
035			\|a (DE-627)NLM317863118
035			\|a (NLM)33219276
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Kim, Yoojoong \|e verfasserin \|4 aut
245	1	0	\|a Validation of deep learning natural language processing algorithm for keyword extraction from pathology reports in electronic health records
264		1	\|c 2020
336			\|a Text \|b txt \|2 rdacontent
337			\|a Computermedien \|b c \|2 rdamedia
338			\|a Online-Ressource \|b cr \|2 rdacarrier
500			\|a Date Completed 14.04.2021
500			\|a Date Revised 14.04.2021
500			\|a published: Electronic
500			\|a Citation Status MEDLINE
520			\|a Pathology reports contain essential data for both clinical and research purposes. However, extracting meaningful, qualitative data from the original document is difficult because such reports are narrative and complex. Keyword extraction is needed to summarize the informative text in pathology reports and to reduce the considerable time otherwise required. In this study, we employed a deep learning model for natural language processing to extract keywords from pathology reports and presented a supervised keyword extraction algorithm. We considered three types of pathological keywords, namely specimen, procedure, and pathology types. We compared the performance of this algorithm with conventional keyword extraction methods on 3,115 pathology reports that were manually labeled by professional pathologists. Additionally, we applied the algorithm to 36,014 unlabeled pathology reports and analyzed the extracted keywords with biomedical vocabulary sets. The results demonstrated the suitability of our model for practical application in extracting important data from pathology reports.
650		4	\|a Journal Article
650		4	\|a Research Support, Non-U.S. Gov't
650		4	\|a Validation Study
700	1		\|a Lee, Jeong Hyeon \|e verfasserin \|4 aut
700	1		\|a Choi, Sunho \|e verfasserin \|4 aut
700	1		\|a Lee, Jeong Moon \|e verfasserin \|4 aut
700	1		\|a Kim, Jong-Ho \|e verfasserin \|4 aut
700	1		\|a Seok, Junhee \|e verfasserin \|4 aut
700	1		\|a Joo, Hyung Joon \|e verfasserin \|4 aut
773	0	8	\|i Enthalten in \|t Scientific reports \|d 2011 \|g 10(2020), 1 vom: 20. Nov., Seite 20265 \|w (DE-627)NLM215703936 \|x 2045-2322 \|7 nnns
773	1	8	\|g volume:10 \|g year:2020 \|g number:1 \|g day:20 \|g month:11 \|g pages:20265
856	4	0	\|u http://dx.doi.org/10.1038/s41598-020-77258-w \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a GBV_NLM
951			\|a AR
952			\|d 10 \|j 2020 \|e 1 \|b 20 \|c 11 \|h 20265
