A dataset of simulated patient-physician medical interviews with a focus on respiratory cases
© 2022. The Author(s)..
Artificial Intelligence (AI) is playing a major role in medical education, diagnosis, and outbreak detection through Natural Language Processing (NLP), machine learning models and deep learning tools. However, in order to train AI to facilitate these medical fields, well-documented and accurate medical conversations are needed. The dataset presented covers a series of medical conversations in the format of Objective Structured Clinical Examinations (OSCE), with a focus on respiratory cases in audio format and corresponding text documents. These cases were simulated, recorded, transcribed, and manually corrected with the underlying aim of providing a comprehensive set of medical conversation data to the academic and industry community. Potential applications include speech recognition detection for speech-to-text errors, training NLP models to extract symptoms, detecting diseases, or for educational purposes, including training an avatar to converse with healthcare professional students as a standardized patient during clinical examinations. The application opportunities for the presented dataset are vast, given that this calibre of data is difficult to access and costly to develop.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2022 |
---|---|
Erschienen: |
2022 |
Enthalten in: |
Zur Gesamtaufnahme - volume:9 |
---|---|
Enthalten in: |
Scientific data - 9(2022), 1 vom: 16. Juni, Seite 313 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Fareez, Faiha [VerfasserIn] |
---|
Links: |
---|
Themen: |
---|
Anmerkungen: |
Date Completed 20.06.2022 Date Revised 30.05.2023 published: Electronic Citation Status MEDLINE |
---|
doi: |
10.1038/s41597-022-01423-1 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM342321218 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM342321218 | ||
003 | DE-627 | ||
005 | 20231226013836.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2022 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1038/s41597-022-01423-1 |2 doi | |
028 | 5 | 2 | |a pubmed24n1141.xml |
035 | |a (DE-627)NLM342321218 | ||
035 | |a (NLM)35710769 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Fareez, Faiha |e verfasserin |4 aut | |
245 | 1 | 2 | |a A dataset of simulated patient-physician medical interviews with a focus on respiratory cases |
264 | 1 | |c 2022 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 20.06.2022 | ||
500 | |a Date Revised 30.05.2023 | ||
500 | |a published: Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © 2022. The Author(s). | ||
520 | |a Artificial Intelligence (AI) is playing a major role in medical education, diagnosis, and outbreak detection through Natural Language Processing (NLP), machine learning models and deep learning tools. However, in order to train AI to facilitate these medical fields, well-documented and accurate medical conversations are needed. The dataset presented covers a series of medical conversations in the format of Objective Structured Clinical Examinations (OSCE), with a focus on respiratory cases in audio format and corresponding text documents. These cases were simulated, recorded, transcribed, and manually corrected with the underlying aim of providing a comprehensive set of medical conversation data to the academic and industry community. Potential applications include speech recognition detection for speech-to-text errors, training NLP models to extract symptoms, detecting diseases, or for educational purposes, including training an avatar to converse with healthcare professional students as a standardized patient during clinical examinations. The application opportunities for the presented dataset are vast, given that this calibre of data is difficult to access and costly to develop | ||
650 | 4 | |a Dataset | |
650 | 4 | |a Journal Article | |
700 | 1 | |a Parikh, Tishya |e verfasserin |4 aut | |
700 | 1 | |a Wavell, Christopher |e verfasserin |4 aut | |
700 | 1 | |a Shahab, Saba |e verfasserin |4 aut | |
700 | 1 | |a Chevalier, Meghan |e verfasserin |4 aut | |
700 | 1 | |a Good, Scott |e verfasserin |4 aut | |
700 | 1 | |a De Blasi, Isabella |e verfasserin |4 aut | |
700 | 1 | |a Rhouma, Rafik |e verfasserin |4 aut | |
700 | 1 | |a McMahon, Christopher |e verfasserin |4 aut | |
700 | 1 | |a Lam, Jean-Paul |e verfasserin |4 aut | |
700 | 1 | |a Lo, Thomas |e verfasserin |4 aut | |
700 | 1 | |a Smith, Christopher W |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Scientific data |d 2014 |g 9(2022), 1 vom: 16. Juni, Seite 313 |w (DE-627)NLM245762450 |x 2052-4463 |7 nnns |
773 | 1 | 8 | |g volume:9 |g year:2022 |g number:1 |g day:16 |g month:06 |g pages:313 |
856 | 4 | 0 | |u http://dx.doi.org/10.1038/s41597-022-01423-1 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 9 |j 2022 |e 1 |b 16 |c 06 |h 313 |