A comparative vignette study : Evaluating the potential role of a generative AI model in enhancing clinical decision-making in nursing
© 2024 The Authors. Journal of Advanced Nursing published by John Wiley & Sons Ltd..
AIM: This study explores the potential of a generative artificial intelligence tool (ChatGPT) as clinical support for nurses. Specifically, we aim to assess whether ChatGPT can demonstrate clinical decision-making equivalent to that of expert nurses and novice nursing students. This will be evaluated by comparing ChatGPT responses to clinical scenarios to those of nurses on different levels of experience.
DESIGN: This is a cross-sectional study.
METHODS: Emergency room registered nurses (i.e. experts; n = 30) and nursing students (i.e. novices; n = 38) were recruited during March-April 2023. Clinical decision-making was measured using three validated clinical scenarios involving an initial assessment and reevaluation. Clinical decision-making aspects assessed were the accuracy of initial assessments, the appropriateness of recommended tests and resource use and the capacity to reevaluate decisions. Performance was also compared by timing response generations and word counts. Expert nurses and novice students completed online questionnaires (via Qualtrics), while ChatGPT responses were obtained from OpenAI.
RESULTS: Concerning aspects of clinical decision-making and compared to novices and experts: (1) ChatGPT exhibited indecisiveness in initial assessments; (2) ChatGPT tended to suggest unnecessary diagnostic tests; (3) When new information required re-evaluation, ChatGPT responses demonstrated inaccurate understanding and inappropriate modifications. In terms of performance, the mean number of words utilized in ChatGPT answers was 27-41 times greater than that utilized by both experts and novices; and responses were provided approximately 4 times faster than those of novices and twice faster than expert nurses. ChatGPT responses maintained logical structure and clarity.
CONCLUSIONS: A generative AI tool demonstrated indecisiveness and a tendency towards over-triage compared to human clinicians.
IMPACT: The study shows that it is important to approach the implementation of ChatGPT as a nurse's digital assistant with caution. More study is needed to optimize the model's training and algorithms to provide accurate healthcare support that aids clinical decision-making.
REPORTING METHOD: This study adhered to relevant EQUATOR guidelines for reporting observational studies.
PATIENT OR PUBLIC CONTRIBUTION: Patients were not directly involved in the conduct of this study.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2024 |
---|---|
Erschienen: |
2024 |
Enthalten in: |
Zur Gesamtaufnahme - year:2024 |
---|---|
Enthalten in: |
Journal of advanced nursing - (2024) vom: 17. Feb. |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Saban, Mor [VerfasserIn] |
---|
Links: |
---|
Themen: |
ChatGPT |
---|
Anmerkungen: |
Date Revised 07.03.2024 published: Print-Electronic Citation Status Publisher |
---|
doi: |
10.1111/jan.16101 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM368567532 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | NLM368567532 | ||
003 | DE-627 | ||
005 | 20240308232540.0 | ||
007 | cr uuu---uuuuu | ||
008 | 240217s2024 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1111/jan.16101 |2 doi | |
028 | 5 | 2 | |a pubmed24n1320.xml |
035 | |a (DE-627)NLM368567532 | ||
035 | |a (NLM)38366690 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Saban, Mor |e verfasserin |4 aut | |
245 | 1 | 2 | |a A comparative vignette study |b Evaluating the potential role of a generative AI model in enhancing clinical decision-making in nursing |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Revised 07.03.2024 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status Publisher | ||
520 | |a © 2024 The Authors. Journal of Advanced Nursing published by John Wiley & Sons Ltd. | ||
520 | |a AIM: This study explores the potential of a generative artificial intelligence tool (ChatGPT) as clinical support for nurses. Specifically, we aim to assess whether ChatGPT can demonstrate clinical decision-making equivalent to that of expert nurses and novice nursing students. This will be evaluated by comparing ChatGPT responses to clinical scenarios to those of nurses on different levels of experience | ||
520 | |a DESIGN: This is a cross-sectional study | ||
520 | |a METHODS: Emergency room registered nurses (i.e. experts; n = 30) and nursing students (i.e. novices; n = 38) were recruited during March-April 2023. Clinical decision-making was measured using three validated clinical scenarios involving an initial assessment and reevaluation. Clinical decision-making aspects assessed were the accuracy of initial assessments, the appropriateness of recommended tests and resource use and the capacity to reevaluate decisions. Performance was also compared by timing response generations and word counts. Expert nurses and novice students completed online questionnaires (via Qualtrics), while ChatGPT responses were obtained from OpenAI | ||
520 | |a RESULTS: Concerning aspects of clinical decision-making and compared to novices and experts: (1) ChatGPT exhibited indecisiveness in initial assessments; (2) ChatGPT tended to suggest unnecessary diagnostic tests; (3) When new information required re-evaluation, ChatGPT responses demonstrated inaccurate understanding and inappropriate modifications. In terms of performance, the mean number of words utilized in ChatGPT answers was 27-41 times greater than that utilized by both experts and novices; and responses were provided approximately 4 times faster than those of novices and twice faster than expert nurses. ChatGPT responses maintained logical structure and clarity | ||
520 | |a CONCLUSIONS: A generative AI tool demonstrated indecisiveness and a tendency towards over-triage compared to human clinicians | ||
520 | |a IMPACT: The study shows that it is important to approach the implementation of ChatGPT as a nurse's digital assistant with caution. More study is needed to optimize the model's training and algorithms to provide accurate healthcare support that aids clinical decision-making | ||
520 | |a REPORTING METHOD: This study adhered to relevant EQUATOR guidelines for reporting observational studies | ||
520 | |a PATIENT OR PUBLIC CONTRIBUTION: Patients were not directly involved in the conduct of this study | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a ChatGPT | |
650 | 4 | |a Clinical decision-making | |
650 | 4 | |a Generative AI | |
650 | 4 | |a Novice | |
650 | 4 | |a Nurse education | |
650 | 4 | |a Nurses | |
700 | 1 | |a Dubovi, Ilana |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Journal of advanced nursing |d 1980 |g (2024) vom: 17. Feb. |w (DE-627)NLM000357901 |x 1365-2648 |7 nnns |
773 | 1 | 8 | |g year:2024 |g day:17 |g month:02 |
856 | 4 | 0 | |u http://dx.doi.org/10.1111/jan.16101 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |j 2024 |b 17 |c 02 |