Details der Publikation - Exploring Diagnostic Precision and Triage Proficiency

Exploring Diagnostic Precision and Triage Proficiency : A Comparative Study of GPT-4 and Bard in Addressing Common Ophthalmic Complaints

In the modern era, patients often resort to the internet for answers to their health-related concerns, and clinics face challenges to providing timely response to patient concerns. This has led to a need to investigate the capabilities of AI chatbots for ophthalmic diagnosis and triage. In this in silico study, 80 simulated patient complaints in ophthalmology with varying urgency levels and clinical descriptors were entered into both ChatGPT and Bard in a systematic 3-step submission process asking chatbots to triage, diagnose, and evaluate urgency. Three ophthalmologists graded chatbot responses. Chatbots were significantly better at ophthalmic triage than diagnosis (90.0% appropriate triage vs. 48.8% correct leading diagnosis; p < 0.001), and GPT-4 performed better than Bard for appropriate triage recommendations (96.3% vs. 83.8%; p = 0.008), grader satisfaction for patient use (81.3% vs. 55.0%; p < 0.001), and lower potential harm rates (6.3% vs. 20.0%; p = 0.010). More descriptors improved the accuracy of diagnosis for both GPT-4 and Bard. These results indicate that chatbots may not need to recognize the correct diagnosis to provide appropriate ophthalmic triage, and there is a potential utility of these tools in aiding patients or triage staff; however, they are not a replacement for professional ophthalmic evaluation or advice.

Medienart:	E-Artikel

Erscheinungsjahr:	2024
Erschienen:	2024

Enthalten in:	Zur Gesamtaufnahme - volume:11
Enthalten in:	Bioengineering (Basel, Switzerland) - 11(2024), 2 vom: 26. Jan.

Sprache:	Englisch

Beteiligte Personen:	Zandi, Roya [VerfasserIn] Fahey, Joseph D [VerfasserIn] Drakopoulos, Michael [VerfasserIn] Bryan, John M [VerfasserIn] Dong, Siyuan [VerfasserIn] Bryar, Paul J [VerfasserIn] Bidwell, Ann E [VerfasserIn] Bowen, R Chris [VerfasserIn] Lavine, Jeremy A [VerfasserIn] Mirza, Rukhsana G [VerfasserIn]

Links:	Volltext

Themen:	Artificial intelligence Bard ChatGPT Chatbots Journal Article Large language models Ophthalmology Triage

Anmerkungen:	Date Revised 25.02.2024 published: Electronic Citation Status PubMed-not-MEDLINE

doi:	10.3390/bioengineering11020120

funding:
Förderinstitution / Projekttitel:

PPN (Katalog-ID):	NLM368815897

Internformat


LEADER	01000naa a22002652 4500
001	NLM368815897
003	DE-627
005	20240229151800.0
007	cr uuu---uuuuu
008	240229s2024 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.3390/bioengineering11020120 \|2 doi
028	5	2	\|a pubmed24n1305.xml
035			\|a (DE-627)NLM368815897
035			\|a (NLM)38391606
035			\|a (PII)120
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Zandi, Roya \|e verfasserin \|4 aut
245	1	0	\|a Exploring Diagnostic Precision and Triage Proficiency \|b A Comparative Study of GPT-4 and Bard in Addressing Common Ophthalmic Complaints
264		1	\|c 2024
336			\|a Text \|b txt \|2 rdacontent
337			\|a ƒaComputermedien \|b c \|2 rdamedia
338			\|a ƒa Online-Ressource \|b cr \|2 rdacarrier
500			\|a Date Revised 25.02.2024
500			\|a published: Electronic
500			\|a Citation Status PubMed-not-MEDLINE
520			\|a In the modern era, patients often resort to the internet for answers to their health-related concerns, and clinics face challenges to providing timely response to patient concerns. This has led to a need to investigate the capabilities of AI chatbots for ophthalmic diagnosis and triage. In this in silico study, 80 simulated patient complaints in ophthalmology with varying urgency levels and clinical descriptors were entered into both ChatGPT and Bard in a systematic 3-step submission process asking chatbots to triage, diagnose, and evaluate urgency. Three ophthalmologists graded chatbot responses. Chatbots were significantly better at ophthalmic triage than diagnosis (90.0% appropriate triage vs. 48.8% correct leading diagnosis; p < 0.001), and GPT-4 performed better than Bard for appropriate triage recommendations (96.3% vs. 83.8%; p = 0.008), grader satisfaction for patient use (81.3% vs. 55.0%; p < 0.001), and lower potential harm rates (6.3% vs. 20.0%; p = 0.010). More descriptors improved the accuracy of diagnosis for both GPT-4 and Bard. These results indicate that chatbots may not need to recognize the correct diagnosis to provide appropriate ophthalmic triage, and there is a potential utility of these tools in aiding patients or triage staff; however, they are not a replacement for professional ophthalmic evaluation or advice
650		4	\|a Journal Article
650		4	\|a ChatGPT
650		4	\|a artificial intelligence
650		4	\|a bard
650		4	\|a chatbots
650		4	\|a large language models
650		4	\|a ophthalmology
650		4	\|a triage
700	1		\|a Fahey, Joseph D \|e verfasserin \|4 aut
700	1		\|a Drakopoulos, Michael \|e verfasserin \|4 aut
700	1		\|a Bryan, John M \|e verfasserin \|4 aut
700	1		\|a Dong, Siyuan \|e verfasserin \|4 aut
700	1		\|a Bryar, Paul J \|e verfasserin \|4 aut
700	1		\|a Bidwell, Ann E \|e verfasserin \|4 aut
700	1		\|a Bowen, R Chris \|e verfasserin \|4 aut
700	1		\|a Lavine, Jeremy A \|e verfasserin \|4 aut
700	1		\|a Mirza, Rukhsana G \|e verfasserin \|4 aut
773	0	8	\|i Enthalten in \|t Bioengineering (Basel, Switzerland) \|d 2014 \|g 11(2024), 2 vom: 26. Jan. \|w (DE-627)NLM259987786 \|x 2306-5354 \|7 nnns
773	1	8	\|g volume:11 \|g year:2024 \|g number:2 \|g day:26 \|g month:01
856	4	0	\|u http://dx.doi.org/10.3390/bioengineering11020120 \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a GBV_NLM
951			\|a AR
952			\|d 11 \|j 2024 \|e 2 \|b 26 \|c 01

Exploring Diagnostic Precision and Triage Proficiency : A Comparative Study of GPT-4 and Bard in Addressing Common Ophthalmic Complaints

Zugang & Verfügbarkeit

Zugehörige Publikationen/Bände