Details der Publikation - Build neural network models to identify and correct news headlines exaggerating obesity-related scientific findings

Build neural network models to identify and correct news headlines exaggerating obesity-related scientific findings

Purpose Media exaggerations of health research may confuse readers’ understanding, erode public trust in science and medicine, and cause disease mismanagement. This study built artificial intelligence (AI) models to automatically identify and correct news headlines exaggerating obesity-related research findings. Design/methodology/approach We searched popular digital media outlets to collect 523 headlines exaggerating obesity-related research findings. The reasons for exaggerations include: inferring causality from observational studies, inferring human outcomes from animal research, inferring distant/end outcomes (e.g., obesity) from immediate/intermediate outcomes (e.g., calorie intake), and generalizing findings to the population from a subgroup or convenience sample. Each headline was paired with the title and abstract of the peer-reviewed journal publication covered by the news article. We drafted an exaggeration-free counterpart for each original headline and fined-tuned a BERT model to differentiate between them. We further fine-tuned three generative language models—BART, PEGASUS, and T5 to autogenerate exaggeration-free headlines based on a journal publication’s title and abstract. Model performance was evaluated using the ROUGE metrics by comparing model-generated headlines with journal publication titles. Findings The fine-tuned BERT model achieved 92.5% accuracy in differentiating between exaggeration-free and original headlines. Baseline ROUGE scores averaged 0.311 for ROUGE-1, 0.113 for ROUGE-2, 0.253 for ROUGE-L, and 0.253 ROUGE-Lsum. PEGASUS, T5, and BART all outperformed the baseline. The best-performing BART model attained 0.447 for ROUGE-1, 0.221 for ROUGE-2, 0.402 for ROUGE-L, and 0.402 for ROUGE-Lsum. Originality/value This study demonstrated the feasibility of leveraging AI to automatically identify and correct news headlines exaggerating obesity-related research findings..

Medienart:	E-Artikel

Erscheinungsjahr:	2023
Erschienen:	2023

Enthalten in:	Zur Gesamtaufnahme - volume:8
Enthalten in:	Journal of data and information science - 8(2023), 3 vom: 25. Aug., Seite 88-97

Sprache:	Englisch

Beteiligte Personen:	An, Ruopeng [VerfasserIn] Batcheller, Quinlan [VerfasserIn] Wang, Junjie [VerfasserIn] Yang, Yuyi [VerfasserIn]

Links:	Volltext [kostenfrei]

BKL:	06.30 / Bibliothekswesen / Dokumentationswesen: Allgemeines 06.74 / Informationssysteme

Anmerkungen:	© 2023 Ruopeng An et al., published by Sciendo

doi:	10.2478/jdis-2023-0014

funding:
Förderinstitution / Projekttitel:

PPN (Katalog-ID):	GRUY009188231

Internformat


LEADER	01000caa a22002652 4500
001	GRUY009188231
003	DE-627
005	20231205155308.0
007	cr uuu---uuuuu
008	230830s2023 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.2478/jdis-2023-0014 \|2 doi
035			\|a (DE-627)GRUY009188231
035			\|a (DE-B1597)jdis-2023-0014-e
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
082	0	4	\|a 020 \|q VZ
084			\|a ASIEN \|q DE-1a \|2 fid
084			\|a 06.30 \|2 bkl
084			\|a 06.74 \|2 bkl
100	1		\|a An, Ruopeng \|e verfasserin \|0 (orcid)0000-0003-2270-7245 \|4 aut
245	1	0	\|a Build neural network models to identify and correct news headlines exaggerating obesity-related scientific findings
264		1	\|c 2023
336			\|a Text \|b txt \|2 rdacontent
337			\|a Computermedien \|b c \|2 rdamedia
338			\|a Online-Ressource \|b cr \|2 rdacarrier
500			\|a © 2023 Ruopeng An et al., published by Sciendo
520			\|a Purpose Media exaggerations of health research may confuse readers’ understanding, erode public trust in science and medicine, and cause disease mismanagement. This study built artificial intelligence (AI) models to automatically identify and correct news headlines exaggerating obesity-related research findings. Design/methodology/approach We searched popular digital media outlets to collect 523 headlines exaggerating obesity-related research findings. The reasons for exaggerations include: inferring causality from observational studies, inferring human outcomes from animal research, inferring distant/end outcomes (e.g., obesity) from immediate/intermediate outcomes (e.g., calorie intake), and generalizing findings to the population from a subgroup or convenience sample. Each headline was paired with the title and abstract of the peer-reviewed journal publication covered by the news article. We drafted an exaggeration-free counterpart for each original headline and fined-tuned a BERT model to differentiate between them. We further fine-tuned three generative language models—BART, PEGASUS, and T5 to autogenerate exaggeration-free headlines based on a journal publication’s title and abstract. Model performance was evaluated using the ROUGE metrics by comparing model-generated headlines with journal publication titles. Findings The fine-tuned BERT model achieved 92.5% accuracy in differentiating between exaggeration-free and original headlines. Baseline ROUGE scores averaged 0.311 for ROUGE-1, 0.113 for ROUGE-2, 0.253 for ROUGE-L, and 0.253 ROUGE-Lsum. PEGASUS, T5, and BART all outperformed the baseline. The best-performing BART model attained 0.447 for ROUGE-1, 0.221 for ROUGE-2, 0.402 for ROUGE-L, and 0.402 for ROUGE-Lsum. Originality/value This study demonstrated the feasibility of leveraging AI to automatically identify and correct news headlines exaggerating obesity-related research findings.
700	1		\|a Batcheller, Quinlan \|4 aut
700	1		\|a Wang, Junjie \|4 aut
700	1		\|a Yang, Yuyi \|4 aut
773	0	8	\|i Enthalten in \|t Journal of data and information science \|d Sciendo, 2016 \|g 8(2023), 3 vom: 25. Aug., Seite 88-97 \|h Online-Ressource \|w (DE-627)GRUY001486012 \|w (DE-600)2881521-X \|w (DE-576)482249730 \|x 2543-683X \|7 nnns
773	1	8	\|g volume:8 \|g year:2023 \|g number:3 \|g day:25 \|g month:08 \|g pages:88-97
856	4	0	\|u https://dx.doi.org/10.2478/jdis-2023-0014 \|z kostenfrei \|3 Volltext
912			\|a GBV_GRUY
912			\|a FID-ASIEN
936	b	k	\|a 06.30 \|j Bibliothekswesen \|j Dokumentationswesen: Allgemeines \|q VZ
936	b	k	\|a 06.74 \|j Informationssysteme \|q VZ
951			\|a AR
952			\|d 8 \|j 2023 \|e 3 \|b 25 \|c 08 \|h 88-97

Build neural network models to identify and correct news headlines exaggerating obesity-related scientific findings

Zugang & Verfügbarkeit

Zugehörige Publikationen/Bände