Segment-Based Spotting of Bowel Sounds Using Pretrained Models in Continuous Data Streams

We analyse pretrained and non-pretrained deep neural models to detect 10-seconds Bowel Sounds (BS) audio segments in continuous audio data streams. The models include MobileNet, EfficientNet, and Distilled Transformer architectures. Models were initially trained on AudioSet and then transferred and evaluated on 84 hours of labelled audio data of eighteen healthy participants. Evaluation data was recorded in a semi-naturalistic daytime setting including movement and background noise using a smart shirt with embedded microphones. The collected dataset was annotated for individual BS events by two independent raters with substantial agreement (Cohen's Kappa κ = 0.74). Leave-One-Participant-Out cross-validation for detecting 10-second BS audio segments, i.e. segment-based BS spotting, yielded a best F1 score of 73% and 67%, with and without transfer learning respectively. The best model for segment-based BS spotting was EfficientNet-B2 with an attention module. Our results show that pretrained models could improve F1 score up to 26%, in particular, increasing robustness against background noise. Our segment-based BS spotting approach reduces the amount of audio data to be reviewed by experts from 84 h to 11 h, thus by  ∼ 87%.

Medienart:

E-Artikel

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

Zur Gesamtaufnahme - volume:27

Enthalten in:

IEEE journal of biomedical and health informatics - 27(2023), 7 vom: 08. Juli, Seite 3164-3174

Sprache:

Englisch

Beteiligte Personen:

Baronetto, Annalisa [VerfasserIn]
Graf, Luisa S [VerfasserIn]
Fischer, Sarah [VerfasserIn]
Neurath, Markus F [VerfasserIn]
Amft, Oliver [VerfasserIn]

Links:

Volltext

Themen:

Journal Article

Anmerkungen:

Date Completed 03.07.2023

Date Revised 16.11.2023

published: Print-Electronic

Citation Status MEDLINE

doi:

10.1109/JBHI.2023.3269910

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM356585271