Details der Publikation - LOMIA-T: A Transformer-based LOngitudinal Medical Image Analysis framework for predicting treatment response of esophageal cancer

LOMIA-T: A Transformer-based LOngitudinal Medical Image Analysis framework for predicting treatment response of esophageal cancer

Abstract Deep learning models based on medical images have made significant strides in predicting treatment outcomes. However, previous methods have primarily concentrated on single time-point images, neglecting the temporal dynamics and changes inherent in longitudinal medical images. Thus, we propose a Transformer-based longitudinal image analysis framework (LOMIA-T) to contrast and fuse latent representations from pre- and post-treatment medical images for predicting treatment response. Specifically, we first design a treatment response- based contrastive loss to enhance latent representation by discerning evolutionary processes across various disease stages. Then, we integrate latent representations from pre- and post-treatment CT images using a cross-attention mechanism. Considering the redundancy in the dual-branch output features induced by the cross-attention mechanism, we propose a clinically interpretable feature fusion strategy to predict treatment response. Experimentally, the proposed framework outperforms several state-of-the-art longitudinal image analysis methods on an in-house Esophageal Squamous Cell Carcinoma (ESCC) dataset, encompassing 170 pre- and post-treatment contrast-enhanced CT image pairs from ESCC patients underwent neoadjuvant chemoradiotherapy. Ablation experiments validate the efficacy of the proposed treatment response-based contrastive loss and feature fusion strategy. The codes will be made available at<jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/syc19074115/LOMIA-T">https://github.com/syc19074115/LOMIA-T</jats:ext-link>..

Medienart:	Preprint

Erscheinungsjahr:	2024
Erschienen:	2024

Enthalten in:	bioRxiv.org - (2024) vom: 03. Apr. Zur Gesamtaufnahme - year:2024

Sprache:	Englisch

Beteiligte Personen:	Sun, Yuchen [VerfasserIn] Li, Kunwei [VerfasserIn] Chen, Duanduan [VerfasserIn] Hu, Yi [VerfasserIn] Zhang, Shuaitong [VerfasserIn]

Links:	Volltext [kostenfrei]

Themen:	570 Biology

doi:	10.1101/2024.03.29.24305018

funding:
Förderinstitution / Projekttitel:

PPN (Katalog-ID):	XBI043111211

Internformat


LEADER	01000caa a22002652 4500
001	XBI043111211
003	DE-627
005	20240405120400.0
007	cr uuu---uuuuu
008	240401s2024 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.1101/2024.03.29.24305018 \|2 doi
035			\|a (DE-627)XBI043111211
035			\|a (biorXiv)10.1101/2024.03.29.24305018
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Sun, Yuchen \|e verfasserin \|4 aut
245	1	0	\|a LOMIA-T: A Transformer-based LOngitudinal Medical Image Analysis framework for predicting treatment response of esophageal cancer
264		1	\|c 2024
336			\|a Text \|b txt \|2 rdacontent
337			\|a Computermedien \|b c \|2 rdamedia
338			\|a Online-Ressource \|b cr \|2 rdacarrier
520			\|a Abstract Deep learning models based on medical images have made significant strides in predicting treatment outcomes. However, previous methods have primarily concentrated on single time-point images, neglecting the temporal dynamics and changes inherent in longitudinal medical images. Thus, we propose a Transformer-based longitudinal image analysis framework (LOMIA-T) to contrast and fuse latent representations from pre- and post-treatment medical images for predicting treatment response. Specifically, we first design a treatment response- based contrastive loss to enhance latent representation by discerning evolutionary processes across various disease stages. Then, we integrate latent representations from pre- and post-treatment CT images using a cross-attention mechanism. Considering the redundancy in the dual-branch output features induced by the cross-attention mechanism, we propose a clinically interpretable feature fusion strategy to predict treatment response. Experimentally, the proposed framework outperforms several state-of-the-art longitudinal image analysis methods on an in-house Esophageal Squamous Cell Carcinoma (ESCC) dataset, encompassing 170 pre- and post-treatment contrast-enhanced CT image pairs from ESCC patients underwent neoadjuvant chemoradiotherapy. Ablation experiments validate the efficacy of the proposed treatment response-based contrastive loss and feature fusion strategy. The codes will be made available at<jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/syc19074115/LOMIA-T">https://github.com/syc19074115/LOMIA-T</jats:ext-link>.
650		4	\|a Biology \|7 (dpeaa)DE-84
650		4	\|a 570 \|7 (dpeaa)DE-84
700	1		\|a Li, Kunwei \|e verfasserin \|4 aut
700	1		\|a Chen, Duanduan \|e verfasserin \|4 aut
700	1		\|a Hu, Yi \|e verfasserin \|4 aut
700	1		\|a Zhang, Shuaitong \|e verfasserin \|0 (orcid)0009-0006-8536-4541 \|4 aut
773	0	8	\|i Enthalten in \|t bioRxiv.org \|g (2024) vom: 03. Apr.
773	1	8	\|g year:2024 \|g day:03 \|g month:04
856	4	0	\|u http://dx.doi.org/10.1101/2024.03.29.24305018 \|m X:VERLAG \|x 0 \|z kostenfrei \|3 Volltext
912			\|a GBV_XBI
951			\|a AR
952			\|j 2024 \|b 03 \|c 04

LOMIA-T: A Transformer-based LOngitudinal Medical Image Analysis framework for predicting treatment response of esophageal cancer

Zugang & Verfügbarkeit

Zugehörige Publikationen/Bände