Multitask Learning of Longitudinal Circulating Biomarkers and Clinical Outcomes: Identification of Optimal Machine-Learning and Deep-Learning Models

Abstract Many circulating biomarkers are assessed at different time intervals during clinical studies. Despite of the success of standard joint models in predicting clinical outcomes using low-dimensional longitudinal data (1-2 biomarkers), significant computational challenges are encountered when applying these techniques to high-dimensional biomarker datasets. Modern machine- or deep-learning models show potential for multiple biomarker processes, but systematic evaluations and applications to high-dimensional data in the clinical settings have yet to be reported. We aimed to enhance the scalability of joint modeling and provide guidance on optimal approaches for high-dimensional biomarker data and outcomes. We evaluated multiple deep-learning and machine-learning models using 24 clinical biomarkers and survival data from the SQUIRE trial, a phase 3 randomized clinical trial investigating necitumumab and standard gemcitabine/cisplatin treatment in patients with squamous non-small-cell lung cancer (NSCLC). Overall, we confirmed that longitudinal models enabled more accurate prediction of patients’ survival compared to those solely based on baseline information. Coupling multivariate functional principal component analysis (MFPCA) with Cox regression (MFPCA-Cox) provided the highest predictive discrimination and accuracy for the NSCLC patients with AUC values of 0.7 - >0.8 at various landmark time points and prediction timeframes, outperforming recent advanced Transformer and convolutional neural network deep-learning algorithms (TransformerJM and Match-Net, respectively). In conclusion, we identified that MFPCA-Cox represents a robust and versatile joint modeling algorithm for high-dimensional biomarker longitudinal data with irregular and missing data, capturing complex relationships within the data, yielding accurate predictions for both longitudinal biomarkers and survival outcomes, and gaining insights into the underlying dynamics..

Medienart:

Preprint

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

bioRxiv.org - (2023) vom: 25. Aug. Zur Gesamtaufnahme - year:2023

Sprache:

Englisch

Beteiligte Personen:

Yuan, Min [VerfasserIn]
Su, Shixin [VerfasserIn]
Ding, Haolun [VerfasserIn]
Yang, Yaning [VerfasserIn]
Gupta, Manish [VerfasserIn]
Xu, Xu Steven [VerfasserIn]

Links:

Volltext [kostenfrei]

Themen:

570
Biology

doi:

10.1101/2023.08.19.553991

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

XBI040587312