Learning from vertically distributed data across multiple sites : An efficient privacy-preserving algorithm for Cox proportional hazards model with variable selection

Copyright © 2023 Elsevier Inc. All rights reserved..

OBJECTIVE: To develop a lossless distributed algorithm for regularized Cox proportional hazards model with variable selection to support federated learning for vertically distributed data.

METHODS: We propose a novel distributed algorithm for fitting regularized Cox proportional hazards model when data sharing among different data providers is restricted. Based on cyclical coordinate descent, the proposed algorithm computes intermediary statistics by each site and then exchanges them to update the model parameters in other sites without accessing individual patient-level data. We evaluate the performance of the proposed algorithm with (1) a simulation study and (2) a real-world data analysis predicting the risk of Alzheimer's dementia from the Religious Orders Study and Rush Memory and Aging Project (ROSMAP). Moreover, we compared the performance of our method with existing privacy-preserving models.

RESULTS: Our algorithm achieves privacy-preserving variable selection for time-to-event data in the vertically distributed setting, without degradation of accuracy compared with a centralized approach. Simulation demonstrates that our algorithm is highly efficient in analyzing high-dimensional datasets. Real-world data analysis reveals that our distributed Cox model yields higher accuracy in predicting the risk of Alzheimer's dementia than the conventional Cox model built by each data provider without data sharing. Moreover, our algorithm is computationally more efficient compared with existing privacy-preserving Cox models with or without regularization term.

CONCLUSION: The proposed algorithm is lossless, privacy-preserving and highly efficient to fit regularized Cox model for vertically distributed data. It provides a suitable and convenient approach for modeling time-to-event data in a distributed manner.

Medienart:

E-Artikel

Erscheinungsjahr:

2024

Erschienen:

2024

Enthalten in:

Zur Gesamtaufnahme - volume:149

Enthalten in:

Journal of biomedical informatics - 149(2024) vom: 11. Jan., Seite 104581

Sprache:

Englisch

Beteiligte Personen:

Miao, Guanhong [VerfasserIn]
Yu, Lei [VerfasserIn]
Yang, Jingyun [VerfasserIn]
Bennett, David A [VerfasserIn]
Zhao, Jinying [VerfasserIn]
Wu, Samuel S [VerfasserIn]

Links:

Volltext

Themen:

Cox proportional hazards model
Distributed algorithm
Journal Article
Privacy preserving
Research Support, N.I.H., Extramural
Variable selection
Vertical partitioning

Anmerkungen:

Date Completed 22.01.2024

Date Revised 26.04.2024

published: Print-Electronic

Citation Status MEDLINE

doi:

10.1016/j.jbi.2023.104581

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM366335693