Learning from vertically distributed data across multiple sites : An efficient privacy-preserving algorithm for Cox proportional hazards model with variable selection
Copyright © 2023 Elsevier Inc. All rights reserved..
OBJECTIVE: To develop a lossless distributed algorithm for regularized Cox proportional hazards model with variable selection to support federated learning for vertically distributed data.
METHODS: We propose a novel distributed algorithm for fitting regularized Cox proportional hazards model when data sharing among different data providers is restricted. Based on cyclical coordinate descent, the proposed algorithm computes intermediary statistics by each site and then exchanges them to update the model parameters in other sites without accessing individual patient-level data. We evaluate the performance of the proposed algorithm with (1) a simulation study and (2) a real-world data analysis predicting the risk of Alzheimer's dementia from the Religious Orders Study and Rush Memory and Aging Project (ROSMAP). Moreover, we compared the performance of our method with existing privacy-preserving models.
RESULTS: Our algorithm achieves privacy-preserving variable selection for time-to-event data in the vertically distributed setting, without degradation of accuracy compared with a centralized approach. Simulation demonstrates that our algorithm is highly efficient in analyzing high-dimensional datasets. Real-world data analysis reveals that our distributed Cox model yields higher accuracy in predicting the risk of Alzheimer's dementia than the conventional Cox model built by each data provider without data sharing. Moreover, our algorithm is computationally more efficient compared with existing privacy-preserving Cox models with or without regularization term.
CONCLUSION: The proposed algorithm is lossless, privacy-preserving and highly efficient to fit regularized Cox model for vertically distributed data. It provides a suitable and convenient approach for modeling time-to-event data in a distributed manner.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2024 |
---|---|
Erschienen: |
2024 |
Enthalten in: |
Zur Gesamtaufnahme - volume:149 |
---|---|
Enthalten in: |
Journal of biomedical informatics - 149(2024) vom: 11. Jan., Seite 104581 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Miao, Guanhong [VerfasserIn] |
---|
Links: |
---|
Themen: |
Cox proportional hazards model |
---|
Anmerkungen: |
Date Completed 22.01.2024 Date Revised 26.04.2024 published: Print-Electronic Citation Status MEDLINE |
---|
doi: |
10.1016/j.jbi.2023.104581 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM366335693 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | NLM366335693 | ||
003 | DE-627 | ||
005 | 20240426233504.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231227s2024 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1016/j.jbi.2023.104581 |2 doi | |
028 | 5 | 2 | |a pubmed24n1388.xml |
035 | |a (DE-627)NLM366335693 | ||
035 | |a (NLM)38142903 | ||
035 | |a (PII)S1532-0464(23)00302-7 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Miao, Guanhong |e verfasserin |4 aut | |
245 | 1 | 0 | |a Learning from vertically distributed data across multiple sites |b An efficient privacy-preserving algorithm for Cox proportional hazards model with variable selection |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 22.01.2024 | ||
500 | |a Date Revised 26.04.2024 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a Copyright © 2023 Elsevier Inc. All rights reserved. | ||
520 | |a OBJECTIVE: To develop a lossless distributed algorithm for regularized Cox proportional hazards model with variable selection to support federated learning for vertically distributed data | ||
520 | |a METHODS: We propose a novel distributed algorithm for fitting regularized Cox proportional hazards model when data sharing among different data providers is restricted. Based on cyclical coordinate descent, the proposed algorithm computes intermediary statistics by each site and then exchanges them to update the model parameters in other sites without accessing individual patient-level data. We evaluate the performance of the proposed algorithm with (1) a simulation study and (2) a real-world data analysis predicting the risk of Alzheimer's dementia from the Religious Orders Study and Rush Memory and Aging Project (ROSMAP). Moreover, we compared the performance of our method with existing privacy-preserving models | ||
520 | |a RESULTS: Our algorithm achieves privacy-preserving variable selection for time-to-event data in the vertically distributed setting, without degradation of accuracy compared with a centralized approach. Simulation demonstrates that our algorithm is highly efficient in analyzing high-dimensional datasets. Real-world data analysis reveals that our distributed Cox model yields higher accuracy in predicting the risk of Alzheimer's dementia than the conventional Cox model built by each data provider without data sharing. Moreover, our algorithm is computationally more efficient compared with existing privacy-preserving Cox models with or without regularization term | ||
520 | |a CONCLUSION: The proposed algorithm is lossless, privacy-preserving and highly efficient to fit regularized Cox model for vertically distributed data. It provides a suitable and convenient approach for modeling time-to-event data in a distributed manner | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Research Support, N.I.H., Extramural | |
650 | 4 | |a Cox proportional hazards model | |
650 | 4 | |a Distributed algorithm | |
650 | 4 | |a Privacy preserving | |
650 | 4 | |a Variable selection | |
650 | 4 | |a Vertical partitioning | |
700 | 1 | |a Yu, Lei |e verfasserin |4 aut | |
700 | 1 | |a Yang, Jingyun |e verfasserin |4 aut | |
700 | 1 | |a Bennett, David A |e verfasserin |4 aut | |
700 | 1 | |a Zhao, Jinying |e verfasserin |4 aut | |
700 | 1 | |a Wu, Samuel S |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Journal of biomedical informatics |d 2001 |g 149(2024) vom: 11. Jan., Seite 104581 |w (DE-627)NLM112821766 |x 1532-0480 |7 nnns |
773 | 1 | 8 | |g volume:149 |g year:2024 |g day:11 |g month:01 |g pages:104581 |
856 | 4 | 0 | |u http://dx.doi.org/10.1016/j.jbi.2023.104581 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 149 |j 2024 |b 11 |c 01 |h 104581 |