Leveraging Artificial Intelligence and Synthetic Data Derivatives for Spine Surgery Research

STUDY DESIGN: Retrospective cohort study.

OBJECTIVES: Leveraging electronic health records (EHRs) for spine surgery research is impeded by concerns regarding patient privacy and data ownership. Synthetic data derivatives may help overcome these limitations. This study's objective was to validate the use of synthetic data for spine surgery research.

METHODS: Data came from the EHR from 15 hospitals. Patients that underwent anterior cervical or posterior lumbar fusion (2010-2020) were included. Real data were obtained from the EHR. Synthetic data was generated to simulate the properties of the real data, without maintaining a one-to-one correspondence with real patients. Within each cohort, ability to predict 30-day readmissions and 30-day complications was evaluated using logistic regression and extreme gradient boosting machines (XGBoost).

RESULTS: We identified 9,072 real and 9,088 synthetic cervical fusion patients. Descriptive characteristics were nearly identical between the 2 datasets. When predicting readmission, models built using real and synthetic data both had c-statistics of .69-.71 using logistic regression and XGBoost. Among 12,111 real and 12,126 synthetic lumbar fusion patients, descriptive characteristics were nearly the same for most variables. Using logistic regression and XGBoost to predict readmission, discrimination was similar with models built using real and synthetic data (c-statistics .66-.69). When predicting complications, models derived using real and synthetic data showed similar discrimination in both cohorts. Despite some differences, the most influential predictors were similar in the real and synthetic datasets.

CONCLUSION: Synthetic data replicate most descriptive and predictive properties of real data, and therefore may expand EHR research in spine surgery.

Medienart:

E-Artikel

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

Zur Gesamtaufnahme - volume:13

Enthalten in:

Global spine journal - 13(2023), 8 vom: 01. Okt., Seite 2409-2421

Sprache:

Englisch

Beteiligte Personen:

Greenberg, Jacob K [VerfasserIn]
Landman, Joshua M [VerfasserIn]
Kelly, Michael P [VerfasserIn]
Pennicooke, Brenton H [VerfasserIn]
Molina, Camilo A [VerfasserIn]
Foraker, Randi E [VerfasserIn]
Ray, Wilson Z [VerfasserIn]

Links:

Volltext

Themen:

Artificial intelligence
Electronic health records
Journal Article
Machine learning
Medical informatics
Spine surgery
Synthetic data derivatives
Treatment outcome

Anmerkungen:

Date Revised 10.02.2024

published: Print-Electronic

Citation Status PubMed-not-MEDLINE

doi:

10.1177/21925682221085535

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM339034920