A transferrable and interpretable multiple instance learning model for microsatellite instability prediction based on histopathology images

ABSTRACT Background Microsatellite instability (MSI) is a negative prognostic factor for colorectal cancer (CRC) and can be used as a predictor of success for immunotherapy in pan-cancer. However, current MSI identification methods are not available for all patients. We propose an ensemble multiple instance learning (MIL)-based deep learning model to predict MSI status directly from histopathology images.Design Two cohorts of patients were collected, including 429 from The Cancer Genome Atlas (TCGA-COAD) and 785 from a self-collected Asian data set (Asian-CRC). The initial model was developed and validated in TCGA-COAD, and then generalized in Asian-CRC through transfer learning. The pathological signatures extracted from the model are associated with genotypes for model interpretation.Results A model called Ensembled Patch Likelihood Aggregation (EPLA) was developed in the TCGA-COAD training set based on two consecutive stages: patch-level prediction and WSI-level prediction. The EPLA model achieved an area-under-the -curve (AUC) of 0.8848 in the TCGA-COAD test set, which outperformed the state-of-the-art approach, and an AUC of 0.8504 in the Asian-CRC after transfer learning. Furthermore, the five pathological imaging signatures identified using the model are associated with genomic and transcriptomic profiles, which makes the MIL model interpretable. Results show that our model recognizes pathological signatures related to mutation burden, DNA repair pathways, and immunity.Conclusion Our MIL-based deep learning model can effectively predict MSI from histopathology images and are transferable to a new patient cohort. The interpretability of our model by association with genomic and transcriptomic biomarkers lays the foundation for prospective clinical research..

Medienart:

Preprint

Erscheinungsjahr:

2020

Erschienen:

2020

Enthalten in:

bioRxiv.org - (2020) vom: 06. März Zur Gesamtaufnahme - year:2020

Sprache:

Englisch

Beteiligte Personen:

Cao, Rui [VerfasserIn]
Yang, Fan [VerfasserIn]
Ma, Si-Cong [VerfasserIn]
Liu, Li [VerfasserIn]
Li, Yan [VerfasserIn]
Wu, De-Hua [VerfasserIn]
Zhao, Yu [VerfasserIn]
Wang, Tong-Xin [VerfasserIn]
Lu, Wei-Jia [VerfasserIn]
Cai, Wei-Jing [VerfasserIn]
Zhu, Hong-Bo [VerfasserIn]
Guo, Xue-Jun [VerfasserIn]
Lu, Yu-Wen [VerfasserIn]
Kuang, Jun-Jie [VerfasserIn]
Huan, Wen-Jing [VerfasserIn]
Tang, Wei-Min [VerfasserIn]
Huang, Junzhou [VerfasserIn]
Yao, Jianhua [VerfasserIn]
Dong, Zhong-Yi [VerfasserIn]

Links:

Volltext [kostenfrei]

doi:

10.1101/2020.02.29.971150

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

XBI000751618