Model-based tumor subclonal reconstruction

Abstract The vast majority of cancer next-generation sequencing data consist of bulk samples composed of mixtures of cancer and normal cells. To study tumor evolution, subclonal reconstruction approaches based on machine learning are used to separate subpopulation of cancer cells and reconstruct their ancestral relationships. However, current approaches are entirely data-driven and agnostic to evolutionary theory. We demonstrate that systematic errors occur in subclonal reconstruction if tumor evolution is not accounted for, and that those errors increase when multiple samples are taken from the same tumor. To address this issue, we present a novel approach for model-based subclonal reconstruction that combines data-driven machine learning with evolutionary theory. Using public, synthetic and newly generated data, we show the method is more robust and accurate than current techniques in both single-sample and multi-region sequencing data. With careful data curation and interpretation, we show how the method allows minimizing the confounding factors that affect non-evolutionary methods, leading to a more accurate recovery of the evolutionary history of human tumors..

Medienart:

Preprint

Erscheinungsjahr:

2022

Erschienen:

2022

Enthalten in:

bioRxiv.org - (2022) vom: 14. Sept. Zur Gesamtaufnahme - year:2022

Sprache:

Englisch

Beteiligte Personen:

Caravagna, Giulio [VerfasserIn]
Heide, Timon [VerfasserIn]
Williams, Marc [VerfasserIn]
Zapata, Luis [VerfasserIn]
Nichol, Daniel [VerfasserIn]
Chkhaidze, Ketevan [VerfasserIn]
Cross, William [VerfasserIn]
Cresswell, George D. [VerfasserIn]
Werner, Benjamin [VerfasserIn]
Acar, Ahmet [VerfasserIn]
Barnes, Chris P. [VerfasserIn]
Sanguinetti, Guido [VerfasserIn]
Graham, Trevor A. [VerfasserIn]
Sottoriva, Andrea [VerfasserIn]

Links:

Volltext [lizenzpflichtig]
Volltext [kostenfrei]

Themen:

570
Biology

doi:

10.1101/586560

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

XBI000480339