Reference-free deconvolution, visualization and interpretation of complex DNA methylation data using DecompPipeline, MeDeCom and FactorViz

DNA methylation profiling offers unique insights into human development and diseases. Often, the analysis of complex tissues and cell mixtures is the only feasible option to study methylation changes across large patient cohorts. Since DNA methylomes are highly cell type specific, deconvolution methods can be used to recover cell type-specific information in the form of latent methylation components (LMCs) from such 'bulk' samples. Reference-free deconvolution methods retrieve these components without the need for DNA methylation profiles of purified cell types. Currently, no integrated and guided procedure is available for data preparation and subsequent interpretation of deconvolution results. Here, we describe a three-stage protocol for reference-free deconvolution of DNA methylation data comprising: (i) data preprocessing, confounder adjustment using independent component analysis (ICA) and feature selection using DecompPipeline, (ii) deconvolution with multiple parameters using MeDeCom, RefFreeCellMix or EDec and (iii) guided biological inference and validation of deconvolution results with the R/Shiny graphical user interface FactorViz. Our protocol simplifies the analysis and guides the initial interpretation of DNA methylation data derived from complex samples. The harmonized approach is particularly useful to dissect and evaluate cell heterogeneity in complex systems such as tumors. We apply the protocol to lung cancer methylomes from The Cancer Genome Atlas (TCGA) and show that our approach identifies the proportions of stromal cells and tumor-infiltrating immune cells, as well as associations of the detected components with clinical parameters. The protocol takes slightly more than 3 days to complete and requires basic R skills.

Media type:

E-article

Year of publication:

2020

Published:

2020

Contained in:

Nature protocols - 15(2020), 10, dated 25 Oct., pages 3240-3263

Language:

English

Contributors:

Scherer, Michael [Author]
Nazarov, Petr V [Author]
Toth, Reka [Author]
Sahay, Shashwat [Author]
Kaoma, Tony [Author]
Maurer, Valentin [Author]
Vedeneev, Nikita [Author]
Plass, Christoph [Author]
Lengauer, Thomas [Author]
Walter, Jörn [Author]
Lutsik, Pavlo [Author]

Subjects:

Journal Article
Research Support, Non-U.S. Gov't

Notes:

Date Completed 03.11.2020

Date Revised 22.04.2022

published: Print-Electronic

Citation Status MEDLINE

doi:

10.1038/s41596-020-0369-6

PPN (catalog ID):

NLM315493283