scRAA : the development of a robust and automatic annotation procedure for single-cell RNA sequencing data

A critical task in single-cell RNA sequencing (scRNA-Seq) data analysis is to identify cell types from heterogeneous tissues. While the majority of classification methods demonstrated high performance in scRNA-Seq annotation problems, a robust and accurate solution is desired to generate reliable outcomes for downstream analyses, for instance, marker genes identification, differentially expressed genes, and pathway analysis. It is hard to establish a universally good metric. Thus, a universally good classification method for all kinds of scenarios does not exist. In addition, reference and query data in cell classification are usually from different experimental batches, and failure to consider batch effects may result in misleading conclusions. To overcome this bottleneck, we propose a robust ensemble approach to classify cells and utilize a batch correction method between reference and query data. We simulated four scenarios that comprise simple to complex batch effect and account for varying cell-type proportions. We further tested our approach on both lung and pancreas data. We found improved prediction accuracy and robust performance across simulation scenarios and real data. The incorporation of batch effect correction between reference and query, and the ensemble approach improve cell-type prediction accuracy while maintaining robustness. We demonstrated these through simulated and real scRNA-Seq data.

Medienart:

E-Artikel

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

Zur Gesamtaufnahme - year:2023

Enthalten in:

Journal of biopharmaceutical statistics - (2023) vom: 10. Mai, Seite 1-14

Sprache:

Englisch

Beteiligte Personen:

Yan, Dongyan [VerfasserIn]
Sun, Zhe [VerfasserIn]
Fang, Jiyuan [VerfasserIn]
Cao, Shanshan [VerfasserIn]
Wang, Wenjie [VerfasserIn]
Chang, Xinyue [VerfasserIn]
Badirli, Sarkhan [VerfasserIn]
Fu, Haoda [VerfasserIn]
Liu, Yushi [VerfasserIn]

Links:

Volltext

Themen:

Batch effect correction
Cell-type classification
Ensembled method
Journal Article

Anmerkungen:

Date Revised 10.05.2023

published: Print-Electronic

Citation Status Publisher

doi:

10.1080/10543406.2023.2208671

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM35665205X