scPROTEIN : a versatile deep graph contrastive learning framework for single-cell proteomics embedding
© 2024. The Author(s), under exclusive licence to Springer Nature America, Inc..
Single-cell proteomics sequencing technology sheds light on protein-protein interactions, posttranslational modifications and proteoform dynamics in the cell. However, the uncertainty estimation for peptide quantification, data missingness, batch effects and high noise hinder the analysis of single-cell proteomic data. It is important to solve this set of tangled problems together, but the existing methods tailored for single-cell transcriptomes cannot fully address this task. Here we propose a versatile framework designed for single-cell proteomics data analysis called scPROTEIN, which consists of peptide uncertainty estimation based on a multitask heteroscedastic regression model and cell embedding generation based on graph contrastive learning. scPROTEIN can estimate the uncertainty of peptide quantification, denoise protein data, remove batch effects and encode single-cell proteomic-specific embeddings in a unified framework. We demonstrate that scPROTEIN is efficient for cell clustering, batch correction, cell type annotation, clinical analysis and spatially resolved proteomic data exploration.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2024 |
---|---|
Erschienen: |
2024 |
Enthalten in: |
Zur Gesamtaufnahme - volume:21 |
---|---|
Enthalten in: |
Nature methods - 21(2024), 4 vom: 20. Apr., Seite 623-634 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Li, Wei [VerfasserIn] |
---|
Links: |
---|
Themen: |
---|
Anmerkungen: |
Date Completed 15.04.2024 Date Revised 15.04.2024 published: Print-Electronic Citation Status MEDLINE |
---|
doi: |
10.1038/s41592-024-02214-9 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM369937163 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | NLM369937163 | ||
003 | DE-627 | ||
005 | 20240415233501.0 | ||
007 | cr uuu---uuuuu | ||
008 | 240320s2024 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1038/s41592-024-02214-9 |2 doi | |
028 | 5 | 2 | |a pubmed24n1376.xml |
035 | |a (DE-627)NLM369937163 | ||
035 | |a (NLM)38504113 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Li, Wei |e verfasserin |4 aut | |
245 | 1 | 0 | |a scPROTEIN |b a versatile deep graph contrastive learning framework for single-cell proteomics embedding |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 15.04.2024 | ||
500 | |a Date Revised 15.04.2024 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © 2024. The Author(s), under exclusive licence to Springer Nature America, Inc. | ||
520 | |a Single-cell proteomics sequencing technology sheds light on protein-protein interactions, posttranslational modifications and proteoform dynamics in the cell. However, the uncertainty estimation for peptide quantification, data missingness, batch effects and high noise hinder the analysis of single-cell proteomic data. It is important to solve this set of tangled problems together, but the existing methods tailored for single-cell transcriptomes cannot fully address this task. Here we propose a versatile framework designed for single-cell proteomics data analysis called scPROTEIN, which consists of peptide uncertainty estimation based on a multitask heteroscedastic regression model and cell embedding generation based on graph contrastive learning. scPROTEIN can estimate the uncertainty of peptide quantification, denoise protein data, remove batch effects and encode single-cell proteomic-specific embeddings in a unified framework. We demonstrate that scPROTEIN is efficient for cell clustering, batch correction, cell type annotation, clinical analysis and spatially resolved proteomic data exploration | ||
650 | 4 | |a Journal Article | |
650 | 7 | |a Peptides |2 NLM | |
700 | 1 | |a Yang, Fan |e verfasserin |4 aut | |
700 | 1 | |a Wang, Fang |e verfasserin |4 aut | |
700 | 1 | |a Rong, Yu |e verfasserin |4 aut | |
700 | 1 | |a Liu, Linjing |e verfasserin |4 aut | |
700 | 1 | |a Wu, Bingzhe |e verfasserin |4 aut | |
700 | 1 | |a Zhang, Han |e verfasserin |4 aut | |
700 | 1 | |a Yao, Jianhua |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Nature methods |d 2004 |g 21(2024), 4 vom: 20. Apr., Seite 623-634 |w (DE-627)NLM154352349 |x 1548-7105 |7 nnns |
773 | 1 | 8 | |g volume:21 |g year:2024 |g number:4 |g day:20 |g month:04 |g pages:623-634 |
856 | 4 | 0 | |u http://dx.doi.org/10.1038/s41592-024-02214-9 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 21 |j 2024 |e 4 |b 20 |c 04 |h 623-634 |