The P10K database : a data portal for the protist 10 000 genomes project
© The Author(s) 2023. Published by Oxford University Press on behalf of Nucleic Acids Research..
Protists, a highly diverse group of microscopic eukaryotic organisms distinct from fungi, animals and plants, exert crucial roles within the earth's biosphere. However, the genomes of only a small fraction of known protist species have been published and made publicly accessible. To address this constraint, the Protist 10 000 Genomes Project (P10K) was initiated, implementing a specialized pipeline for single-cell genome/transcriptome assembly, decontamination and annotation of protists. The resultant P10K database (https://ngdc.cncb.ac.cn/p10k/) serves as a comprehensive platform, collating and disseminating genome sequences and annotations from diverse protist groups. Currently, the P10K database has incorporated 2959 genomes and transcriptomes, including 1101 newly sequenced datasets by P10K and 1858 publicly available datasets. Notably, it covers 45% of the protist orders, with a significant representation (53% coverage) of ciliates, featuring nearly a thousand genomes/transcriptomes. Intriguingly, analysis of the unique codon table usage among ciliates has revealed differences compared to the NCBI taxonomy system, suggesting a need to revise the codon tables used for these species. Collectively, the P10K database serves as a valuable repository of genetic resources for protist research and aims to expand its collection by incorporating more sequenced data and advanced analysis tools to benefit protist studies worldwide.
Errataetall: |
ErratumIn: Nucleic Acids Res. 2023 Nov 30;:. - PMID 38035371 |
---|---|
Medienart: |
E-Artikel |
Erscheinungsjahr: |
2024 |
---|---|
Erschienen: |
2024 |
Enthalten in: |
Zur Gesamtaufnahme - volume:52 |
---|---|
Enthalten in: |
Nucleic acids research - 52(2024), D1 vom: 05. Jan., Seite D747-D755 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Gao, Xinxin [VerfasserIn] |
---|
Links: |
---|
Themen: |
---|
Anmerkungen: |
Date Completed 10.01.2024 Date Revised 10.01.2024 published: Print ErratumIn: Nucleic Acids Res. 2023 Nov 30;:. - PMID 38035371 Citation Status MEDLINE |
---|
doi: |
10.1093/nar/gkad992 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM364226994 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | NLM364226994 | ||
003 | DE-627 | ||
005 | 20240114233410.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2024 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1093/nar/gkad992 |2 doi | |
028 | 5 | 2 | |a pubmed24n1255.xml |
035 | |a (DE-627)NLM364226994 | ||
035 | |a (NLM)37930867 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Gao, Xinxin |e verfasserin |4 aut | |
245 | 1 | 4 | |a The P10K database |b a data portal for the protist 10 000 genomes project |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 10.01.2024 | ||
500 | |a Date Revised 10.01.2024 | ||
500 | |a published: Print | ||
500 | |a ErratumIn: Nucleic Acids Res. 2023 Nov 30;:. - PMID 38035371 | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © The Author(s) 2023. Published by Oxford University Press on behalf of Nucleic Acids Research. | ||
520 | |a Protists, a highly diverse group of microscopic eukaryotic organisms distinct from fungi, animals and plants, exert crucial roles within the earth's biosphere. However, the genomes of only a small fraction of known protist species have been published and made publicly accessible. To address this constraint, the Protist 10 000 Genomes Project (P10K) was initiated, implementing a specialized pipeline for single-cell genome/transcriptome assembly, decontamination and annotation of protists. The resultant P10K database (https://ngdc.cncb.ac.cn/p10k/) serves as a comprehensive platform, collating and disseminating genome sequences and annotations from diverse protist groups. Currently, the P10K database has incorporated 2959 genomes and transcriptomes, including 1101 newly sequenced datasets by P10K and 1858 publicly available datasets. Notably, it covers 45% of the protist orders, with a significant representation (53% coverage) of ciliates, featuring nearly a thousand genomes/transcriptomes. Intriguingly, analysis of the unique codon table usage among ciliates has revealed differences compared to the NCBI taxonomy system, suggesting a need to revise the codon tables used for these species. Collectively, the P10K database serves as a valuable repository of genetic resources for protist research and aims to expand its collection by incorporating more sequenced data and advanced analysis tools to benefit protist studies worldwide | ||
650 | 4 | |a Journal Article | |
650 | 7 | |a Codon |2 NLM | |
700 | 1 | |a Chen, Kai |e verfasserin |4 aut | |
700 | 1 | |a Xiong, Jie |e verfasserin |4 aut | |
700 | 1 | |a Zou, Dong |e verfasserin |4 aut | |
700 | 1 | |a Yang, Fangdian |e verfasserin |4 aut | |
700 | 1 | |a Ma, Yingke |e verfasserin |4 aut | |
700 | 1 | |a Jiang, Chuanqi |e verfasserin |4 aut | |
700 | 1 | |a Gao, Xiaoxuan |e verfasserin |4 aut | |
700 | 1 | |a Wang, Guangying |e verfasserin |4 aut | |
700 | 1 | |a Gu, Siyu |e verfasserin |4 aut | |
700 | 1 | |a Zhang, Peng |e verfasserin |4 aut | |
700 | 1 | |a Luo, Shuai |e verfasserin |4 aut | |
700 | 1 | |a Huang, Kaiyao |e verfasserin |4 aut | |
700 | 1 | |a Bao, Yiming |e verfasserin |4 aut | |
700 | 1 | |a Zhang, Zhang |e verfasserin |4 aut | |
700 | 1 | |a Ma, Lina |e verfasserin |4 aut | |
700 | 1 | |a Miao, Wei |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Nucleic acids research |d 1974 |g 52(2024), D1 vom: 05. Jan., Seite D747-D755 |w (DE-627)NLM000063398 |x 1362-4962 |7 nnns |
773 | 1 | 8 | |g volume:52 |g year:2024 |g number:D1 |g day:05 |g month:01 |g pages:D747-D755 |
856 | 4 | 0 | |u http://dx.doi.org/10.1093/nar/gkad992 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 52 |j 2024 |e D1 |b 05 |c 01 |h D747-D755 |