ESPRESSO : Robust discovery and quantification of transcript isoforms from error-prone long-read RNA-seq data
Long-read RNA sequencing (RNA-seq) holds great potential for characterizing transcriptome variation and full-length transcript isoforms, but the relatively high error rate of current long-read sequencing platforms poses a major challenge. We present ESPRESSO, a computational tool for robust discovery and quantification of transcript isoforms from error-prone long reads. ESPRESSO jointly considers alignments of all long reads aligned to a gene and uses error profiles of individual reads to improve the identification of splice junctions and the discovery of their corresponding transcript isoforms. On both a synthetic spike-in RNA sample and human RNA samples, ESPRESSO outperforms multiple contemporary tools in not only transcript isoform discovery but also transcript isoform quantification. In total, we generated and analyzed ~1.1 billion nanopore RNA-seq reads covering 30 human tissue samples and three human cell lines. ESPRESSO and its companion dataset provide a useful resource for studying the RNA repertoire of eukaryotic transcriptomes.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2023 |
---|---|
Erschienen: |
2023 |
Enthalten in: |
Zur Gesamtaufnahme - volume:9 |
---|---|
Enthalten in: |
Science advances - 9(2023), 3 vom: 20. Jan., Seite eabq5072 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Gao, Yuan [VerfasserIn] |
---|
Links: |
---|
Themen: |
---|
Anmerkungen: |
Date Completed 24.01.2023 Date Revised 24.11.2023 published: Print-Electronic Citation Status MEDLINE |
---|
doi: |
10.1126/sciadv.abq5072 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM351735887 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM351735887 | ||
003 | DE-627 | ||
005 | 20231226052110.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2023 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1126/sciadv.abq5072 |2 doi | |
028 | 5 | 2 | |a pubmed24n1172.xml |
035 | |a (DE-627)NLM351735887 | ||
035 | |a (NLM)36662851 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Gao, Yuan |e verfasserin |4 aut | |
245 | 1 | 0 | |a ESPRESSO |b Robust discovery and quantification of transcript isoforms from error-prone long-read RNA-seq data |
264 | 1 | |c 2023 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 24.01.2023 | ||
500 | |a Date Revised 24.11.2023 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a Long-read RNA sequencing (RNA-seq) holds great potential for characterizing transcriptome variation and full-length transcript isoforms, but the relatively high error rate of current long-read sequencing platforms poses a major challenge. We present ESPRESSO, a computational tool for robust discovery and quantification of transcript isoforms from error-prone long reads. ESPRESSO jointly considers alignments of all long reads aligned to a gene and uses error profiles of individual reads to improve the identification of splice junctions and the discovery of their corresponding transcript isoforms. On both a synthetic spike-in RNA sample and human RNA samples, ESPRESSO outperforms multiple contemporary tools in not only transcript isoform discovery but also transcript isoform quantification. In total, we generated and analyzed ~1.1 billion nanopore RNA-seq reads covering 30 human tissue samples and three human cell lines. ESPRESSO and its companion dataset provide a useful resource for studying the RNA repertoire of eukaryotic transcriptomes | ||
650 | 4 | |a Journal Article | |
650 | 7 | |a RNA |2 NLM | |
650 | 7 | |a 63231-63-0 |2 NLM | |
650 | 7 | |a Protein Isoforms |2 NLM | |
700 | 1 | |a Wang, Feng |e verfasserin |4 aut | |
700 | 1 | |a Wang, Robert |e verfasserin |4 aut | |
700 | 1 | |a Kutschera, Eric |e verfasserin |4 aut | |
700 | 1 | |a Xu, Yang |e verfasserin |4 aut | |
700 | 1 | |a Xie, Stephan |e verfasserin |4 aut | |
700 | 1 | |a Wang, Yuanyuan |e verfasserin |4 aut | |
700 | 1 | |a Kadash-Edmondson, Kathryn E |e verfasserin |4 aut | |
700 | 1 | |a Lin, Lan |e verfasserin |4 aut | |
700 | 1 | |a Xing, Yi |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Science advances |d 2015 |g 9(2023), 3 vom: 20. Jan., Seite eabq5072 |w (DE-627)NLM247717614 |x 2375-2548 |7 nnns |
773 | 1 | 8 | |g volume:9 |g year:2023 |g number:3 |g day:20 |g month:01 |g pages:eabq5072 |
856 | 4 | 0 | |u http://dx.doi.org/10.1126/sciadv.abq5072 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 9 |j 2023 |e 3 |b 20 |c 01 |h eabq5072 |