RNAirport : a deep neural network-based database characterizing representative gene models in plants
Copyright © 2024 The Authors. Published by Elsevier Ltd.. All rights reserved..
A 5'-leader, known initially as the 5'-untranslated region, contains multiple isoforms due to alternative splicings (aS) and transcription start sites (aTSS). Therefore, a representative 5'-leader is demanded to examine the embedded RNA regulatory elements in controlling translation efficiency. Here, we develop a ranking algorithm and a deep-learning model to annotate representative 5'-leaders for five plant species. We rank the intra- and inter-sample frequency of aS-mediated transcript isoforms using the Kruskal-Wallis test-based algorithm and identify the representative aS-5'-leader. To further assign a representative 5'-end, we train the deep-learning model 5'leaderP to learn aTSS-mediated 5'-end distribution patterns from cap-analysis gene expression (CAGE) data. The model accurately predicts the 5'-end, confirmed experimentally in Arabidopsis and rice. The representative 5'-leader-contained gene models and 5'leaderP can be accessed at RNAirport (http://www.rnairport.com/leader5P/). This stage 1 5'-leader annotation records 5'-leader diversity and will pave the way to Ribo-Seq ORF annotation, identical to the project recently initiated by human GENCODE.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2024 |
---|---|
Erschienen: |
2024 |
Enthalten in: |
Zur Gesamtaufnahme - year:2024 |
---|---|
Enthalten in: |
Journal of genetics and genomics = Yi chuan xue bao - (2024) vom: 20. März |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Zhu, Sitao [VerfasserIn] |
---|
Links: |
---|
Themen: |
5′-leader |
---|
Anmerkungen: |
Date Revised 22.03.2024 published: Print-Electronic Citation Status Publisher |
---|
doi: |
10.1016/j.jgg.2024.03.004 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM370085477 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM370085477 | ||
003 | DE-627 | ||
005 | 20240324000125.0 | ||
007 | cr uuu---uuuuu | ||
008 | 240324s2024 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1016/j.jgg.2024.03.004 |2 doi | |
028 | 5 | 2 | |a pubmed24n1343.xml |
035 | |a (DE-627)NLM370085477 | ||
035 | |a (NLM)38518981 | ||
035 | |a (PII)S1673-8527(24)00057-2 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Zhu, Sitao |e verfasserin |4 aut | |
245 | 1 | 0 | |a RNAirport |b a deep neural network-based database characterizing representative gene models in plants |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Revised 22.03.2024 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status Publisher | ||
520 | |a Copyright © 2024 The Authors. Published by Elsevier Ltd.. All rights reserved. | ||
520 | |a A 5'-leader, known initially as the 5'-untranslated region, contains multiple isoforms due to alternative splicings (aS) and transcription start sites (aTSS). Therefore, a representative 5'-leader is demanded to examine the embedded RNA regulatory elements in controlling translation efficiency. Here, we develop a ranking algorithm and a deep-learning model to annotate representative 5'-leaders for five plant species. We rank the intra- and inter-sample frequency of aS-mediated transcript isoforms using the Kruskal-Wallis test-based algorithm and identify the representative aS-5'-leader. To further assign a representative 5'-end, we train the deep-learning model 5'leaderP to learn aTSS-mediated 5'-end distribution patterns from cap-analysis gene expression (CAGE) data. The model accurately predicts the 5'-end, confirmed experimentally in Arabidopsis and rice. The representative 5'-leader-contained gene models and 5'leaderP can be accessed at RNAirport (http://www.rnairport.com/leader5P/). This stage 1 5'-leader annotation records 5'-leader diversity and will pave the way to Ribo-Seq ORF annotation, identical to the project recently initiated by human GENCODE | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a 5′-leader | |
650 | 4 | |a Deep learning | |
650 | 4 | |a RNA regulatory elements | |
650 | 4 | |a Synthetic biology | |
650 | 4 | |a Transcript isoforms | |
650 | 4 | |a Translational control | |
650 | 4 | |a uORF | |
700 | 1 | |a Yuan, Shu |e verfasserin |4 aut | |
700 | 1 | |a Niu, Ruixia |e verfasserin |4 aut | |
700 | 1 | |a Zhou, Yulu |e verfasserin |4 aut | |
700 | 1 | |a Wang, Zhao |e verfasserin |4 aut | |
700 | 1 | |a Xu, Guoyong |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Journal of genetics and genomics = Yi chuan xue bao |d 2007 |g (2024) vom: 20. März |w (DE-627)NLM169995097 |x 1673-8527 |7 nnns |
773 | 1 | 8 | |g year:2024 |g day:20 |g month:03 |
856 | 4 | 0 | |u http://dx.doi.org/10.1016/j.jgg.2024.03.004 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |j 2024 |b 20 |c 03 |