Reference-informed prediction of alternative splicing and splicing-altering mutations from sequences
© 2024 Xu et al.; Published by Cold Spring Harbor Laboratory Press..
Alternative splicing plays a crucial role in protein diversity and gene expression regulation in higher eukaryotes, and mutations causing dysregulated splicing underlie a range of genetic diseases. Computational prediction of alternative splicing from genomic sequences not only provides insight into gene-regulatory mechanisms but also helps identify disease-causing mutations and drug targets. However, the current methods for the quantitative prediction of splice site usage still have limited accuracy. Here, we present DeltaSplice, a deep neural network model optimized to learn the impact of mutations on quantitative changes in alternative splicing from the comparative analysis of homologous genes. The model architecture enables DeltaSplice to perform "reference-informed prediction" by incorporating the known splice site usage of a reference gene sequence to improve its prediction on splicing-altering mutations. We benchmarked DeltaSplice and several other state-of-the-art methods on various prediction tasks, including evolutionary sequence divergence on lineage-specific splicing and splicing-altering mutations in human populations and neurodevelopmental disorders, and demonstrated that DeltaSplice outperformed consistently. DeltaSplice predicted ∼15% of splicing quantitative trait loci (sQTLs) in the human brain as causal splicing-altering variants. It also predicted splicing-altering de novo mutations outside the splice sites in a subset of patients affected by autism and other neurodevelopmental disorders (NDDs), including 19 genes with recurrent splicing-altering mutations. Integration of splicing-altering mutations with other types of de novo mutation burdens allowed the prediction of eight novel NDD-risk genes. Our work expanded the capacity of in silico splicing models with potential applications in genetic diagnosis and the development of splicing-based precision medicine.
Errataetall: |
UpdateOf: bioRxiv. 2024 Apr 08:2024.03.22.586363. doi: 10.1101/2024.03.22.586363. - PMID 38586002 |
---|---|
Media Type: |
Electronic Article |
Year of Publication: |
2024 |
---|---|
Publication: |
2024 |
Contained In: |
To Main Record - volume:34 |
---|---|
Contained In: |
Genome research - 34(2024), 7 vom: 20. Aug., Seite 1052-1065 |
Language: |
English |
---|
Contributors: |
Xu, Chencheng [Author] |
---|
Links: |
---|
Keywords: |
---|
Notes: |
Date Completed 20.08.2024 Date Revised 04.09.2024 published: Electronic UpdateOf: bioRxiv. 2024 Apr 08:2024.03.22.586363. doi: 10.1101/2024.03.22.586363. - PMID 38586002 Citation Status MEDLINE |
---|
doi: |
10.1101/gr.279044.124 |
---|
funding: |
|
---|---|
Supporting institution / Project title: |
|
PPN (Catalogue-ID): |
NLM37547532X |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | NLM37547532X | ||
003 | DE-627 | ||
005 | 20240904233014.0 | ||
007 | cr uuu---uuuuu | ||
008 | 240727s2024 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1101/gr.279044.124 |2 doi | |
028 | 5 | 2 | |a pubmed24n1523.xml |
035 | |a (DE-627)NLM37547532X | ||
035 | |a (NLM)39060028 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Xu, Chencheng |e verfasserin |4 aut | |
245 | 1 | 0 | |a Reference-informed prediction of alternative splicing and splicing-altering mutations from sequences |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 20.08.2024 | ||
500 | |a Date Revised 04.09.2024 | ||
500 | |a published: Electronic | ||
500 | |a UpdateOf: bioRxiv. 2024 Apr 08:2024.03.22.586363. doi: 10.1101/2024.03.22.586363. - PMID 38586002 | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © 2024 Xu et al.; Published by Cold Spring Harbor Laboratory Press. | ||
520 | |a Alternative splicing plays a crucial role in protein diversity and gene expression regulation in higher eukaryotes, and mutations causing dysregulated splicing underlie a range of genetic diseases. Computational prediction of alternative splicing from genomic sequences not only provides insight into gene-regulatory mechanisms but also helps identify disease-causing mutations and drug targets. However, the current methods for the quantitative prediction of splice site usage still have limited accuracy. Here, we present DeltaSplice, a deep neural network model optimized to learn the impact of mutations on quantitative changes in alternative splicing from the comparative analysis of homologous genes. The model architecture enables DeltaSplice to perform "reference-informed prediction" by incorporating the known splice site usage of a reference gene sequence to improve its prediction on splicing-altering mutations. We benchmarked DeltaSplice and several other state-of-the-art methods on various prediction tasks, including evolutionary sequence divergence on lineage-specific splicing and splicing-altering mutations in human populations and neurodevelopmental disorders, and demonstrated that DeltaSplice outperformed consistently. DeltaSplice predicted ∼15% of splicing quantitative trait loci (sQTLs) in the human brain as causal splicing-altering variants. It also predicted splicing-altering de novo mutations outside the splice sites in a subset of patients affected by autism and other neurodevelopmental disorders (NDDs), including 19 genes with recurrent splicing-altering mutations. Integration of splicing-altering mutations with other types of de novo mutation burdens allowed the prediction of eight novel NDD-risk genes. Our work expanded the capacity of in silico splicing models with potential applications in genetic diagnosis and the development of splicing-based precision medicine | ||
650 | 4 | |a Journal Article | |
650 | 7 | |a RNA Splice Sites |2 NLM | |
700 | 1 | |a Bao, Suying |e verfasserin |4 aut | |
700 | 1 | |a Wang, Ye |e verfasserin |4 aut | |
700 | 1 | |a Li, Wenxing |e verfasserin |4 aut | |
700 | 1 | |a Chen, Hao |e verfasserin |4 aut | |
700 | 1 | |a Shen, Yufeng |e verfasserin |4 aut | |
700 | 1 | |a Jiang, Tao |e verfasserin |4 aut | |
700 | 1 | |a Zhang, Chaolin |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Genome research |d 1996 |g 34(2024), 7 vom: 20. Aug., Seite 1052-1065 |w (DE-627)NLM085678031 |x 1549-5469 |7 nnns |
773 | 1 | 8 | |g volume:34 |g year:2024 |g number:7 |g day:20 |g month:08 |g pages:1052-1065 |
856 | 4 | 0 | |u http://dx.doi.org/10.1101/gr.279044.124 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a SYSFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 34 |j 2024 |e 7 |b 20 |c 08 |h 1052-1065 |