Semantic Similarity Analysis Reveals Robust Gene-Disease Relationships in Developmental and Epileptic Encephalopathies
Copyright © 2020 The Author(s). Published by Elsevier Inc. All rights reserved..
More than 100 genetic etiologies have been identified in developmental and epileptic encephalopathies (DEEs), but correlating genetic findings with clinical features at scale has remained a hurdle because of a lack of frameworks for analyzing heterogenous clinical data. Here, we analyzed 31,742 Human Phenotype Ontology (HPO) terms in 846 individuals with existing whole-exome trio data and assessed associated clinical features and phenotypic relatedness by using HPO-based semantic similarity analysis for individuals with de novo variants in the same gene. Gene-specific phenotypic signatures included associations of SCN1A with "complex febrile seizures" (HP: 0011172; p = 2.1 × 10-5) and "focal clonic seizures" (HP: 0002266; p = 8.9 × 10-6), STXBP1 with "absent speech" (HP: 0001344; p = 1.3 × 10-11), and SLC6A1 with "EEG with generalized slow activity" (HP: 0010845; p = 0.018). Of 41 genes with de novo variants in two or more individuals, 11 genes showed significant phenotypic similarity, including SCN1A (n = 16, p < 0.0001), STXBP1 (n = 14, p = 0.0021), and KCNB1 (n = 6, p = 0.011). Including genetic and phenotypic data of control subjects increased phenotypic similarity for all genetic etiologies, whereas the probability of observing de novo variants decreased, emphasizing the conceptual differences between semantic similarity analysis and approaches based on the expected number of de novo events. We demonstrate that HPO-based phenotype analysis captures unique profiles for distinct genetic etiologies, reflecting the breadth of the phenotypic spectrum in genetic epilepsies. Semantic similarity can be used to generate statistical evidence for disease causation analogous to the traditional approach of primarily defining disease entities through similar clinical features.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2020 |
---|---|
Erschienen: |
2020 |
Enthalten in: |
Zur Gesamtaufnahme - volume:107 |
---|---|
Enthalten in: |
American journal of human genetics - 107(2020), 4 vom: 01. Okt., Seite 683-697 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Galer, Peter D [VerfasserIn] |
---|
Links: |
---|
Anmerkungen: |
Date Completed 30.11.2020 Date Revised 07.12.2022 published: Print-Electronic Citation Status MEDLINE |
---|
doi: |
10.1016/j.ajhg.2020.08.003 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM314268030 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM314268030 | ||
003 | DE-627 | ||
005 | 20231225152349.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231225s2020 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1016/j.ajhg.2020.08.003 |2 doi | |
028 | 5 | 2 | |a pubmed24n1047.xml |
035 | |a (DE-627)NLM314268030 | ||
035 | |a (NLM)32853554 | ||
035 | |a (PII)S0002-9297(20)30271-8 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Galer, Peter D |e verfasserin |4 aut | |
245 | 1 | 0 | |a Semantic Similarity Analysis Reveals Robust Gene-Disease Relationships in Developmental and Epileptic Encephalopathies |
264 | 1 | |c 2020 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 30.11.2020 | ||
500 | |a Date Revised 07.12.2022 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a Copyright © 2020 The Author(s). Published by Elsevier Inc. All rights reserved. | ||
520 | |a More than 100 genetic etiologies have been identified in developmental and epileptic encephalopathies (DEEs), but correlating genetic findings with clinical features at scale has remained a hurdle because of a lack of frameworks for analyzing heterogenous clinical data. Here, we analyzed 31,742 Human Phenotype Ontology (HPO) terms in 846 individuals with existing whole-exome trio data and assessed associated clinical features and phenotypic relatedness by using HPO-based semantic similarity analysis for individuals with de novo variants in the same gene. Gene-specific phenotypic signatures included associations of SCN1A with "complex febrile seizures" (HP: 0011172; p = 2.1 × 10-5) and "focal clonic seizures" (HP: 0002266; p = 8.9 × 10-6), STXBP1 with "absent speech" (HP: 0001344; p = 1.3 × 10-11), and SLC6A1 with "EEG with generalized slow activity" (HP: 0010845; p = 0.018). Of 41 genes with de novo variants in two or more individuals, 11 genes showed significant phenotypic similarity, including SCN1A (n = 16, p < 0.0001), STXBP1 (n = 14, p = 0.0021), and KCNB1 (n = 6, p = 0.011). Including genetic and phenotypic data of control subjects increased phenotypic similarity for all genetic etiologies, whereas the probability of observing de novo variants decreased, emphasizing the conceptual differences between semantic similarity analysis and approaches based on the expected number of de novo events. We demonstrate that HPO-based phenotype analysis captures unique profiles for distinct genetic etiologies, reflecting the breadth of the phenotypic spectrum in genetic epilepsies. Semantic similarity can be used to generate statistical evidence for disease causation analogous to the traditional approach of primarily defining disease entities through similar clinical features | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Research Support, N.I.H., Extramural | |
650 | 4 | |a Research Support, Non-U.S. Gov't | |
650 | 4 | |a Human Phenotype Ontology | |
650 | 4 | |a childhood epilepsies | |
650 | 4 | |a computational phenotypes | |
650 | 4 | |a developmental and epileptic encephalopathies | |
650 | 4 | |a electronic medical records | |
650 | 4 | |a neurogenetic disorders | |
650 | 4 | |a whole-exome sequencing | |
650 | 7 | |a GABA Plasma Membrane Transport Proteins |2 NLM | |
650 | 7 | |a KCNB1 protein, human |2 NLM | |
650 | 7 | |a Munc18 Proteins |2 NLM | |
650 | 7 | |a NAV1.1 Voltage-Gated Sodium Channel |2 NLM | |
650 | 7 | |a SCN1A protein, human |2 NLM | |
650 | 7 | |a SLC6A1 protein, human |2 NLM | |
650 | 7 | |a STXBP1 protein, human |2 NLM | |
650 | 7 | |a Shab Potassium Channels |2 NLM | |
700 | 1 | |a Ganesan, Shiva |e verfasserin |4 aut | |
700 | 1 | |a Lewis-Smith, David |e verfasserin |4 aut | |
700 | 1 | |a McKeown, Sarah E |e verfasserin |4 aut | |
700 | 1 | |a Pendziwiat, Manuela |e verfasserin |4 aut | |
700 | 1 | |a Helbig, Katherine L |e verfasserin |4 aut | |
700 | 1 | |a Ellis, Colin A |e verfasserin |4 aut | |
700 | 1 | |a Rademacher, Annika |e verfasserin |4 aut | |
700 | 1 | |a Smith, Lacey |e verfasserin |4 aut | |
700 | 1 | |a Poduri, Annapurna |e verfasserin |4 aut | |
700 | 1 | |a Seiffert, Simone |e verfasserin |4 aut | |
700 | 1 | |a von Spiczak, Sarah |e verfasserin |4 aut | |
700 | 1 | |a Muhle, Hiltrud |e verfasserin |4 aut | |
700 | 1 | |a van Baalen, Andreas |e verfasserin |4 aut | |
700 | 0 | |a NCEE Study Group |e verfasserin |4 aut | |
700 | 0 | |a EPGP Investigators |e verfasserin |4 aut | |
700 | 0 | |a EuroEPINOMICS-RES Consortium |e verfasserin |4 aut | |
700 | 0 | |a Genomics Research and Innovation Network |e verfasserin |4 aut | |
700 | 1 | |a Thomas, Rhys H |e verfasserin |4 aut | |
700 | 1 | |a Krause, Roland |e verfasserin |4 aut | |
700 | 1 | |a Weber, Yvonne |e verfasserin |4 aut | |
700 | 1 | |a Helbig, Ingo |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t American journal of human genetics |d 1949 |g 107(2020), 4 vom: 01. Okt., Seite 683-697 |w (DE-627)NLM000025461 |x 1537-6605 |7 nnns |
773 | 1 | 8 | |g volume:107 |g year:2020 |g number:4 |g day:01 |g month:10 |g pages:683-697 |
856 | 4 | 0 | |u http://dx.doi.org/10.1016/j.ajhg.2020.08.003 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 107 |j 2020 |e 4 |b 01 |c 10 |h 683-697 |