A versatile, fast and unbiased method for estimation of gene-by-environment interaction effects on biobank-scale datasets
© 2023. Springer Nature Limited..
Identification of gene-by-environment interactions (GxE) is crucial to understand the interplay of environmental effects on complex traits. However, current methods evaluating GxE on biobank-scale datasets have limitations. We introduce MonsterLM, a multiple linear regression method that does not rely on model specification and provides unbiased estimates of variance explained by GxE. We demonstrate robustness of MonsterLM through comprehensive genome-wide simulations using real genetic data from 325,989 individuals. We estimate GxE using waist-to-hip-ratio, smoking, and exercise as the environmental variables on 13 outcomes (N = 297,529-325,989) in the UK Biobank. GxE variance is significant for 8 environment-outcome pairs, ranging from 0.009 - 0.071. The majority of GxE variance involves SNPs without strong marginal or interaction associations. We observe modest improvements in polygenic score prediction when incorporating GxE. Our results imply a significant contribution of GxE to complex trait variance and we show MonsterLM to be well-purposed to handle this with biobank-scale data.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2023 |
---|---|
Erschienen: |
2023 |
Enthalten in: |
Zur Gesamtaufnahme - volume:14 |
---|---|
Enthalten in: |
Nature communications - 14(2023), 1 vom: 25. Aug., Seite 5196 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Di Scipio, Matteo [VerfasserIn] |
---|
Links: |
---|
Themen: |
---|
Anmerkungen: |
Date Completed 28.08.2023 Date Revised 19.11.2023 published: Electronic Citation Status MEDLINE |
---|
doi: |
10.1038/s41467-023-40913-7 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM361242484 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM361242484 | ||
003 | DE-627 | ||
005 | 20231226084730.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2023 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1038/s41467-023-40913-7 |2 doi | |
028 | 5 | 2 | |a pubmed24n1204.xml |
035 | |a (DE-627)NLM361242484 | ||
035 | |a (NLM)37626057 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Di Scipio, Matteo |e verfasserin |4 aut | |
245 | 1 | 2 | |a A versatile, fast and unbiased method for estimation of gene-by-environment interaction effects on biobank-scale datasets |
264 | 1 | |c 2023 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 28.08.2023 | ||
500 | |a Date Revised 19.11.2023 | ||
500 | |a published: Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © 2023. Springer Nature Limited. | ||
520 | |a Identification of gene-by-environment interactions (GxE) is crucial to understand the interplay of environmental effects on complex traits. However, current methods evaluating GxE on biobank-scale datasets have limitations. We introduce MonsterLM, a multiple linear regression method that does not rely on model specification and provides unbiased estimates of variance explained by GxE. We demonstrate robustness of MonsterLM through comprehensive genome-wide simulations using real genetic data from 325,989 individuals. We estimate GxE using waist-to-hip-ratio, smoking, and exercise as the environmental variables on 13 outcomes (N = 297,529-325,989) in the UK Biobank. GxE variance is significant for 8 environment-outcome pairs, ranging from 0.009 - 0.071. The majority of GxE variance involves SNPs without strong marginal or interaction associations. We observe modest improvements in polygenic score prediction when incorporating GxE. Our results imply a significant contribution of GxE to complex trait variance and we show MonsterLM to be well-purposed to handle this with biobank-scale data | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Research Support, Non-U.S. Gov't | |
700 | 1 | |a Khan, Mohammad |e verfasserin |4 aut | |
700 | 1 | |a Mao, Shihong |e verfasserin |4 aut | |
700 | 1 | |a Chong, Michael |e verfasserin |4 aut | |
700 | 1 | |a Judge, Conor |e verfasserin |4 aut | |
700 | 1 | |a Pathan, Nazia |e verfasserin |4 aut | |
700 | 1 | |a Perrot, Nicolas |e verfasserin |4 aut | |
700 | 1 | |a Nelson, Walter |e verfasserin |4 aut | |
700 | 1 | |a Lali, Ricky |e verfasserin |4 aut | |
700 | 1 | |a Di, Shuang |e verfasserin |4 aut | |
700 | 1 | |a Morton, Robert |e verfasserin |4 aut | |
700 | 1 | |a Petch, Jeremy |e verfasserin |4 aut | |
700 | 1 | |a Paré, Guillaume |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Nature communications |d 2010 |g 14(2023), 1 vom: 25. Aug., Seite 5196 |w (DE-627)NLM199274525 |x 2041-1723 |7 nnns |
773 | 1 | 8 | |g volume:14 |g year:2023 |g number:1 |g day:25 |g month:08 |g pages:5196 |
856 | 4 | 0 | |u http://dx.doi.org/10.1038/s41467-023-40913-7 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 14 |j 2023 |e 1 |b 25 |c 08 |h 5196 |