Gene hunting with hidden Markov model knockoffs
Modern scientific studies often require the identification of a subset of explanatory variables. Several statistical methods have been developed to automate this task, and the framework of knockoffs has been proposed as a general solution for variable selection under rigorous Type I error control, without relying on strong modelling assumptions. In this paper, we extend the methodology of knockoffs to problems where the distribution of the covariates can be described by a hidden Markov model. We develop an exact and efficient algorithm to sample knockoff variables in this setting and then argue that, combined with the existing selective framework, this provides a natural and powerful tool for inference in genome-wide association studies with guaranteed false discovery rate control. We apply our method to datasets on Crohn's disease and some continuous phenotypes.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2019 |
---|---|
Erschienen: |
2019 |
Enthalten in: |
Zur Gesamtaufnahme - volume:106 |
---|---|
Enthalten in: |
Biometrika - 106(2019), 1 vom: 01. März, Seite 1-18 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Sesia, M [VerfasserIn] |
---|
Links: |
---|
Themen: |
False discovery rate |
---|
Anmerkungen: |
Date Revised 19.02.2022 published: Print-Electronic Citation Status PubMed-not-MEDLINE |
---|
doi: |
10.1093/biomet/asy033 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM294247343 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM294247343 | ||
003 | DE-627 | ||
005 | 20231225081053.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231225s2019 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1093/biomet/asy033 |2 doi | |
028 | 5 | 2 | |a pubmed24n0980.xml |
035 | |a (DE-627)NLM294247343 | ||
035 | |a (NLM)30799875 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Sesia, M |e verfasserin |4 aut | |
245 | 1 | 0 | |a Gene hunting with hidden Markov model knockoffs |
264 | 1 | |c 2019 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Revised 19.02.2022 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status PubMed-not-MEDLINE | ||
520 | |a Modern scientific studies often require the identification of a subset of explanatory variables. Several statistical methods have been developed to automate this task, and the framework of knockoffs has been proposed as a general solution for variable selection under rigorous Type I error control, without relying on strong modelling assumptions. In this paper, we extend the methodology of knockoffs to problems where the distribution of the covariates can be described by a hidden Markov model. We develop an exact and efficient algorithm to sample knockoff variables in this setting and then argue that, combined with the existing selective framework, this provides a natural and powerful tool for inference in genome-wide association studies with guaranteed false discovery rate control. We apply our method to datasets on Crohn's disease and some continuous phenotypes | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a False discovery rate | |
650 | 4 | |a Genome-wide association study | |
650 | 4 | |a Knockoff | |
650 | 4 | |a Variable selection | |
700 | 1 | |a Sabatti, C |e verfasserin |4 aut | |
700 | 1 | |a Candès, E J |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Biometrika |d 1945 |g 106(2019), 1 vom: 01. März, Seite 1-18 |w (DE-627)NLM048403121 |x 0006-3444 |7 nnns |
773 | 1 | 8 | |g volume:106 |g year:2019 |g number:1 |g day:01 |g month:03 |g pages:1-18 |
856 | 4 | 0 | |u http://dx.doi.org/10.1093/biomet/asy033 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 106 |j 2019 |e 1 |b 01 |c 03 |h 1-18 |