Evaluation of imputation performance of multiple reference panels in a Pakistani population
Abstract: Genotype imputation is crucial for GWAS, but reference panels and existing benchmarking studies prioritize European individuals. Consequently, it is unclear which publicly available reference panel should be used for Pakistani individuals, and whether the ancestry composition or the sample size of the panel matters more for imputation accuracy. Our study compared different reference panels for imputing genotype data in 1814 Pakistani individuals and found that meta-imputation with TOPMed and the expanded 1000 Genomes (ex1KG) reference gave the best balance of accuracy and coverage. ex1KG imputed more accurately than TOPMed despite its 30-fold smaller sample size, supporting efforts to create future panels with diverse populations.
Media type: | Preprint |
Year of publication: | 2023 |
Published: | 2023 |
Contained in: | bioRxiv.org - (2023), 29 Dec. - year:2023 |
Language: | English |
Contributors: | Xu, Jiayi [author] |
Links: | Full text [free of charge] |
doi: | 10.1101/2023.12.22.23300448 |
PPN (catalog ID): | XBI041995333 |
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | XBI041995333 | ||
003 | DE-627 | ||
005 | 20231230090441.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231227s2023 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1101/2023.12.22.23300448 |2 doi | |
035 | |a (DE-627)XBI041995333 | ||
035 | |a (biorXiv)10.1101/2023.12.22.23300448 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Xu, Jiayi |e verfasserin |4 aut | |
245 | 1 | 0 | |a Evaluation of imputation performance of multiple reference panels in a Pakistani population |
264 | 1 | |c 2023 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
520 | |a Abstract Genotype imputation is crucial for GWAS, but reference panels and existing benchmarking studies prioritize European individuals. Consequently, it is unclear which publicly available reference panel should be used for Pakistani individuals, and whether ancestry composition or sample size of the panel matters more for imputation accuracy. Our study compared different reference panels to impute genotype data in 1814 Pakistani individuals, finding the best performance balancing accuracy and coverage with meta-imputation with TOPMed and the expanded 1000 Genomes (ex1KG) reference. Imputation accuracy of ex1KG outperformed TOPMed despite its 30-fold smaller sample size, supporting efforts to create future panels with diverse populations. | ||
650 | 4 | |a Biology |7 (dpeaa)DE-84 | |
650 | 4 | |a 570 |7 (dpeaa)DE-84 | |
700 | 1 | |a Liu, Dongjing |4 aut | |
700 | 1 | |a Hassan, Arsalan |4 aut | |
700 | 1 | |a Genovese, Giulio |4 aut | |
700 | 1 | |a Cote, Alanna C. |4 aut | |
700 | 1 | |a Fennessy, Brian |4 aut | |
700 | 1 | |a Cheng, Esther |4 aut | |
700 | 1 | |a Charney, Alexander W. |4 aut | |
700 | 1 | |a Knowles, James A. |4 aut | |
700 | 1 | |a Ayub, Muhammad |4 aut | |
700 | 1 | |a Peterson, Roseann E. |4 aut | |
700 | 1 | |a Bigdeli, Tim B. |4 aut | |
700 | 1 | |a Huckins, Laura M. |0 (orcid)0000-0002-5369-6502 |4 aut | |
773 | 0 | 8 | |i Enthalten in |t bioRxiv.org |g (2023) vom: 29. Dez. |
773 | 1 | 8 | |g year:2023 |g day:29 |g month:12 |
856 | 4 | 0 | |u http://dx.doi.org/10.1101/2023.12.22.23300448 |z kostenfrei |3 Volltext |
912 | |a GBV_XBI | ||
951 | |a AR | ||
952 | |j 2023 |b 29 |c 12 |