Benchmarking of human Y-chromosomal haplogroup classifiers with whole-genome and whole-exome sequence data
© 2023 The Authors..
In anthropological, medical, and forensic studies, the nonrecombinant region of the human Y chromosome (NRY) enables accurate reconstruction of pedigree relationships and retrieval of ancestral information. Using high-throughput sequencing (HTS) data, we present a benchmarking analysis of command-line tools for NRY haplogroup classification. The evaluation was performed using paired Illumina data from whole-genome sequencing (WGS) and whole-exome sequencing (WES) experiments from 50 unrelated donors. Additionally, as a validation, we also used paired WGS/WES datasets of 54 individuals from the 1000 Genomes Project. Finally, we evaluated the tools on data from third-generation HTS obtained from a subset of donors and one reference sample. Our results show that WES, despite typically offering less genealogical resolution than WGS, is an effective method for determining the NRY haplogroup. Y-LineageTracker and Yleaf showed the highest accuracy for WGS data, classifying precisely 98% and 96% of the samples, respectively. Yleaf outperforms all benchmarked tools in the WES data, classifying approximately 90% of the samples. Yleaf, Y-LineageTracker, and pathPhynder can correctly classify most samples (88%) sequenced with third-generation HTS. As a result, Yleaf provides the best performance for applications that use WGS and WES. Overall, our study offers researchers with a guide that allows them to select the most appropriate tool to analyze the NRY region using both second- and third-generation HTS data.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2023 |
---|---|
Erschienen: |
2023 |
Enthalten in: |
Zur Gesamtaufnahme - volume:21 |
---|---|
Enthalten in: |
Computational and structural biotechnology journal - 21(2023) vom: 09., Seite 4613-4618 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
García-Olivares, Víctor [VerfasserIn] |
---|
Links: |
---|
Themen: |
Comparative genomics |
---|
Anmerkungen: |
Date Revised 12.10.2023 published: Electronic-eCollection Citation Status PubMed-not-MEDLINE |
---|
doi: |
10.1016/j.csbj.2023.09.012 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM363108726 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM363108726 | ||
003 | DE-627 | ||
005 | 20231226092641.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2023 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1016/j.csbj.2023.09.012 |2 doi | |
028 | 5 | 2 | |a pubmed24n1210.xml |
035 | |a (DE-627)NLM363108726 | ||
035 | |a (NLM)37817776 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a García-Olivares, Víctor |e verfasserin |4 aut | |
245 | 1 | 0 | |a Benchmarking of human Y-chromosomal haplogroup classifiers with whole-genome and whole-exome sequence data |
264 | 1 | |c 2023 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Revised 12.10.2023 | ||
500 | |a published: Electronic-eCollection | ||
500 | |a Citation Status PubMed-not-MEDLINE | ||
520 | |a © 2023 The Authors. | ||
520 | |a In anthropological, medical, and forensic studies, the nonrecombinant region of the human Y chromosome (NRY) enables accurate reconstruction of pedigree relationships and retrieval of ancestral information. Using high-throughput sequencing (HTS) data, we present a benchmarking analysis of command-line tools for NRY haplogroup classification. The evaluation was performed using paired Illumina data from whole-genome sequencing (WGS) and whole-exome sequencing (WES) experiments from 50 unrelated donors. Additionally, as a validation, we also used paired WGS/WES datasets of 54 individuals from the 1000 Genomes Project. Finally, we evaluated the tools on data from third-generation HTS obtained from a subset of donors and one reference sample. Our results show that WES, despite typically offering less genealogical resolution than WGS, is an effective method for determining the NRY haplogroup. Y-LineageTracker and Yleaf showed the highest accuracy for WGS data, classifying precisely 98% and 96% of the samples, respectively. Yleaf outperforms all benchmarked tools in the WES data, classifying approximately 90% of the samples. Yleaf, Y-LineageTracker, and pathPhynder can correctly classify most samples (88%) sequenced with third-generation HTS. As a result, Yleaf provides the best performance for applications that use WGS and WES. Overall, our study offers researchers with a guide that allows them to select the most appropriate tool to analyze the NRY region using both second- and third-generation HTS data | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Comparative genomics | |
650 | 4 | |a NRY haplogroup classification | |
650 | 4 | |a Next-generation sequencing | |
650 | 4 | |a Population genetics | |
650 | 4 | |a Y chromosome | |
700 | 1 | |a Muñoz-Barrera, Adrián |e verfasserin |4 aut | |
700 | 1 | |a Rubio-Rodríguez, Luis A |e verfasserin |4 aut | |
700 | 1 | |a Jáspez, David |e verfasserin |4 aut | |
700 | 1 | |a Díaz-de Usera, Ana |e verfasserin |4 aut | |
700 | 1 | |a Iñigo-Campos, Antonio |e verfasserin |4 aut | |
700 | 1 | |a Veeramah, Krishna R |e verfasserin |4 aut | |
700 | 1 | |a Alonso, Santos |e verfasserin |4 aut | |
700 | 1 | |a Thomas, Mark G |e verfasserin |4 aut | |
700 | 1 | |a Lorenzo-Salazar, José M |e verfasserin |4 aut | |
700 | 1 | |a González-Montelongo, Rafaela |e verfasserin |4 aut | |
700 | 1 | |a Flores, Carlos |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Computational and structural biotechnology journal |d 2012 |g 21(2023) vom: 09., Seite 4613-4618 |w (DE-627)NLM218960549 |x 2001-0370 |7 nnns |
773 | 1 | 8 | |g volume:21 |g year:2023 |g day:09 |g pages:4613-4618 |
856 | 4 | 0 | |u http://dx.doi.org/10.1016/j.csbj.2023.09.012 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 21 |j 2023 |b 09 |h 4613-4618 |