Benchmarking of human Y-chromosomal haplogroup classifiers with whole-genome and whole-exome sequence data

© 2023 The Authors.

In anthropological, medical, and forensic studies, the nonrecombinant region of the human Y chromosome (NRY) enables accurate reconstruction of pedigree relationships and retrieval of ancestral information. Using high-throughput sequencing (HTS) data, we present a benchmarking analysis of command-line tools for NRY haplogroup classification. The evaluation was performed using paired Illumina data from whole-genome sequencing (WGS) and whole-exome sequencing (WES) experiments from 50 unrelated donors. As a validation, we also used paired WGS/WES datasets of 54 individuals from the 1000 Genomes Project. Finally, we evaluated the tools on third-generation HTS data obtained from a subset of donors and one reference sample. Our results show that WES, despite typically offering less genealogical resolution than WGS, is an effective method for determining the NRY haplogroup. Y-LineageTracker and Yleaf showed the highest accuracy for WGS data, correctly classifying 98% and 96% of the samples, respectively. Yleaf outperformed all benchmarked tools on the WES data, correctly classifying approximately 90% of the samples. Yleaf, Y-LineageTracker, and pathPhynder correctly classified most samples (88%) sequenced with third-generation HTS. As a result, Yleaf provides the best performance for applications that use both WGS and WES. Overall, our study offers researchers a guide for selecting the most appropriate tool to analyze the NRY region using both second- and third-generation HTS data.

Media type:

E-article

Year of publication:

2023

Published:

2023

Contained in:

Computational and structural biotechnology journal - 21(2023), dated 09., pages 4613-4618

Language:

English

Contributors:

García-Olivares, Víctor [Author]
Muñoz-Barrera, Adrián [Author]
Rubio-Rodríguez, Luis A [Author]
Jáspez, David [Author]
Díaz-de Usera, Ana [Author]
Iñigo-Campos, Antonio [Author]
Veeramah, Krishna R [Author]
Alonso, Santos [Author]
Thomas, Mark G [Author]
Lorenzo-Salazar, José M [Author]
González-Montelongo, Rafaela [Author]
Flores, Carlos [Author]

Subjects:

Comparative genomics
Journal Article
NRY haplogroup classification
Next-generation sequencing
Population genetics
Y chromosome

Notes:

Date Revised 12.10.2023

published: Electronic-eCollection

Citation Status PubMed-not-MEDLINE

doi:

10.1016/j.csbj.2023.09.012

PPN (catalog ID):

NLM363108726