Classification of breed combinations for slaughter pigs based on genotypes-modeling DNA samples of crossbreeds as fuzzy sets from purebred founders

Copyright © 2023 Vinje, Brustad, Heggli, Sevillano, Van Son and Gangsei..

In pig production, the production animals are generally three- or four-way crossbreeds. Reliable information regarding the breed of origin of slaughtered pigs is useful, even a prerequisite, for a number of purposes, e.g., evaluating potential breed effects on carcass grading. Genetic data from slaughtered pigs can easily be extracted and used for crossbreed classification. In the current study, four classification methods, namely, random forest (RF), ADMIXTURE, partial least squares regression (PLSR), and partial least squares together with quadratic discriminant analysis (PLS-QDA) were evaluated on simulated (n = 7,500) genomic data of crossbreeds. The derivation of the theory behind PLS-QDA is a major part of the current study, whereas RF and ADMIXTURE are known and well-described in the literature. Classification success (CS) rate, square loss (SL), and Kullback-Leibler (KL) divergence loss for the simulated data were used to compare methods. Overall, PLS-QDA performed best with 99%/0.0018/0.002 (CS/SL/KL) vs. 97%/0.0084/0.051, 97%/0.0087/0.0623, and 17%/0.068/0.39 for PLSR, ADMIXTURE, and RF, respectively. PLS-QDA and ADMIXTURE, as the most relevant methods, were used on a real dataset (n = 1,013) from Norway where the two largest classes contained 532 and 192 (PLS-QDA), and 531 and 193 (ADMIXTURE) individuals, respectively. These two classes were expected to be dominating a priori. The Bayesian nature of PLS-QDA enables inclusion of desirable features such as a separate class "unknown breed combination" and informative priors for crossbreeds, making this a preferable method for the classification of breed combination in the industry.

Medienart:

E-Artikel

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

Zur Gesamtaufnahme - volume:14

Enthalten in:

Frontiers in genetics - 14(2023) vom: 01., Seite 1289130

Sprache:

Englisch

Beteiligte Personen:

Vinje, H [VerfasserIn]
Brustad, H K [VerfasserIn]
Heggli, A [VerfasserIn]
Sevillano, C A [VerfasserIn]
Van Son, M [VerfasserIn]
Gangsei, L E [VerfasserIn]

Links:

Volltext

Themen:

ADMIXTURE
Breed classification
Crossbreeds
Fuzzy classification
Journal Article
Partial least squares
Quadratic discriminant analysis
Single-nucleotide polymorphism
Slaughter pigs

Anmerkungen:

Date Revised 21.12.2023

published: Electronic-eCollection

Citation Status PubMed-not-MEDLINE

doi:

10.3389/fgene.2023.1289130

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM366069659