A biplot correlation range for group-wise metabolite selection in mass spectrometry

Abstract

Background: Analytic methods are available to acquire extensive metabolic information in a cost-effective manner for personalized medicine, yet assessment of disease risk and diagnosis mostly relies on individual biomarkers selected by statistical principles of false discovery rate and correlation. Because of functional redundancies and multiple layers of regulation in complex biologic systems, individual biomarkers, while useful, are inherently limited in disease characterization. Data reduction and discriminant analysis tools such as principal component analysis (PCA), partial least squares (PLS), and orthogonal PLS (O-PLS) can separate metabolic phenotypes, but they do not offer a statistical basis for the group-wise selection of metabolites that contribute to those phenotypes.

Methods: We present a dimensionality-reduction-based approach termed "biplot correlation range" (BCR), which combines biplot correlation analysis with direct orthogonal signal correction and PLS to provide group-wise selection of metabolic markers contributing to metabolic phenotypes.

Results: Using a simulated multiple-layer system of the kind that often arises in complex biologic systems, we show the feasibility of the proposed approach and its superiority over existing approaches based on false discovery rate and correlation. To demonstrate the method on a real-life dataset, we used LC-MS-based metabolomics to determine the spectrum of metabolites present in liver mitochondria from wild-type (WT) mice and thioredoxin-2 transgenic (TG) mice. Using BCR, we select discriminatory variables in terms of increased score in the direction of class identity. The results show that BCR provides a means to identify metabolites contributing to class separation that methods based on false discovery rate or statistical total correlation spectroscopy can hardly find in complex data analysis for predictive health and personalized medicine.

Media type:

E-article

Year of publication:

2019

Published:

2019

Contained in:

To the parent record - volume:12

Contained in:

BioData Mining - 12(2019), 1, page 24

Language:

English

Contributors:

Youngja H Park [author]
Taewoon Kong [author]
James R. Roede [author]
Dean P. Jones [author]
Kichun Lee [author]

Links:

doi.org [free access]
doaj.org [free access]
link.springer.com [free access]
Journal TOC [free access]

Subjects:

Analysis
Biplot correlation
Computer applications to medicine. Medical informatics
Feature selection
Metabolomics

doi:

10.1186/s13040-019-0191-2

funding:

Funding institution / Project title:

PPN (catalog ID):

DOAJ007123884