A novel method to guide biomarker combinations to optimize the sensitivity

Abstract Logistic regression has demonstrated its utility in classifying binary labeled datasets through the maximum likelihood approach. However, in numerous biological and clinical contexts, the aim is often to determine coefficients that yield the highest sensitivity at the pre-specified specificity or vice versa. Therefore, the application of logistic regression is limited in such settings. To this end, we have developed an improved regression framework, SMAGS, for binary classification that, for a given specificity, finds the linear decision rule that yields the maximum sensitivity. Furthermore, we employed the method for feature selection to find the features that are satisfying the sensitivity maximization goal. We compared our method with normal logistic regression by applying it to real clinical data as well as synthetic data. In the real application data (colorectal cancer dataset), we found 14% improvement of sensitivity at 98.5% specificity.Availability and implementation Software is made available in Python (<jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/smahmoodghasemi/SMAGS">https://github.com/smahmoodghasemi/SMAGS</jats:ext-link>).

Medienart:

Preprint

Erscheinungsjahr:

2024

Erschienen:

2024

Enthalten in:

bioRxiv.org - (2024) vom: 18. Apr. Zur Gesamtaufnahme - year:2024

Sprache:

Englisch

Beteiligte Personen:

Ghasem, Seyyed Mahmood [VerfasserIn]
Fahrmann, Johannes F. [VerfasserIn]
Hanash, Samir [VerfasserIn]
Do, Kim-Anh [VerfasserIn]
Long, James P. [VerfasserIn]
Irajizad, Ehsan [VerfasserIn]

Links:

Volltext [kostenfrei]

Themen:

570
Biology

doi:

10.1101/2024.04.12.589302

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

XBI043290981