A novel method to guide biomarker combinations to optimize the sensitivity
Abstract Logistic regression has demonstrated its utility in classifying binary labeled datasets through the maximum likelihood approach. However, in numerous biological and clinical contexts, the aim is often to determine coefficients that yield the highest sensitivity at the pre-specified specificity or vice versa. Therefore, the application of logistic regression is limited in such settings. To this end, we have developed an improved regression framework, SMAGS, for binary classification that, for a given specificity, finds the linear decision rule that yields the maximum sensitivity. Furthermore, we employed the method for feature selection to find the features that are satisfying the sensitivity maximization goal. We compared our method with normal logistic regression by applying it to real clinical data as well as synthetic data. In the real application data (colorectal cancer dataset), we found 14% improvement of sensitivity at 98.5% specificity.Availability and implementation Software is made available in Python (<jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/smahmoodghasemi/SMAGS">https://github.com/smahmoodghasemi/SMAGS</jats:ext-link>).
Medienart: |
Preprint |
---|
Erscheinungsjahr: |
2024 |
---|---|
Erschienen: |
2024 |
Enthalten in: |
bioRxiv.org - (2024) vom: 18. Apr. Zur Gesamtaufnahme - year:2024 |
---|
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Ghasem, Seyyed Mahmood [VerfasserIn] |
---|
Links: |
Volltext [kostenfrei] |
---|
Themen: |
---|
doi: |
10.1101/2024.04.12.589302 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
XBI043290981 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | XBI043290981 | ||
003 | DE-627 | ||
005 | 20240419090810.0 | ||
007 | cr uuu---uuuuu | ||
008 | 240417s2024 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1101/2024.04.12.589302 |2 doi | |
035 | |a (DE-627)XBI043290981 | ||
035 | |a (biorXiv)10.1101/2024.04.12.589302 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Ghasem, Seyyed Mahmood |e verfasserin |0 (orcid)0000-0003-3313-9597 |4 aut | |
245 | 1 | 0 | |a A novel method to guide biomarker combinations to optimize the sensitivity |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
520 | |a Abstract Logistic regression has demonstrated its utility in classifying binary labeled datasets through the maximum likelihood approach. However, in numerous biological and clinical contexts, the aim is often to determine coefficients that yield the highest sensitivity at the pre-specified specificity or vice versa. Therefore, the application of logistic regression is limited in such settings. To this end, we have developed an improved regression framework, SMAGS, for binary classification that, for a given specificity, finds the linear decision rule that yields the maximum sensitivity. Furthermore, we employed the method for feature selection to find the features that are satisfying the sensitivity maximization goal. We compared our method with normal logistic regression by applying it to real clinical data as well as synthetic data. In the real application data (colorectal cancer dataset), we found 14% improvement of sensitivity at 98.5% specificity.Availability and implementation Software is made available in Python (<jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/smahmoodghasemi/SMAGS">https://github.com/smahmoodghasemi/SMAGS</jats:ext-link>) | ||
650 | 4 | |a Biology |7 (dpeaa)DE-84 | |
650 | 4 | |a 570 |7 (dpeaa)DE-84 | |
700 | 1 | |a Fahrmann, Johannes F. |e verfasserin |4 aut | |
700 | 1 | |a Hanash, Samir |e verfasserin |4 aut | |
700 | 1 | |a Do, Kim-Anh |e verfasserin |4 aut | |
700 | 1 | |a Long, James P. |e verfasserin |4 aut | |
700 | 1 | |a Irajizad, Ehsan |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t bioRxiv.org |g (2024) vom: 18. Apr. |
773 | 1 | 8 | |g year:2024 |g day:18 |g month:04 |
856 | 4 | 0 | |u http://dx.doi.org/10.1101/2024.04.12.589302 |x 0 |z kostenfrei |3 Volltext |
912 | |a GBV_XBI | ||
951 | |a AR | ||
952 | |j 2024 |b 18 |c 04 |