Nested ensemble selection : An effective hybrid feature selection method

© 2023 The Author(s)..

It has been shown that while feature selection algorithms are able to distinguish between relevant and irrelevant features, they fail to differentiate between relevant and redundant and correlated features. To address this issue, we propose a highly effective approach, called Nested Ensemble Selection (NES), that is based on a combination of filter and wrapper methods. The proposed feature selection algorithm differs from the existing filter-wrapper hybrid methods in its simplicity and efficiency as well as precision. The new algorithm is able to separate the relevant variables from the irrelevant as well as the redundant and correlated features. Furthermore, we provide a robust heuristic for identifying the optimal number of selected features which remains one of the greatest challenges in feature selection. Numerical experiments on synthetic and real-life data demonstrate the effectiveness of the proposed method. The NES algorithm achieves perfect precision on the synthetic data and near optimal accuracy on the real-life data. The proposed method is compared against several popular algorithms including mRMR, Boruta, genetic, recursive feature elimination, Lasso, and Elastic Net. The results show that NES significantly outperforms the benchmarks algorithms especially on multi-class datasets.

Medienart:

E-Artikel

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

Zur Gesamtaufnahme - volume:9

Enthalten in:

Heliyon - 9(2023), 9 vom: 28. Sept., Seite e19686

Sprache:

Englisch

Beteiligte Personen:

Kamalov, Firuz [VerfasserIn]
Sulieman, Hana [VerfasserIn]
Moussa, Sherif [VerfasserIn]
Reyes, Jorge Avante [VerfasserIn]
Safaraliev, Murodbek [VerfasserIn]

Links:

Volltext

Themen:

Ensemble selection
Feature selection
Filter method
Journal Article
Machine learning
Random forest
Synthetic data
Wrapper method

Anmerkungen:

Date Revised 18.10.2023

published: Electronic-eCollection

Citation Status PubMed-not-MEDLINE

doi:

10.1016/j.heliyon.2023.e19686

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM363030239