BukaGini: A stability-aware Gini index feature selection algorithm for robust model performance

dc.contributor.authorBouke, Mohamed Aly
dc.contributor.authorAbdullah, Azizol
dc.contributor.authorFrnda, Jaroslav
dc.contributor.authorCengiz, Korhan
dc.contributor.authorSalah, Bashir
dc.date.accessioned2024-02-20T06:44:23Z
dc.date.available2024-02-20T06:44:23Z
dc.date.issued2023
dc.description.abstractFeature interaction is a vital aspect of Machine Learning (ML) algorithms, and gaining a deep understanding of these interactions can significantly enhance model performance. This paper introduces the BukaGini algorithm, an innovative and robust approach for feature interaction analysis that capitalizes on the Gini impurity index. By exploiting the unique properties of the BukaGini index, our proposed algorithm effectively captures both linear and nonlinear feature interactions, providing a richer and more comprehensive representation of the underlying data. We thoroughly evaluate the BukaGini algorithm against traditional Gini index-based methods on various real-world datasets. These datasets include the High School Students’ Performance (HSSP) dataset, which examines factors affecting student performance; Cancer Data, which focuses on identifying cancer types based on gene expression; Spambase, which targets spam email classification; and the UNSW-NB15 dataset, which addresses network intrusion detection. Our experimental results demonstrate that the BukaGini algorithm consistently outperforms traditional Gini index-based methods in terms of accuracy. Across the tested datasets, the BukaGini algorithm achieves improvements ranging from 0.32% to 2.50%, underscoring its effectiveness in handling diverse data types and problem domains. This performance gain highlights the potential of the BukaGini algorithm as a valuable tool for feature interaction analysis in various ML applications.cs
dc.description.firstpage59386cs
dc.description.lastpage59396cs
dc.description.sourceWeb of Sciencecs
dc.description.volume11cs
dc.identifier.citationIEEE Access. 2023, vol. 11, p. 59386-59396.cs
dc.identifier.doi10.1109/ACCESS.2023.3284975
dc.identifier.issn2169-3536
dc.identifier.urihttp://hdl.handle.net/10084/152211
dc.identifier.wos001016812900001
dc.language.isoencs
dc.publisherIEEEcs
dc.relation.ispartofseriesIEEE Accesscs
dc.relation.urihttps://doi.org/10.1109/ACCESS.2023.3284975cs
dc.rights.accessopenAccesscs
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/cs
dc.subjectBukaGini algorithmcs
dc.subjectGini indexcs
dc.subjectensemble learningcs
dc.subjectfeature interaction analysiscs
dc.subjectdata miningcs
dc.titleBukaGini: A stability-aware Gini index feature selection algorithm for robust model performancecs
dc.typearticlecs
dc.type.statusPeer-reviewedcs
dc.type.versionpublishedVersioncs

Files

Original bundle

Now showing 1 - 1 out of 1 results
Loading...
Thumbnail Image
Name:
2169-3536-2023v11p59386.pdf
Size:
1.09 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 out of 1 results
Loading...
Thumbnail Image
Name:
license.txt
Size:
718 B
Format:
Item-specific license agreed upon to submission
Description: