Please use this identifier to cite or link to this item:
http://repository.aaup.edu/jspui/handle/123456789/1681
Title: | An Efficient High-dimensional Feature Selection Approach Driven By Enhanced Multi-strategy Grey Wolf Optimizer for Biological Data Classification |
Authors: | Mafarja, Majdi $Other$Palestinian Thaher, Thaer$AAUP$Palestinian Too, Jingwei $Other$Other Chantar, Hamouda$Other$Other Turabieh, Hamza $Other$Other Houssein, Essam H.$Other$Other Emam, Marwa M.$Other$Other |
Keywords: | Feature selection Binary grey wolf optimizer Classification Meta-heuristics Biological data |
Issue Date: | Jan-2023 |
Publisher: | Neural Computing and Applications / Springer |
Citation: | Mafarja, M., Thaher, T., Too, J. et al. An Efficient High-dimensional Feature Selection Approach Driven By Enhanced Multi-strategy Grey Wolf Optimizer for Biological Data Classification. Neural Comput & Applic 35, 1749–1775 (2023). https://doi.org/10.1007/s00521-022-07836-8 |
Series/Report no.: | 35; |
Abstract: | Biological data generally contain complex and high-dimensional samples. In addition, the number of samples in biological datasets is much fewer than the number of features, so the vast number of features should be selected carefully and determine the optimal subset of features. Feature selection (FS) is a vital stage in biological data mining applications (e.g., classification) for dealing with the curse of dimensionality problems and finding highly informative features. This work proposes an effective FS approach based on a new version of Gray Wolf Optimizer (GWO) called Multi-strategy Gray Wolf Optimizer (MSGWO) for better features selection for biological data classification. The use of MSGWO in feature selection is to find the optimal subset of features between classes, solve premature convergence, and enhance the local search ability of the GWO algorithm. Multiple exploration and exploitation strategies are proposed to enhance the global search and local search abilities of the GWO algorithm through the optimization process. The support vector machine (SVM) classifier is used to evaluate the proposed GWO-based FS approaches. MSGWO was evaluated on thirteen high dimensional biological datasets obtained from the UCI repository with a smaller number of instances. The reported results confirm that employing multiple exploration and multiple exploitation strategies is highly useful for enhancing the search tendency of the MSGWO in the FS domain. Statistical tests proved that the superiority of the proposed approach is statistically significant as compared to the basic GWO and similar wrapper-based FS techniques, including binary particle swarm optimization (BPSO), binary bat algorithm (BBA), binary gravitational search algorithm (BGSA), and binary whale optimization algorithm (BWOA). In terms of classification accuracy, MSGWO yielded better accuracy rates than the standard GWO algorithm on 84% of applied biological datasets. MSGWO also recorded better accuracy rates than its other competitors in all 13 cases. In terms of the lowest number of selected features, MSGWO yielded excellent reduction rates compared to its peers. |
URI: | http://repository.aaup.edu/jspui/handle/123456789/1681 |
ISSN: | ttps://doi.org/10.1007/s00521-022-07836-8 |
Appears in Collections: | Faculty & Staff Scientific Research publications |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
paper-sample.pdf | sample | 86.2 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
Admin Tools