Please use this identifier to cite or link to this item: http://repository.aaup.edu/jspui/handle/123456789/1707
Full metadata record
DC FieldValueLanguage
dc.contributor.authorMafarja, Majdi $Other$Palestinian-
dc.contributor.authorThaher, Thaer$AAUP$Palestinian-
dc.contributor.authorAzmi Al-Betar, Mohammed $Other$Other-
dc.contributor.authorToo, Jingwei $Other$Other-
dc.contributor.authorA. Awadallah, Mohammed $Other$Other-
dc.contributor.authorDoush, Iyad Abu$Other$Other-
dc.contributor.authorTurabieh, Hamza $Other$Other-
dc.date.accessioned2023-10-05T07:51:06Z-
dc.date.available2023-10-05T07:51:06Z-
dc.date.issued2023-02-09-
dc.identifier.citationMafarja, M., Thaher, T., Al-Betar, M.A. et al. Classification framework for faulty-software using enhanced exploratory whale optimizer-based feature selection scheme and random forest ensemble learning. Appl Intell (2023). https://doi.org/10.1007/s10489-022-04427-xen_US
dc.identifier.issnhttps://doi.org/10.1007/s10489-022-04427-x-
dc.identifier.urihttp://repository.aaup.edu/jspui/handle/123456789/1707-
dc.description.abstractSoftware Fault Prediction (SFP) is an important process to detect the faulty components of the software to detect faulty classes or faulty modules early in the software development life cycle. In this paper, a machine learning framework is proposed for SFP. Initially, pre-processing and re-sampling techniques are applied to make the SFP datasets ready to be used by ML techniques. Thereafter seven classifiers are compared, namely K-Nearest Neighbors (KNN), Naive Bayes (NB), Linear Discriminant Analysis (LDA), Linear Regression (LR), Decision Tree (DT), Support Vector Machine (SVM), and Random Forest (RF). The RF classifier outperforms all other classifiers in terms of eliminating irrelevant/redundant features. The performance of RF is improved further using a dimensionality reduction method called binary whale optimization algorithm (BWOA) to eliminate the irrelevant/redundant features. Finally, the performance of BWOA is enhanced by hybridizing the exploration strategies of the grey wolf optimizer (GWO) and harris hawks optimization (HHO) algorithms. The proposed method is called SBEWOA. The SFP datasets utilized are selected from the PROMISE repository using sixteen datasets for software projects with different sizes and complexity. The comparative evaluation against nine well-established feature selection methods proves that the proposed SBEWOA is able to significantly produce competitively superior results for several instances of the evaluated dataset. The algorithms’ performance is compared in terms of accuracy, the number of features, and fitness function. This is also proved by the 2-tailed P-values of the Wilcoxon signed ranks statistical test used. In conclusion, the proposed method is an efficient alternative ML method for SFP that can be used for similar problems in the software engineering domain.en_US
dc.language.isoenen_US
dc.publisherApplied Intelligence / Springeren_US
dc.subjectSoftware fault predictionen_US
dc.subjectMachine learningen_US
dc.subjectSMOTEen_US
dc.subjectDimension reductionen_US
dc.subjectMeta-heuristicsen_US
dc.subjectImbalanced dataen_US
dc.titleClassification framework for faulty-software using enhanced exploratory whale optimizer-based feature selection scheme and random forest ensemble learningen_US
dc.typeArticleen_US
Appears in Collections:Faculty & Staff Scientific Research publications

Files in This Item:
File Description SizeFormat 
paper-sample.pdfpaper_sample171.16 kBAdobe PDFView/Open
Show simple item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Admin Tools