Abstract
Background: Feature selection (FS) is a crucial strategy for dimensionality reduction in data preprocessing since microarray data sets typically contain redundant and extraneous features that degrade the performance and complexity of classification models.
Objective: The purpose of feature selection is to reduce the number of features from highdimensional cancer datasets and enhance classification accuracy.
Methods: This research provides a wrapper-based hybrid model integrating information gain (IG) and Jaya algorithm (JA) for determining the optimum featured genes from high-dimensional microarray datasets. This paper's comprehensive study is divided into two segments: we employed the parameterless JA to identify the featured gene subsets in the first stage without filter methods. Various classifiers evaluate JA's performance, such as SVM, LDA, NB, and DT. In the second section, we introduce a hybrid IG-JA model. The IG is used as a filter to eliminate redundant and noisy features. The reduced feature subset is then given to the JA as a wrapper to improve the hybrid model's performance using the classifiers outlined above.
Results: We used 13 benchmark microarray data sets from the public repository for experimental analysis. It is noteworthy to state that the hybrid IG-JA model performs better as compared to its counterparts.
Conclusion: Tests and statistics show that the suggested model outperforms the standard feature selection method with JA and other existing models. Our proposed model is unable to provide the best accuracy compared to other existing approaches; however, it is quite steady and good. In the future, this work could be implemented with various filter methods and real-time data sets. A multi-filter approach with the Jaya algorithm will be used to check the efficiency of the proposed one. And it would be better to choose any other hybrid model (chaos-based) with Jaya to enhance the feature selection accuracy with a high dimensional dataset.
Graphical Abstract
[http://dx.doi.org/10.1145/3136625]
[http://dx.doi.org/10.1126/science.290.5500.2323] [PMID: 11125150]
[http://dx.doi.org/10.4018/978-1-5225-2857-9.ch001]
[http://dx.doi.org/10.1007/978-3-319-27400-3_1]
[http://dx.doi.org/10.3233/HIS-160226]
[http://dx.doi.org/10.1016/j.neucom.2011.03.034]
[http://dx.doi.org/10.1109/ACCESS.2019.2906757]
[http://dx.doi.org/10.1016/j.jksuci.2019.11.007]
[http://dx.doi.org/10.1016/j.knosys.2020.106131]
[http://dx.doi.org/10.1007/s12652-018-1031-9]
[http://dx.doi.org/10.1016/j.neucom.2014.06.067]
[http://dx.doi.org/10.1016/j.patrec.2007.05.011]
[http://dx.doi.org/10.1007/s00500-007-0193-8]
[http://dx.doi.org/10.1016/j.eswa.2020.113971]
[http://dx.doi.org/10.1016/j.csda.2019.106839]
[http://dx.doi.org/10.1016/j.compeleceng.2013.11.024]
[http://dx.doi.org/10.1016/j.asoc.2012.11.042]
[http://dx.doi.org/10.1016/j.neucom.2017.11.077]
[http://dx.doi.org/10.3390/computation7010012]
[http://dx.doi.org/10.1109/TCBB.2015.2478454]
[http://dx.doi.org/10.1016/j.asoc.2017.09.038]
[http://dx.doi.org/10.1016/j.asoc.2019.106031]
[http://dx.doi.org/10.1109/ACCESS.2019.2923846]
[http://dx.doi.org/10.1016/j.eswa.2016.01.021]
[http://dx.doi.org/10.1016/j.swevo.2017.04.002]
[http://dx.doi.org/10.1016/j.eswa.2019.06.044]
[http://dx.doi.org/10.1016/j.knosys.2017.10.028]
[http://dx.doi.org/10.1016/j.ins.2019.08.040]
[http://dx.doi.org/10.1016/j.ins.2019.05.038]
[http://dx.doi.org/10.1016/j.cose.2018.11.005]
[http://dx.doi.org/10.1007/s12652-019-01193-6]
[http://dx.doi.org/10.1007/s12652-017-0655-5]
[http://dx.doi.org/10.1016/j.eswa.2019.112824]
[http://dx.doi.org/10.1016/j.eswa.2019.113103]
[http://dx.doi.org/10.1016/j.compeleceng.2020.106963]
[http://dx.doi.org/10.1007/s13369-020-04871-2]
[http://dx.doi.org/10.5916/jkosme.2016.40.5.437]
[http://dx.doi.org/10.1080/01969720802188292]
[http://dx.doi.org/10.1016/j.asoc.2016.11.026]
[http://dx.doi.org/10.1016/j.engappai.2020.104079]
[http://dx.doi.org/10.3390/sym12030408]
[http://dx.doi.org/10.1007/s12539-020-00372-w] [PMID: 32441000]
[http://dx.doi.org/10.1504/IJDMB.2017.088538]
[http://dx.doi.org/10.14419/ijet.v7i4.15.23007]
[http://dx.doi.org/10.1177/1550147719895210]
[http://dx.doi.org/10.1109/ICIT.2017.43]
[http://dx.doi.org/10.4018/IJSIR.2019040101]