DENIZ, Ayca and KIZILOZ, Hakan Ezgi (2022). Boosting initial population in multiobjective feature selection with knowledge-based partitioning. In: 2022 International Joint Conference on Neural Networks (IJCNN). IEEE. [Book Section]
Documents
31056:610797
PDF
Boosting_Initial_Population_in_Multiobjective_Feature_Selection_with_Knowledge_Based_Partitioning.pdf - Accepted Version
Available under License All rights reserved.
Boosting_Initial_Population_in_Multiobjective_Feature_Selection_with_Knowledge_Based_Partitioning.pdf - Accepted Version
Available under License All rights reserved.
Download (265kB) | Preview
Abstract
The quality of features is one of the main factors that affect classification performance. Feature selection aims to remove irrelevant and redundant features from data in order to increase classification accuracy. However, identifying these features is not a trivial task due to a large search space. Evolutionary algorithms have been proven to be effective in many optimization problems, including feature selection. These algorithms require an initial population to start their search mechanism, and a poor initial population may cause getting stuck in local optima. Diversifying the initial population is known as an effective approach to overcome this issue; yet, it may not suffice as the search space grows exponentially with increasing feature sizes. In this study, we propose an enhanced initial population strategy to boost the performance of the feature selection task. In our proposed method, we ensure the diversity of the initial population by partitioning the candidate solutions according to their selected number of features. In addition, we adjust the chances of features being selected into a candidate solution regarding their information gain values, which enables wise selection of features among a vast search space. We conduct extensive experiments on many benchmark datasets retrieved from UCI Machine Learning Repository. Moreover, we apply our algorithm on a real-world, large-scale dataset, i.e., Stanford Sentiment Treebank. We observe significant improvements after the comparisons with three off-the-shelf initialization strategies.
More Information
Statistics
Downloads
Downloads per month over past year
Metrics
Altmetric Badge
Dimensions Badge
Share
Actions (login required)
View Item |