Evolutionary Algorithms for Feature Selection in Data Mining

Authors

  • Mahadevi Verma Swami Vivekanand College of Science and Technology, Bhopal, India Author

DOI:

https://doi.org/10.15662/IJEETR.2021.0305001

Keywords:

Feature Selection, Evolutionary Algorithms, Genetic Algorithm (GA), Particle Swarm Optimization (PSO), Nature-Inspired Metaheuristics, Hybrid Methods, High-Dimensional Data, Swarm Intelligence, Combinatorial Optimization, Wrapper Methods

Abstract

Feature selection plays a pivotal role in data mining and machine learning, aiding in enhancing model performance, reducing overfitting, and improving interpretability. Evolutionary algorithms (EAs)—such as Genetic Algorithms (GA), Particle Swarm Optimization (PSO), Ant Colony Optimization (ACO), and their variants—apply bio-inspired search strategies to identify optimal feature subsets from high-dimensional datasets. These algorithms offer a compelling blend of global search capability and adaptability, particularly suited to navigating large and complex search spaces.

PSO and GA have been especially prominent, with PSO used in nearly half of reported swarm-intelligence-based feature selection cases, underscoring its effectiveness across diverse domains—from intrusion detection to gene expression classification . Hybrid methods combining EAs with filter techniques (e.g., χ² statistics or information gain) or local search heuristics have consistently demonstrated enhanced convergence, higher classification accuracy, and more compact feature sets .

Notably, advanced PSO variants—including binary PSO, adaptive PSO with leadership learning, combinatorial PSO, and two dimensional learning frameworks—have improved performance on specific tasks such as intrusion detection and gene expression data selection, achieving high detection rates with minimal features . These techniques balance exploration and exploitation more effectively, addressing issues like premature convergence and scalability.

This overview synthesizes pre-2020 progress by examining EA designs, hybrid strategies, application contexts, strengths, and limitations. It highlights both the versatility and challenges of evolutionary feature selection— illuminating pathways for future enhancements in scalability, convergence reliability, and integration with domain specific methods.

References

 A hybrid two-layer GA + Elastic Net for feature selection

 Tribe Competition-based GA (TCbGA) for pattern classification

 Tunable Particle Swarm Size Optimization (TPSO) algorithm

 GA-wrapped Bayes Naive for coronary artery diagnosis

 PSO-based feature selection in intrusion detection (PCA comparisons)

 GARS for high-dimensional bioinformatics feature selection

 Metaheuristic/PSO variants and hybridization reviews

 GA performance in intrusion detection datasets

 Foundational GA methodology

 EA in data mining context

Downloads

Published

2021-09-01

How to Cite

Evolutionary Algorithms for Feature Selection in Data Mining. (2021). International Journal of Engineering & Extended Technologies Research (IJEETR), 3(5), 3686-3690. https://doi.org/10.15662/IJEETR.2021.0305001