Feature Selection using Multi-Objective Clustering based Gray Wolf Optimization for Big Data Analytics

Authors

  • Aakriti Shukla
  • Dr. Damodar Prasad Tiwari

Keywords:

Feature Selection, Big Data, Classification, Multi-objective, Clustering, Optimization.

Abstract

Although numerous efforts have been made to develop feature selection framework which is efficient in Big Data technology, complexity of processing big data remains a significant barrier. As a result, the computational complexity and intricacy of big data may block the data mining process. The feature selection method means, a required pre-processing approach to minimize dataset dimensionality for great advanced features and classifier performance optimization. In order to increase performance, feature selection are regarded to constitute the core of big data technologies. In recent years, many academics have moved their focus to data science and analytics for application scenarios leveraging integrating tools of big data. People take quite some time to engage, when it comes to big data. As a consequence, in a decentralized system with a high workload, it is crucial in making feature selection dynamic and adaptable. Multi objective optimal strategies for feature selection are provided in this work. This research adds to the creation of a strategy for enhancing feature selection efficiency in large, complex data sets. In this paper, a multi-objective clustering-based gray-wolf optimization algorithm (MOCGWO) is proposed for classification problems. Five datasets were used to show the robustness of proposed algorithm. The result analysis was compared with other optimization methodology such as GWO and PSO. This shows efficacy of MOCGWO algorithm.

Downloads

Download data is not yet available.

Author Biographies

Aakriti Shukla

Department of CSE

Bansal Institute of Science & Technology

Bhopal (M.P.), India

Dr. Damodar Prasad Tiwari

Department of CSE

Bansal Institute of Science & Technology

Bhopal (M.P.), India

References

Zhu, Li, Fei Richard Yu, Yige Wang, Bin Ning, and Tao Tang. "Big data analytics in intelligent transportation systems: A survey." IEEE Transactions on Intelligent Transportation Systems 20, no. 1 (2018): 383-398.

Batko, Kornelia, and Andrzej ?l?zak. "The use of Big Data Analytics in healthcare." Journal of big Data 9, no. 1 (2022): 1-24.

Mehta, Nishita, and Anil Pandit. "Concurrence of big data analytics and healthcare: A systematic review." International journal of medical informatics 114 (2018): 57-65.

Luo, Mi, Yifu Wang, Yunhong Xie, Lai Zhou, Jingjing Qiao, Siyu Qiu, and Yujun Sun. "Combination of feature selection and catboost for prediction: The first application to the estimation of aboveground biomass." Forests 12, no. 2 (2021): 216.

Too, Jingwei, and Seyedali Mirjalili. "A hyper learning binary dragonfly algorithm for feature selection: A COVID-19 case study." Knowledge-Based Systems 212 (2021): 106553.

Agrawal, Prachi, Hattan F. Abutarboush, Talari Ganesh, and Ali Wagdy Mohamed. "Metaheuristic algorithms on feature selection: A survey of one decade of research (2009-2019)." IEEE Access 9 (2021): 26766-26791.

C. Cîmpanu, L. Ferariu, T. Dumitriu and F. Ungureanu, "Multi-Objective Optimization of Feature Selection procedure for EEG signals classification," 2017 E-Health and Bioengineering Conference (EHB), 2017, pp. 434-437, doi: 10.1109/EHB.2017.7995454.

Luo, Juanjuan, Dongqing Zhou, Lingling Jiang, and Huadong Ma. "A particle swarm optimization based multiobjective memetic algorithm for high-dimensional feature selection." Memetic Computing (2022): 1-17.

Hao, Jin-Kao; Legrand, Pierrick; Collet, Pierre; Monmarché, Nicolas; Lutton, Evelyne; Schoenauer, Marc (2012). [Lecture Notes in Computer Science] Artificial Evolution Volume 7401 || A Rigorous Runtime Analysis for Quasi-Random Restarts and Decreasing Stepsize. , 10.1007/978-3-642-35533-2(Chapter 4), 37–48. doi:10.1007/978-3-642-35533-2_4

BenSaid, Fatma, and Adel M. Alimi. "Online feature selection system for big data classification based on multi-objective automated negotiation." Pattern Recognition 110 (2021): 107629.

Guo, Jianmei, Jia Hui Liang, Kai Shi, Dingyu Yang, Jingsong Zhang, Krzysztof Czarnecki, Vijay Ganesh, and Huiqun Yu. "SMTIBEA: a hybrid multi-objective optimization algorithm for configuring large constrained software product lines." Software & Systems Modeling 18, no. 2 (2019): 1447-1466.

Abdi, Yousef, and Mohammad-Reza Feizi-Derakhshi. "Hybrid multi-objective evolutionary algorithm based on search manager framework for big data optimization problems." Applied Soft Computing 87 (2020): 105991.

Yi, Jiao-Hong, Suash Deb, Junyu Dong, Amir H. Alavi, and Gai-Ge Wang. "An improved NSGA-III algorithm with adaptive mutation operator for Big Data optimization problems." Future Generation Computer Systems 88 (2018): 571-585.

Meera, S., and C. Sundar. "A hybrid metaheuristic approach for efficient feature selection methods in big data." Journal of Ambient Intelligence and Humanized Computing 12, no. 3 (2021): 3743-3751.

Rostami, Reza Ramzanzadeh, and Soheila Karbasi. "Detecting Fake Accounts on Twitter Social Network Using Multi-Objective Hybrid Feature Selection Approach." Webology 17, no. 1 (2020).

Zhang, Yong, Dun-wei Gong, Xiao-zhi Gao, Tian Tian, and Xiao-yan Sun. "Binary differential evolution with self-learning for multi-objective feature selection." Information Sciences 507 (2020): 67-85.

Zhou, Yu, Junhao Kang, Sam Kwong, Xu Wang, and Qingfu Zhang. "An evolutionary multi-objective optimization framework of discretization-based feature selection for classification." Swarm and Evolutionary Computation 60 (2021): 100770.

Karagoz, Gizem Nur, Adnan Yazici, Tansel Dokeroglu, and Ahmet Cosar. "A new framework of multi-objective evolutionary algorithms for feature selection and multi-label classification of video data." International Journal of Machine Learning and Cybernetics 12, no. 1 (2021): 53-71.

Ewees, Ahmed A., Mohamed Abd Elaziz, and Diego Oliva. "A new multi-objective optimization algorithm combined with opposition-based learning." Expert Systems with Applications 165 (2021): 113844.

Downloads

Published

11/08/2022

How to Cite

Shukla, A. ., & Tiwari, D. D. P. . (2022). Feature Selection using Multi-Objective Clustering based Gray Wolf Optimization for Big Data Analytics. SMART MOVES JOURNAL IJOSCIENCE, (11), 1–7. Retrieved from https://ijoscience.com/index.php/ojsscience/article/view/497