Feature Selection using Multi-Objective Clustering based Gray Wolf Optimization for Big Data Analytics
Keywords:Feature Selection, Big Data, Classification, Multi-objective, Clustering, Optimization.
Although numerous efforts have been made to develop feature selection framework which is efficient in Big Data technology, complexity of processing big data remains a significant barrier. As a result, the computational complexity and intricacy of big data may block the data mining process. The feature selection method means, a required pre-processing approach to minimize dataset dimensionality for great advanced features and classifier performance optimization. In order to increase performance, feature selection are regarded to constitute the core of big data technologies. In recent years, many academics have moved their focus to data science and analytics for application scenarios leveraging integrating tools of big data. People take quite some time to engage, when it comes to big data. As a consequence, in a decentralized system with a high workload, it is crucial in making feature selection dynamic and adaptable. Multi objective optimal strategies for feature selection are provided in this work. This research adds to the creation of a strategy for enhancing feature selection efficiency in large, complex data sets. In this paper, a multi-objective clustering-based gray-wolf optimization algorithm (MOCGWO) is proposed for classification problems. Five datasets were used to show the robustness of proposed algorithm. The result analysis was compared with other optimization methodology such as GWO and PSO. This shows efficacy of MOCGWO algorithm.
Zhu, Li, Fei Richard Yu, Yige Wang, Bin Ning, and Tao Tang. "Big data analytics in intelligent transportation systems: A survey." IEEE Transactions on Intelligent Transportation Systems 20, no. 1 (2018): 383-398.
Batko, Kornelia, and Andrzej ?l?zak. "The use of Big Data Analytics in healthcare." Journal of big Data 9, no. 1 (2022): 1-24.
Mehta, Nishita, and Anil Pandit. "Concurrence of big data analytics and healthcare: A systematic review." International journal of medical informatics 114 (2018): 57-65.
Luo, Mi, Yifu Wang, Yunhong Xie, Lai Zhou, Jingjing Qiao, Siyu Qiu, and Yujun Sun. "Combination of feature selection and catboost for prediction: The first application to the estimation of aboveground biomass." Forests 12, no. 2 (2021): 216.
Too, Jingwei, and Seyedali Mirjalili. "A hyper learning binary dragonfly algorithm for feature selection: A COVID-19 case study." Knowledge-Based Systems 212 (2021): 106553.
Agrawal, Prachi, Hattan F. Abutarboush, Talari Ganesh, and Ali Wagdy Mohamed. "Metaheuristic algorithms on feature selection: A survey of one decade of research (2009-2019)." IEEE Access 9 (2021): 26766-26791.
C. Cîmpanu, L. Ferariu, T. Dumitriu and F. Ungureanu, "Multi-Objective Optimization of Feature Selection procedure for EEG signals classification," 2017 E-Health and Bioengineering Conference (EHB), 2017, pp. 434-437, doi: 10.1109/EHB.2017.7995454.
Luo, Juanjuan, Dongqing Zhou, Lingling Jiang, and Huadong Ma. "A particle swarm optimization based multiobjective memetic algorithm for high-dimensional feature selection." Memetic Computing (2022): 1-17.
Hao, Jin-Kao; Legrand, Pierrick; Collet, Pierre; Monmarché, Nicolas; Lutton, Evelyne; Schoenauer, Marc (2012). [Lecture Notes in Computer Science] Artificial Evolution Volume 7401 || A Rigorous Runtime Analysis for Quasi-Random Restarts and Decreasing Stepsize. , 10.1007/978-3-642-35533-2(Chapter 4), 37–48. doi:10.1007/978-3-642-35533-2_4
BenSaid, Fatma, and Adel M. Alimi. "Online feature selection system for big data classification based on multi-objective automated negotiation." Pattern Recognition 110 (2021): 107629.
Guo, Jianmei, Jia Hui Liang, Kai Shi, Dingyu Yang, Jingsong Zhang, Krzysztof Czarnecki, Vijay Ganesh, and Huiqun Yu. "SMTIBEA: a hybrid multi-objective optimization algorithm for configuring large constrained software product lines." Software & Systems Modeling 18, no. 2 (2019): 1447-1466.
Abdi, Yousef, and Mohammad-Reza Feizi-Derakhshi. "Hybrid multi-objective evolutionary algorithm based on search manager framework for big data optimization problems." Applied Soft Computing 87 (2020): 105991.
Yi, Jiao-Hong, Suash Deb, Junyu Dong, Amir H. Alavi, and Gai-Ge Wang. "An improved NSGA-III algorithm with adaptive mutation operator for Big Data optimization problems." Future Generation Computer Systems 88 (2018): 571-585.
Meera, S., and C. Sundar. "A hybrid metaheuristic approach for efficient feature selection methods in big data." Journal of Ambient Intelligence and Humanized Computing 12, no. 3 (2021): 3743-3751.
Rostami, Reza Ramzanzadeh, and Soheila Karbasi. "Detecting Fake Accounts on Twitter Social Network Using Multi-Objective Hybrid Feature Selection Approach." Webology 17, no. 1 (2020).
Zhang, Yong, Dun-wei Gong, Xiao-zhi Gao, Tian Tian, and Xiao-yan Sun. "Binary differential evolution with self-learning for multi-objective feature selection." Information Sciences 507 (2020): 67-85.
Zhou, Yu, Junhao Kang, Sam Kwong, Xu Wang, and Qingfu Zhang. "An evolutionary multi-objective optimization framework of discretization-based feature selection for classification." Swarm and Evolutionary Computation 60 (2021): 100770.
Karagoz, Gizem Nur, Adnan Yazici, Tansel Dokeroglu, and Ahmet Cosar. "A new framework of multi-objective evolutionary algorithms for feature selection and multi-label classification of video data." International Journal of Machine Learning and Cybernetics 12, no. 1 (2021): 53-71.
Ewees, Ahmed A., Mohamed Abd Elaziz, and Diego Oliva. "A new multi-objective optimization algorithm combined with opposition-based learning." Expert Systems with Applications 165 (2021): 113844.
How to Cite
Copyright (c) 2022 Aakriti Shukla, Dr. Damodar Prasad Tiwari
This work is licensed under a Creative Commons Attribution 4.0 International License.
IJOSCIENCE follows an Open Journal Access policy. Authors retain the copyright of the original work and grant the rights of publication to the publisher with the work simultaneously licensed under a Creative Commons CC BY License that allows others to distribute, remix, adapt, and build upon your work, even commercially, as long as they credit you for the original creation. Authors are permitted to post their work in institutional repositories, social media or other platforms.
Under the following terms:
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.