Correlation Enhanced Machine Learning Approach based Online News Popularity Prediction
DOI:
https://doi.org/10.24113/ijoscience.v4i3.124
Abstract
News popularity is the maximum growth in attention received by a particular news article. The popularity of online news depends on various factors, such as the number of social media shares, the number of visitor comments, and the number of likes. An automatic decision support system that predicts news popularity is therefore needed, and it will also help in business intelligence. The work presented in this study aims to find the best model for predicting the popularity of online news using machine learning methods. First, correlation techniques are used to measure how an article's popularity depends on its attributes and to select the attributes best suited for subsequent classification. The data were obtained from the UCI Machine Learning Repository and comprise 39,644 articles, each with sixty condition attributes and one decision attribute. Different learning algorithms, namely the proposed Hybrid SVM-RF, AdaBoost, LPBoost, and KNN, are then implemented to predict news popularity. The system's performance is tested on this dataset, and the prediction performance of every method is compared using evaluation measures. Hybrid SVM-RF turns out to be the best model, achieving an accuracy of 99.6% for binary classification. The work is further extended to multiclass classification with the proposed Hybrid SVM-RF, Naïve Bayes, and KNN. There, Hybrid SVM-RF achieved an accuracy of about 73%, which compares favorably with the other classifiers.
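The correlation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which correlation measure or selection rule was used, so Pearson correlation and a top-k cut-off are assumed here, and the names `pearson`, `select_by_correlation`, `rows`, and `shares` are hypothetical. The data are synthetic stand-ins, not the UCI dataset.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / math.sqrt(var_x * var_y)


def select_by_correlation(rows, target, k):
    """Return the indices of the k features with the largest |r| to target."""
    n_features = len(rows[0])
    scores = []
    for j in range(n_features):
        column = [row[j] for row in rows]
        scores.append((abs(pearson(column, target)), j))
    scores.sort(reverse=True)
    return sorted(j for _, j in scores[:k])


# Synthetic stand-in data (assumption, not the UCI dataset):
# features 0 and 2 drive the target, features 1 and 3 are pure noise.
rng = random.Random(0)
rows = [[rng.gauss(0, 1) for _ in range(4)] for _ in range(500)]
shares = [3.0 * r[0] - 2.0 * r[2] + rng.gauss(0, 0.5) for r in rows]

selected = select_by_correlation(rows, shares, k=2)
print(selected)
```

The retained columns would then be passed to whichever classifier is being evaluated. A filter of this kind only captures linear dependence on the target, so it may miss attributes that matter through interactions.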
License
Copyright (c) 2018 Akanksha Kathal, Mayank Namdev

This work is licensed under a Creative Commons Attribution 4.0 International License.
IJOSCIENCE follows an Open Journal Access policy. Authors retain the copyright of the original work and grant the rights of publication to the publisher with the work simultaneously licensed under a Creative Commons CC BY License that allows others to distribute, remix, adapt, and build upon your work, even commercially, as long as they credit you for the original creation. Authors are permitted to post their work in institutional repositories, social media or other platforms.
Under the following terms:
- Attribution: You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions: You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.