Fake News Detection on Social Media Using Machine Learning
2024, Journal Of Electrical Systems
Abstract
The spread of misinformation on the internet and social media is a growing problem that affects public opinion and decision-making. This study examines how machine learning can be used to detect and classify fake news more effectively. We gathered a diverse dataset from various online sources and preprocessed it with tokenization, normalization, removal of punctuation and stop words, and lemmatization. To improve the quality of the analysis, we developed an Enhanced Feature Engineering framework for Fake News Detection, which combines TF-IDF, Bag of Words, tweet length, and sentiment analysis features into a robust dataset for training machine learning models. Among the models we tested, the ensemble voting classifier was the most accurate and reliable at distinguishing real news from fake, which demonstrates the potential of combining multiple algorithms for complex problems such as misinformation. With this research, we aim to contribute to ongoing efforts to combat fake news and to support a more reliable and trustworthy digital space.
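The pipeline outlined in the abstract (preprocessing, combined features, voting) can be illustrated with a minimal standard-library sketch. It is not the authors' implementation: the stop-word list, the sentiment lexicon, the smoothed IDF formula, and the choice of hard (majority) voting are all assumptions made for illustration, and lemmatization is omitted because it needs an external lexicon.

```python
import math
import re
from collections import Counter

# Tiny illustrative word lists (assumptions, not taken from the paper).
STOP_WORDS = {"the", "a", "an", "is", "are", "to", "of", "and", "in", "that", "this"}
POSITIVE = {"good", "great", "confirmed", "official"}
NEGATIVE = {"shocking", "hoax", "fake", "scandal"}


def preprocess(text):
    """Lowercase, replace punctuation with spaces, tokenize, drop stop words."""
    text = re.sub(r"[^\w\s]", " ", text.lower())
    return [tok for tok in text.split() if tok not in STOP_WORDS]


def tfidf_vectors(docs):
    """Return (vocabulary, TF-IDF vectors) for a list of tokenized documents.

    Uses a common smoothed IDF, log((1 + n) / (1 + df)) + 1 (assumed variant).
    """
    vocab = sorted({tok for doc in docs for tok in doc})
    n = len(docs)
    df = Counter(tok for doc in docs for tok in set(doc))
    idf = {tok: math.log((1 + n) / (1 + df[tok])) + 1 for tok in vocab}
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc) or 1  # avoid division by zero on empty documents
        vectors.append([counts[tok] / total * idf[tok] for tok in vocab])
    return vocab, vectors


def extra_features(raw_text, tokens):
    """Tweet length in characters and a lexicon-based sentiment score."""
    sentiment = sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)
    return [len(raw_text), sentiment]


def hard_vote(predictions):
    """Majority vote over the labels predicted by several classifiers."""
    return Counter(predictions).most_common(1)[0][0]
```

In a full system, each tweet's TF-IDF vector would be concatenated with its extra features and passed to several trained classifiers, whose predicted labels `hard_vote` would then combine. The abstract does not name the base models or the voting scheme.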