Identifying Characteristic Features of Fake News Articles for Deep Learning-Based Identification
DOI:
https://doi.org/10.58445/rars.314Abstract
Fake news articles rapidly spread online, spreading misinformation, and weakening democracy and credible journalism. This research identifies characteristic stylistic features in the text of fake news articles to build a deep learning-based classifier of news articles, including topics, sentiment, and length of the titles and bodies of articles. An experimental approach is taken to compare the effectiveness of multiple feature selections as input data for the binary classification of fake news articles by a neural network. The top-performing feature selection and neural network architecture result in a classifier that achieves 89.7% accuracy on a testing set.
References
Meschi, Meloria, David Eastwood, and Ravi Kanabar. “The Real-World Effects of ‘Fake News’ – and How to Quantify Them.” SCL, August 10, 2020. www.scl.org/articles/12022-the-real-world-effects-of-fake-news-and-how-to-quantify- them.
Lee, Terry. “The Global Rise of ‘Fake News’ and the Threat to Democratic Elections in the USA.” emeraldinsight.com, March 18, 2019. www.emerald.com/insight/content/doi/10.1108/PAP-04-2019-0008/full/pdf.
WPSU - Penn State Public Media. “Case Study – Fake News Dissemination: Pizzagate (Continued).” Case study – fake news dissemination: Pizzagate (continued). The Arthur W. Page Center. Accessed October 22, 2021. www.pagecentertraining.psu.edu/public- relations-ethics/introduction-to-the-ethical-implications-of-fake-news-for-pr-professionals/lesson-2-fake-news-content/case-study-fake-news-dissemination-pizzagate- continued/.
“Evaluating Online Information: The Cornerstone of Civic Online Reasoning.” Stanford History Education Group, November 22, 2016. stacks.stanford.edu/file/druid:fv751yt5934/SHEG%20Evaluating%20Information%20On line.pdf.
Barthel, Michael, Amy Mitchell, and Jesse Holcomb. “Many Americans Believe Fake News Is Sowing Confusion.” Pew Research Center’s Journalism Project. Pew Research Center, December 15, 2016. www.pewresearch.org/journalism/2016/12/15/many- americans-believe-fake-news-is-sowing-confusion/.
Stewart, Elizabeth. “Detecting Fake News: Two Problems for Content Moderation.” Philosophy & Technology. Springer Netherlands, February 11, 2021. link.springer.com/article/10.1007/s13347-021-00442-x.
Zhou, Xinyi, and Reza Zafarani. “A Survey of Fake News: Fundamental Theories, Detection Methods, and Opportunities.” ACM Comput. Surv. 1, July 17, 2020. arxiv.org/pdf/1812.00315.pdf.
UTK Machine Learning Club. “Fake News”. Retrieved from kaggle.com/c/fake-news/data
Řehůřek, R., & Sojka, P. (2010, Μάιος 22). Software Framework for Topic Modelling with Large Corpora. Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, 45–50. Valletta, Malta: ELRA
Hutto, C.J. & Gilbert, E.E. (2014). “VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text”. Eighth International Conference on Weblogs and Social Media (ICWSM-14). Ann Arbor, MI, June 2014
Chollet, F., & Others. (2015). Keras. keras.io
Google. (n.d.). Machine learning glossary. Google Developers. developers.google.com/machine-learning/glossary
Google (n.d.) Classification: ROC Curve and AUC. Google Developers. developers.google.com/machine-learning/crash-course/classification/roc-and-AUC
Downloads
Posted
Categories
License
Copyright (c) 2023 Arjun Sharma

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format for any purpose, even commercially.
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license