Analysis of Machine Learning Algorithms via Detection of Fake News

Authors

  • Manoj Kumar G. Assistant Professor, Department of Computer Science and Engineering, Faculty of Engineering and Technology, SRMIST, Kattankulathur, Tamil Nadu India Author
  • Chinmay Misra Department of Computer Science and Engineering, Faculty of Engineering and Technology, SRMIST,Kattankulathur, Tamil Nadu, India Author
  • Achint Singh Rawat Department of Computer Science and Engineering, Faculty of Engineering and Technology, SRMIST,Kattankulathur, Tamil Nadu, India Author

DOI:

https://doi.org/10.61841/t4rxrn39

Keywords:

Fake, News, Classifiers, Vectorizers, N-gram s, F1-score, Precision, Recall, Faux

Abstract

In the modern political climate, fake news is a growing and legitimate threat to our institutions and all voters. Fake news articles are those that are “intentionally and verifiably false.". This project is aimed at implementing combinations of various feature extraction techniques along with various machine learning algorithms from distinct categories for the purpose of detecting fake news articles through their content. The results of this supervised binary text-classification problem will be compared and ranked. Kaggle, which is owned by Google LLC and is a community of data scientists and machine learning engineers, will fulfill our requirement of a reliable source that provides us with a verifiable dataset of real and fake news. To mirror the real-world environment, the quantity of fake news articles in the dataset will be substantially less than the amount of real news articles. A data set with an approximately 85:15 ratio will be used. 

Downloads

Download data is not yet available.

References

[1] “Pérez-Rosas, Verónica, Kleinberg, Bennett, Lefevre, Alexandra, & Mihalcea, Rada (2017).

Automatic Detection of Fake News.” (https://arxiv.org/abs/1708.07104v1)

[2] “E. Tacchini, G. Ballarin, M. L. Della Vedova, S. Moret, and L. de Alfaro, Some Like It Hoax:

Automated Fake News Detection in Social Networks.” (http://arxiv.org/pdf/1704.07506.)

[3] “Thota, Aswini; Tilak, Priyanka; Ahluwalia, Simrat; and Lohia, Nibrat (2018) "Fake News Detection:

A Deep Learning Approach," SMU Data Science Review: Vol. 1: No. 3, Article 10

https://scholar.smu.edu/datasciencereview/vol1/iss3/

10”

[4] “Getting Real about Fake News [Online]

https://www.kaggle.com/mrisdal/fake-news/data”

[5] “Detecting Fake News with Scikit-learn,” “https://www.datacamp.com/community/tutorials/sc ikitlearn-fake-news.”

[6] W. Y. Wang, "Liar, Liar Pants on Fire": “A New Benchmark Dataset for Fake News Detection. Available: http://arxiv.org/pdf/1705.00648.”

[7] Anaconda distribution https://www.anaconda.com/distribution/

[8] “scikit-learn: machine learning in Python — scikit-learn 0.20.2 documentation. [Online]

Available: http://scikit-learn.org/stable/.”

[9] “Text Classification. A Comprehensive Guide to Classifying Text with Machine Learning

https://monkeylearn.com/text-classification/”

[10] “Natural Language Processing course of National Research University Higher School

of Economics https://www.hse.ru/en/edu/courses/219930752”

[11] “How Fake News Goes Viral: A Case Study

https://www.nytimes.com/2016/11/20/business/med ia/how-fake-news-spreads.html”

[12] “Fake News Is Not the Only Problem https://points.datasociety.net/fake-news-is-notthe- problem-f00ec8cdfcb”

[13] “We Tracked Down A Fake-News Creator In The Suburbs. Here's What We Learned

https://www.npr.org/sections/alltechconsidered/201

6/11/23/503146770/npr-finds-the-head-of-a-covert- fake-news-operation-in-the-suburbs”

[14] “Johnson, J.: 2016. The Five Types of Fake News. https://www.huffpost.com/entry/the-fivetypes-of-fake-ne_b_13609562”

[15] “F1 Score documentation available with Scikit- learn module.

[16] NLTK Documentation available with nltk module

Downloads

Published

31.05.2020

How to Cite

G., M. K., Misra, C., & Singh Rawat, A. (2020). Analysis of Machine Learning Algorithms via Detection of Fake News. International Journal of Psychosocial Rehabilitation, 24(3), 3635-3646. https://doi.org/10.61841/t4rxrn39