Fake Tweet Detector

Inspiration

Misinformation spreads throughout social media, often amplified by harmful bots, and can have a serious impact on the real world, as seen during the Covid pandemic.

What it does

Performs statistical and machine learning tests on any public Twitter account, giving the following results:

Probability of the account being a bot
Probability of the account spreading misinformation ("fake news")
Sentiment rating based on the account's tweets

How we built it

Bot Accounts

To find bot accounts, we used a Kolmogorov-Smirnov test comparing the leading digits of the number of likes and retweets on the account's tweets to the distribution predicted by Benford's Law. The idea is that artificially created bots tend to follow and interact with each other in an artificial way, so their engagement counts would not follow Benford's Law closely.
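The comparison described above can be sketched roughly as follows. This is a minimal illustration rather than the project's actual code: the function names, the handling of zero counts, and the 1.36 / sqrt(n) cut-off (the usual large-sample Kolmogorov-Smirnov critical value at the 5% level, which is only approximate for a discrete distribution like this one) are all assumptions.

```python
import math
from collections import Counter

# Benford's Law: P(leading digit = d) = log10(1 + 1/d) for d = 1..9
BENFORD_PMF = [math.log10(1 + 1 / d) for d in range(1, 10)]


def leading_digit(n):
    """Return the first significant digit of a positive integer."""
    return int(str(n)[0])


def benford_ks_statistic(counts):
    """Return (KS statistic, sample size) for the leading digits of `counts`.

    The statistic is the largest gap between the empirical cumulative
    distribution of leading digits and the Benford cumulative distribution.
    Zero counts are skipped because they have no leading digit.
    """
    digits = [leading_digit(c) for c in counts if c > 0]
    if not digits:
        raise ValueError("need at least one positive count")
    freq = Counter(digits)
    n = len(digits)
    ks = 0.0
    empirical_cdf = 0.0
    benford_cdf = 0.0
    for d in range(1, 10):
        empirical_cdf += freq.get(d, 0) / n
        benford_cdf += BENFORD_PMF[d - 1]
        ks = max(ks, abs(empirical_cdf - benford_cdf))
    return ks, n


def deviates_from_benford(counts, coeff=1.36):
    """Hypothetical decision rule: flag when KS exceeds coeff / sqrt(n)."""
    ks, n = benford_ks_statistic(counts)
    return ks > coeff / math.sqrt(n)
```

In use, `counts` would be the like or retweet counts of an account's recent tweets; an account whose counts all start with the same digit is flagged, while a sample that tracks Benford's Law produces a statistic close to zero.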

Fake news accounts

For this, we used a machine learning algorithm (Support Vector Classification) trained on the PHEME dataset, which contains tweets from both misinformed and trustworthy sources. We achieved an accuracy of 85% on the test data.
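A classifier of this kind might be set up along the following lines with scikit-learn. This is only a sketch under several assumptions: the TF-IDF text features, the linear kernel, and the account-level score (the share of tweets predicted as misinformation) are illustrative choices that the description above does not confirm, and the training tweets are made-up placeholders, not PHEME data.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import SVC

# Made-up placeholder tweets, NOT taken from PHEME.
# Label 1 = misinformation, label 0 = trustworthy.
train_texts = [
    "miracle cure hidden by doctors, share before it is deleted",
    "secret plot confirmed, the media will not tell you this",
    "shocking proof that the outbreak is a hoax",
    "health agency publishes updated vaccination schedule",
    "city council confirms road closures for the weekend",
    "university releases peer-reviewed study on air quality",
]
train_labels = [1, 1, 1, 0, 0, 0]


def build_classifier():
    """TF-IDF features feeding a linear-kernel SVC (assumed configuration)."""
    return make_pipeline(
        TfidfVectorizer(lowercase=True, ngram_range=(1, 2)),
        SVC(kernel="linear"),
    )


def misinformation_share(classifier, tweets):
    """Hypothetical account score: fraction of tweets predicted as label 1."""
    predicted = classifier.predict(tweets)
    return float(sum(predicted)) / len(predicted)


clf = build_classifier()
clf.fit(train_texts, train_labels)

account_tweets = [
    "shocking hoax the media will not tell you about",
    "council publishes updated schedule for road works",
]
predictions = clf.predict(account_tweets)
share = misinformation_share(clf, account_tweets)
```

With a real dataset, the labelled tweets would be split into training and test sets and the accuracy measured on the held-out part; a toy sample this small says nothing about real performance.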

Sentiment Analysis

We used a built-in ML algorithm from the NLTK Python library to analyse the words and phrases used in each tweet and give a result for the sentiment behind it.
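One way to do this with NLTK is sketched below. It assumes the built-in analyser is VADER (`SentimentIntensityAnalyzer`), which the description above does not name; the helper functions, the averaging across tweets, and the ±0.05 cut-off for the labels are likewise illustrative assumptions.

```python
from statistics import mean

import nltk
from nltk.sentiment.vader import SentimentIntensityAnalyzer


def make_analyzer():
    """Fetch the VADER lexicon if it is missing and return an analyzer."""
    nltk.download("vader_lexicon", quiet=True)
    return SentimentIntensityAnalyzer()


def label_from_compound(score, threshold=0.05):
    """Map a compound score in [-1, 1] to a coarse label (assumed cut-off)."""
    if score >= threshold:
        return "positive"
    if score <= -threshold:
        return "negative"
    return "neutral"


def account_sentiment(tweets, analyzer):
    """Return (average compound score, label) over a list of tweet texts."""
    scores = [analyzer.polarity_scores(text)["compound"] for text in tweets]
    average = mean(scores)
    return average, label_from_compound(average)
```

Calling `account_sentiment(tweets, make_analyzer())` on a list of tweet strings gives a single rating for the account; per-tweet ratings come from applying `label_from_compound` to each tweet's compound score instead of the average.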