Inspiration

I got the idea for this project while I was working on getting certified in cybersecurity. I thought it would be a good idea to build something that implements some of the concepts that I have learned during the process.

What it does

PhishingNet's goal is to check if the text or email that you have, which you believe to be potentially suspicious, is a phishing attempt. It uses multiple datasets and a complex framework to make sure you are safe and not exposed to any malicious data.

How I built it

I built this project originally with just one data set. However, as the process went on, I realized that this was not enough, as the model was not at the level that I wanted it to be. Due to this, I added another, more recent dataset, along with some more complex feature engineering to create the best model I can.

Challenges I ran into

The biggest and most frustrating challenge I ran into was the model not living up to the expectation. There was a lot of feature engineering I had to do to create the most accurate model. Even after merging multiple datasets to train on, it was still quite faulty, not recognizing some key words that would indicate a scam. After many hours of manual correction, the final product is something that I am very proud of.

What I've learned

This was my first project dealing with natural language processing. This was something that I have always found interesting, and I know that this is something that every scientist should be comfortable with. I have learned more about development servers and how, for a project like this, it is not necessary to have a large frontend folder. Overall, I have learned more about web development as a whole. I am thankful to get another project under my belt and, more importantly, I am excited to build more!

Share this project:

Updates