Inspiration I've been hearing about data science and how valuable the related skills are becoming, so I wanted to try out Python and do something with web scraping. I was inspired to make this project in particular because the endless search for an internship is quite tedious, and I wanted a way to streamline the process and conveniently keep track of the positions I applied to.

What it does Scrapes job posts from indeed.com based on the user's query. The user can then save selected job posts, refer to them later, and indicate whether they have applied to each position.

How I built it I used Flask as the framework for the website. For the search function, the user submits a POST request, which runs the web scraping script. This Python script uses Beautiful Soup to parse all job posts on indeed.com that match the user's query. With JavaScript, I populate the web page with the script's results, and the user can save posts. Saving writes the data to the browser's local storage, where the saved posts page can read it. I then hosted the website on Heroku.
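The search flow described above could look roughly like the following. This is a minimal sketch, not the project's actual code: the `/search` route name, the `query` form field, the `fetch_results_html` and `parse_job_posts` helpers, and the HTML tags and class names are all assumptions made for illustration, since the write-up does not show the real markup on indeed.com or the real route. The fetch step is replaced with a fixed sample page so the example runs offline.

```python
from bs4 import BeautifulSoup
from flask import Flask, jsonify, request

app = Flask(__name__)

# Stand-in for a search results page. The tags and class names are made up
# for this sketch; they are NOT the real markup used by indeed.com.
SAMPLE_RESULTS_HTML = """
<div class="job-card">
  <h2 class="job-title"><a href="/jobs/1">Software Engineering Intern</a></h2>
  <span class="company">Example Corp</span>
</div>
<div class="job-card">
  <h2 class="job-title"><a href="/jobs/2">Data Science Intern</a></h2>
  <span class="company">Sample Labs</span>
</div>
"""


def fetch_results_html(query):
    """Hypothetical fetch step.

    A real version would download the results page for `query`;
    this one ignores the query and returns the fixed sample page.
    """
    return SAMPLE_RESULTS_HTML


def parse_job_posts(html):
    """Extract title, company, and link from each job card in the page."""
    soup = BeautifulSoup(html, "html.parser")
    posts = []
    for card in soup.find_all("div", class_="job-card"):
        title = card.find("h2", class_="job-title")
        link = title.find("a") if title else None
        if link is None:
            continue  # skip cards that do not match the assumed layout
        company = card.find("span", class_="company")
        posts.append({
            "title": link.get_text(strip=True),
            "company": company.get_text(strip=True) if company else "",
            "url": link.get("href", ""),
        })
    return posts


@app.route("/search", methods=["POST"])
def search():
    # The search form posts the user's query; the scraper runs on demand.
    query = request.form.get("query", "")
    posts = parse_job_posts(fetch_results_html(query))
    # JSON is easy for the page's JavaScript to turn into post listings.
    return jsonify(posts)
```

Keeping the parsing in its own function means it can be exercised against a saved HTML page without starting the Flask server, and the front-end JavaScript only needs to read a list of `title`, `company`, and `url` fields.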

Challenges I ran into Surprisingly, coding the Python scripts wasn't the real challenge. Flask takes the cake: I spent the entire second day trying to get the website to look nice and work properly, and then figuring out how to host it.

Accomplishments that I'm proud of Making the web scraping Python script is something I'm proud of. Not only did I pick up Python quickly, which tells me I'm getting better at understanding the fundamentals of coding, but I also made a script that has a lot of practical power. While it is limited, it would only take a little elbow grease to use it on any website I please.

What I learned How to code in Python, run scripts, use the command line, scrape web pages, build a Flask website, and host a website/server in the cloud (Heroku).

What's next for Job and Internship Scraper Write more web scrapers for other job listing websites so the search function can draw on a more diverse set of data. Also, instead of storing the saved posts in the browser's local storage, I would set up a dedicated database and have users save them there.
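One possible shape for that database, sketched with Python's built-in `sqlite3` module. Nothing here is decided yet: the `saved_posts` table, its columns, the `user_id` value, and the helper function names are all assumptions for illustration, and a hosted deployment might well use a different database.

```python
import sqlite3


def open_saved_posts_db(path=":memory:"):
    """Open (or create) the database and make sure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS saved_posts (
            id      INTEGER PRIMARY KEY,
            user_id TEXT NOT NULL,
            title   TEXT NOT NULL,
            company TEXT NOT NULL,
            url     TEXT NOT NULL,
            applied INTEGER NOT NULL DEFAULT 0,
            UNIQUE (user_id, url)
        )
        """
    )
    return conn


def save_post(conn, user_id, post):
    """Save a post for a user; saving the same URL twice is a no-op."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT OR IGNORE INTO saved_posts (user_id, title, company, url) "
            "VALUES (?, ?, ?, ?)",
            (user_id, post["title"], post["company"], post["url"]),
        )


def mark_applied(conn, user_id, url):
    """Record that the user has applied to the saved post at `url`."""
    with conn:
        conn.execute(
            "UPDATE saved_posts SET applied = 1 WHERE user_id = ? AND url = ?",
            (user_id, url),
        )


def list_saved_posts(conn, user_id):
    """Return the user's saved posts in the order they were saved."""
    rows = conn.execute(
        "SELECT title, company, url, applied FROM saved_posts "
        "WHERE user_id = ? ORDER BY id",
        (user_id,),
    ).fetchall()
    return [
        {"title": t, "company": c, "url": u, "applied": bool(a)}
        for t, c, u, a in rows
    ]
```

Compared with local storage, a table like this would let saved posts and their applied status follow the user across browsers and devices, at the cost of needing some notion of a user account.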
