Bloomberg challege

Inspiration

Many documents with different subjects should be classified. Moreover, the task is boring for human.

It helps classify documents of US government. In other words, documents can be automatically classified based on their similar core content.

We tried to find the similarity between tittles of each document and put them in their own cluster.

Using KNN for text classifier is challenging and it requires time to master the technique.

A stepping stone to gain more knowledge in NLP.

Even though the project is not complete, we have been able to learn more about language processing

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.