Our primary inspiration was our eagerness to explore the world of Speech Recognition, Speech Processing, and NLP in general. We wanted to use our theoretical knowledge in building a real world application that can serve very meaningful day-day life purposes, and fortunately we have come close to building such an application.

What it does

speakNotes allows you to use your classical Notepad in a modern futuristic way. It allows you to speak out your thoughts onto your notepad instead of typing it, hence allowing a free flow of thought and greater human-computer interaction. It also allows you to convert an audio file into text, and vice versa.

How we built it

We used Python's Tkinter GUI library to build the interface and then used Facebook's Wit.AI to leverage AI capabilities.

Challenges we ran into

We ran into a hell lot of challenges. Since this was the first time that we ventured into Python GUI programming and application development, we had a tough time going through all the documentations and stuff.

Another challenge that we faced was to make the application a multi threaded one so that it doesn't freeze on I/O operations. We are very proud to have been successful in accomplishing it and we gained a lot of working knowledge of how Threading works in Python.

Accomplishments that we're proud of

1) Made a full scale Windows application in python for the first time and have been very successful in making it.

2) Leveraged the power of AI in our project in a very useful way.

3) Got a good feedback from our friends who tested it.

4) Got a chance to apply our software engineering skills

What we learned

We learned about the process of making a robust software application and the various application development life cycle principles associated with software engineering.

We have also learned Python GUI programming along with how to use Wit.AI's speech recognition services.

What's next for

We plan to take the project further by developing its versions for Android, ios, Mac OS and Linux. We will also update the UI and the speech recognition functionality.

Built With

Share this project: