We wanted to build something that can recognize who is speaking and use that for authentication.
What it does
Users first enroll their voice. After that, whenever someone speaks, the app determines which enrolled user it is.
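To make the enroll-then-identify flow concrete, here is a minimal toy sketch. It is not how the app or the speech service works internally: the class `SpeakerRegistry`, the idea of a fixed-length "voice feature" vector, and the cosine-similarity match are all assumptions made purely for illustration. In the real app this matching is done by the cloud API.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Toy stand-in for a speaker identification service (hypothetical, for illustration only). */
class SpeakerRegistry {
    // One stored feature vector per enrolled user.
    private final Map<String, double[]> profiles = new LinkedHashMap<>();

    /** Enrollment: remember a copy of the user's voice features. */
    void enroll(String userName, double[] voiceFeatures) {
        profiles.put(userName, voiceFeatures.clone());
    }

    /** Identification: return the enrolled user whose features are most similar, or null if nobody is enrolled. */
    String identify(double[] voiceFeatures) {
        String best = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (Map.Entry<String, double[]> entry : profiles.entrySet()) {
            double score = cosine(entry.getValue(), voiceFeatures);
            if (score > bestScore) {
                bestScore = score;
                best = entry.getKey();
            }
        }
        return best;
    }

    /** Cosine similarity over the shared prefix of the two vectors; 0 if either has zero length. */
    private static double cosine(double[] a, double[] b) {
        double dot = 0, normA = 0, normB = 0;
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        if (normA == 0 || normB == 0) {
            return 0.0;
        }
        return dot / Math.sqrt(normA * normB);
    }

    public static void main(String[] args) {
        SpeakerRegistry registry = new SpeakerRegistry();
        registry.enroll("alice", new double[] {0.9, 0.1, 0.3});
        registry.enroll("bob", new double[] {0.1, 0.8, 0.5});
        // The sample is much closer to alice's profile, so this prints "alice".
        System.out.println(registry.identify(new double[] {0.85, 0.2, 0.25}));
    }
}
```

A real system would also need a similarity threshold so that an unknown speaker is rejected rather than mapped to the nearest enrolled user, which matters if the result is used for authentication.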
How we built it
We built it on Microsoft's Bing Speech API. It is an Android application, written in Java.
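As a rough sketch of what one such API call can look like from Java, the snippet below prepares a POST request that uploads raw audio bytes using the standard `HttpURLConnection` class. The class name `SpeechApiClient`, the endpoint URL, and the choice of content type are placeholders rather than the values the app actually used. The `Ocp-Apim-Subscription-Key` header is the usual way to pass a key to Microsoft's cognitive APIs, but treat its use here as an assumption about this project.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URI;

/** Hypothetical helper for posting audio to a speech endpoint (names and URL are placeholders). */
class SpeechApiClient {

    /** Configures a POST request; nothing is sent over the network yet. */
    static HttpURLConnection buildAudioPost(String endpoint, String apiKey) throws IOException {
        HttpURLConnection conn =
                (HttpURLConnection) URI.create(endpoint).toURL().openConnection();
        conn.setRequestMethod("POST");
        conn.setDoOutput(true); // we will write a request body
        conn.setRequestProperty("Content-Type", "application/octet-stream");
        conn.setRequestProperty("Ocp-Apim-Subscription-Key", apiKey); // assumed auth header
        return conn;
    }

    /** Sends the audio bytes as the request body and returns the HTTP status code. */
    static int send(HttpURLConnection conn, byte[] audio) throws IOException {
        try (OutputStream out = conn.getOutputStream()) {
            out.write(audio);
        }
        try {
            return conn.getResponseCode();
        } finally {
            conn.disconnect();
        }
    }

    public static void main(String[] args) throws IOException {
        // Placeholder endpoint: only builds the request, does not contact any server.
        HttpURLConnection conn = buildAudioPost("https://example.invalid/identify", "YOUR_KEY");
        System.out.println(conn.getRequestMethod() + " " + conn.getURL());
    }
}
```

On Android, `send` would have to run off the main thread, since network calls on the UI thread are not allowed.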
Challenges we ran into
We had a lot of trouble finding and calling the right APIs, because there are many of them and little documentation for each.
Accomplishments that we're proud of
The app works and can distinguish between voices.
What we learned
We learned how to make POST requests and call web APIs.
What's next for Speaker Recognition
One feature we tried but could not get working is speech-to-text. With it, we could record lectures, conversations, and meetings and produce a complete transcript of each. We also want to run those transcripts through a summarizer API to get a short summary of what was said.