Inspiration
What it does
How I built it
Challenges I ran into
Accomplishments that I'm proud of
What I learned
What's next for Python Speech to Text Client Library
Python Speech to Text Client Inspiration:
What it does: We built a speech to text client library using the Websockets protocol in python.
How we built it:Microsoft Cognitive Services provides the Bing Speech API. From the free trial subscription keys from the Cognitive Services subscription page, we obtained the API keyKey. We used websockets-a client library in python which works on the websockets protocol. We used the following python libraries sys, json, platform, asyncio, websockets and utils. Our client library takes into input- in the form of Audio File and Microphone Input, uses the 3 Recognizion Modes: Interactive, Dictation and Conversation and uses Microsoft Bing Speech API to convert the spoken speech to text
Challenges we faced: Having never worked on Client-server library programming, it was time-consuming to read on the existing libaries already existing and implement the same in python. There was almost no library in python for websockets client which actually sends headers in the send method after the handshake and we had to do some workaround it.
Accomplishments that we're proud of: Completing the project in our first hackathon.
What we learned:Learning about asynchronous communication.
What's next for Python Speech to Text Client: The project was overall quite exciting and as an extension to our project, we would introduce the UI component as well.
Built With
- bing-speech-api
- python
- websockets
Log in or sign up for Devpost to join the conversation.