Inspiration
My friend is a professional trader and he wanted a cheap tool to transcribe audio efficiently and I wanted to save him time from reading the transcription by simply leveraging powerful LLM so he can get important information through just chatting.
What it does
It transcribes your audio with speaker diarization and gives out the transcription in text then it also allows you to chat with the data of the transcription.
How we built it
Built it using Google Gemini Pro api and Gradio for UI and DeepGram for transcription.
Challenges we ran into
how to transfer data from audio file to the api and how to get results from the api response.
Accomplishments that we're proud of
This was the first time I used gradio so I am proud of using it in a good way using 2 apis.
What we learned
How to use gradio
What's next for AudioChat.ai
I will be adding subtitle generation and translator.
Built With
- deepgram
- gemini
- gradio
- llm
Log in or sign up for Devpost to join the conversation.