Inspiration

A friend of mine is a professional trader who wanted a cheap, efficient tool for transcribing audio. I also wanted to save him the time of reading through each transcript, so I used an LLM to let him get the important information just by chatting.

What it does

It transcribes your audio with speaker diarization and returns the transcript as text. You can then chat with the transcript to ask questions about what was said.

How we built it

I built it with the Google Gemini Pro API for the chat, Deepgram for transcription, and Gradio for the UI.
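As a rough illustration of how the transcript and the chat fit together, here is a minimal sketch of assembling a prompt from a transcript and a user question. The function name `build_prompt`, the instruction wording, and the `max_chars` limit are all assumptions made for this example, not the project's actual code. The resulting string is what would be sent to the LLM; the API call itself is left out.

```python
def build_prompt(transcript: str, question: str, max_chars: int = 12000) -> str:
    """Combine a transcript and a question into a single prompt string.

    Hypothetical helper: the wording and the size limit are illustrative.
    """
    # Clip very long transcripts so the prompt stays under a rough size budget.
    clipped = transcript[:max_chars]
    return (
        "Answer the question using only the transcript below.\n\n"
        f"Transcript:\n{clipped}\n\n"
        f"Question: {question}"
    )


prompt = build_prompt(
    "Speaker 0: Buy now.\nSpeaker 1: Agreed.",
    "What did Speaker 1 say?",
)
```

Keeping the prompt assembly in a plain function like this makes it easy to test without calling any API.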

Challenges we ran into

Figuring out how to send the audio file to the API and how to extract the results from the API response.
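To give a sense of the response-handling side, here is a minimal sketch that turns a diarized transcription response into one line per speaker turn. It assumes a response shaped like Deepgram's pre-recorded JSON (`results` → `channels` → `alternatives` → `words`, with a `speaker` label on each word when diarization is enabled). The helper name `speaker_turns` and the sample data are made up for illustration, and this is not necessarily how the project does it.

```python
from itertools import groupby


def speaker_turns(response: dict) -> list[str]:
    """Collapse word-level diarization into one line per speaker turn."""
    # Assumed shape: results -> channels[0] -> alternatives[0] -> words,
    # where each word has "word" and an integer "speaker" label.
    words = response["results"]["channels"][0]["alternatives"][0]["words"]
    turns = []
    # groupby merges consecutive words that share the same speaker label.
    for speaker, group in groupby(words, key=lambda w: w.get("speaker", 0)):
        text = " ".join(w.get("punctuated_word", w["word"]) for w in group)
        turns.append(f"Speaker {speaker}: {text}")
    return turns


# Hand-written sample in the assumed shape (not real API output).
sample = {
    "results": {
        "channels": [
            {
                "alternatives": [
                    {
                        "words": [
                            {"word": "buy", "punctuated_word": "Buy", "speaker": 0},
                            {"word": "now", "punctuated_word": "now.", "speaker": 0},
                            {"word": "agreed", "punctuated_word": "Agreed.", "speaker": 1},
                            {"word": "good", "punctuated_word": "Good.", "speaker": 0},
                        ]
                    }
                ]
            }
        ]
    }
}

turns = speaker_turns(sample)
```

Joining `turns` with newlines gives a readable transcript that can be shown in the UI or passed on to the chat step.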

Accomplishments that we're proud of

This was my first time using Gradio, and I am proud of putting it to good use in an app that combines two APIs.

What we learned

How to use Gradio.

What's next for AudioChat.ai

I plan to add subtitle generation and translation.
