ReCapAI

recap.docx
logo
prints to terminal too!

Inspiration

An audio recapper/analyzer tutorial is provided on the openai website. The key difference for this project is that gradio is being used as an interface so that anyone in the public can access this via web link

What it does

Recaps into 1 paragraph, analyzes tone, and extracts key points & action items of transcribed audio file so that users can quickly understand the main points and focus that was discussed in an audio file without having to listen to it

How we built it

This project would not be successful without the help of packages such as openai, whispers, and gradio.

Challenges we ran into

interacting with google colab
whispers can't transcribe audio files over 25MB, thus, needed to figure out how to chunk audio files into smaller parts then concatenate them back together for OpenAI chat completion
there were dependencies that needed to be upgraded before pip install gradio & import gradio could be successful
Difficult to get gradio to display the recap.docx file that was generated

Accomplishments that we're proud of

On the backend, chatgpt successfully generates a coherent, concise, and non-hallucinating recap of the audio
On the backend, chatgpt's completion was successfully saved into a Word document

What we learned

chatCompletions and how chatGPT API works
quickly create a professional UI experience with gradio

What's next for ReCapAI

Improve the UI using gradio so that users can upload their own audio file download the recap.docx file that is generated
Allow for users to upload bigger files

Built With

assemblyai
gradio
openai
python

Updates

Wen-Hsin Chen posted an update — Dec 30, 2023 10:45 PM EST

https://github.com/23wc01/ReCapAI

Log in or sign up for Devpost to join the conversation.

Wen-Hsin Chen posted an update — Dec 30, 2023 10:45 PM EST

v2.0.0 updates:

Modified code so that program is compatible with OpenAI API as of Dec. 2023 (see recapai.py)
Switched to implementing AssemblyAI API instead of WhisperAI for transcription (see transcribe.py)
AssemblyAI allows for up audio totaling to 5 hrs/per month for transcription, thus, eliminating the need for audio-chunking for the average audio file
Added pdf as type of file that recap can be outputted into (see output.py)
Improved upon gradio interface so that recap can be outputted to multiple different file types (see gradioUI.py)
Modularized overall code, included documentation & more clarifying comments

Log in or sign up for Devpost to join the conversation.

Wen-Hsin Chen started this project — Nov 05, 2023 10:40 AM EST

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.