Inspiration

An audio recapper/analyzer tutorial is provided on the openai website. The key difference for this project is that gradio is being used as an interface so that anyone in the public can access this via web link

What it does

Recaps into 1 paragraph, analyzes tone, and extracts key points & action items of transcribed audio file so that users can quickly understand the main points and focus that was discussed in an audio file without having to listen to it

How we built it

This project would not be successful without the help of packages such as openai, whispers, and gradio.

Challenges we ran into

  • interacting with google colab
  • whispers can't transcribe audio files over 25MB, thus, needed to figure out how to chunk audio files into smaller parts then concatenate them back together for OpenAI chat completion
  • there were dependencies that needed to be upgraded before pip install gradio & import gradio could be successful
  • Difficult to get gradio to display the recap.docx file that was generated

Accomplishments that we're proud of

  • On the backend, chatgpt successfully generates a coherent, concise, and non-hallucinating recap of the audio
  • On the backend, chatgpt's completion was successfully saved into a Word document

What we learned

  • chatCompletions and how chatGPT API works
  • quickly create a professional UI experience with gradio

What's next for ReCapAI

  • Improve the UI using gradio so that users can upload their own audio file download the recap.docx file that is generated
  • Allow for users to upload bigger files

Built With

Share this project:

Updates

posted an update

v2.0.0 updates:

  • Modified code so that program is compatible with OpenAI API as of Dec. 2023 (see recapai.py)
  • Switched to implementing AssemblyAI API instead of WhisperAI for transcription (see transcribe.py)
  • AssemblyAI allows for up audio totaling to 5 hrs/per month for transcription, thus, eliminating the need for audio-chunking for the average audio file
  • Added pdf as type of file that recap can be outputted into (see output.py)
  • Improved upon gradio interface so that recap can be outputted to multiple different file types (see gradioUI.py)
  • Modularized overall code, included documentation & more clarifying comments

Log in or sign up for Devpost to join the conversation.