Summary and Subtitles

Frontend Home page for Summary and Subtitles Web App
Example of Subtitles extraction

Inspiration

The idea was to get summary and subtitles from audio files. We were curious about how powerful Gemini API was with extracting text from audio hence this project. Moreover, as college students we sometimes have to listen to audio for class hence this was a way to enable us to synthesize this information faster.

What it does

The web app takes a audio file and gets subtitles or summary from this audio file.

How we built it

We used react js for the frontend and Flask for the backend.

Challenges we ran into

We struggled with how to process the audio. We were contemplating on where to pass the file in memory of store it on the server. We trying store the file in memory but that did not work casue we can't pass the audio file to Gemini API.

Accomplishments that we're proud of

Process audio file correctly.
Store file on the server successfully
Spinned up a backend with Flask

What we learned

Learn about the Flask framework.
Learned how to successfully store files to a server
Learnt React JS
Learnt using Git and Github in more depth (especially making of pull requests and the different types)

What's next for Summary and Subtitles

Implement generating subtitles and summaries for Youtube videos
Convert this to a plugin for users to use
Implement an feature to describe videos in-depth for blind people.

Built With

Submitted to

Google AI Hackathon

Created by

I created the React app. It is the frontend part of the project. What you see and interact with is what I built. :)

Rabin Kalikote
I worked on the flask backend. It was my first time wokring with flask. It was fun and I learnt a lot from the documentation.

Gerald Akorli
I worked on the UI design for the homepage of the app. I learned a lot about the different UI design resources.

Asta Shakti Suman Sharma

Updates

Rabin Kalikote started this project — May 03, 2024 02:01 AM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.