Poet.AI

Takes your poetry, visually imagines scenes in a video

Inspiration

My grandmother loves to write poetry.
When I was a kid she would write poems,
We would record her reading them,
I would google images for scenes, edit it all together, and upload to YouTube.
I never forget the glee on her face when we released a new video.
She would go tell all of her friends about her new video!
I created Poet.AI to automate my role as googler and editor,
To use AssemblyAI, GPT-3 and Stable Diffusion to do what I did but much better!

What it does

You upload an audio recording of your poetry,
Poet.AI generates a transcript of what you said,
Uses GPT-3 to partition that into scenes and generate prompts for those scenes,
Uses Stable Diffusion to generate images for those scenes,
Puts it all together in Python and gives it back to you.

How we built it

The website is built in Django and it uses the AssemblyAI, OpenAI GPT-3 and Replicate Stable Diffusion APIs.

Challenges we ran into

It was a pleasure using all of the APIs,
However for a long time the videos for the poetry just didn't look good!
Then I found a blog post of using GPT-3 to make better prompts, implemented that and it made it so much better!

Accomplishments that we're proud of

It actually works and generates beautiful videos!

What we learned

How to use a bunch of different machine learning APIs (Assembly AI, Replicate Stable Diffusion)
How to use GPT-3 for weird things that it probably shouldn't be used for lol (extracting important phrases from text, etc.).
- I would loved to have used Cohere for this but just didn't find the time to learn another new API/toolkit.
Creating a polished UI

What's next for Poet.AI

Making it much faster, it takes longer than 10minutes to generate a single video.

Built With

assemblyai
django
python
stable-diffusion

Updates

Arjun Krishna started this project — Dec 10, 2022 07:19 PM EST

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.