Inspiration
As the saying goes, “A picture is worth a thousand words.” AI can generate text, and it can generate images. So how about AI generating visual stories?
What it does
It takes a short text phrase as input and generates an image as output.
How we built it
We used three models:
- GPT-2: generates a story from a short phrase.
- BERT summarization model: summarizes the generated paragraph and extracts keywords.
- BigGANxCLIP model: generates an image from the keywords.
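The three stages above can be sketched as a chain of functions. This is a minimal sketch rather than the project's actual code: `run_pipeline`, `make_gpt2_story_generator`, and `simple_keywords` are hypothetical names, the word-frequency keyword extractor is only a stand-in for the BERT summarization step, and the image stage is left as a callable to be supplied (for example, a BigGANxCLIP wrapper). The GPT-2 stage assumes the Hugging Face `transformers` text-generation pipeline with the public `gpt2` checkpoint.

```python
import re
from collections import Counter
from typing import Any, Callable, List, Tuple

# Tiny illustrative stopword list (an assumption, not taken from the project).
STOPWORDS = {"the", "a", "an", "and", "of", "to", "in", "on", "was", "is", "it", "with"}


def make_gpt2_story_generator(max_new_tokens: int = 80) -> Callable[[str], str]:
    """Build the story stage lazily so that importing this file downloads nothing."""
    from transformers import pipeline  # imported here to keep the sketch lightweight

    generator = pipeline("text-generation", model="gpt2")

    def generate(prompt: str) -> str:
        # The pipeline returns a list of dicts with a "generated_text" field.
        return generator(prompt, max_new_tokens=max_new_tokens)[0]["generated_text"]

    return generate


def simple_keywords(text: str, top_k: int = 5) -> List[str]:
    """Stand-in for the BERT summarization step: most frequent non-stopwords."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(w for w in words if len(w) > 2 and w not in STOPWORDS)
    return [word for word, _ in counts.most_common(top_k)]


def run_pipeline(
    phrase: str,
    generate_story: Callable[[str], str],
    extract_keywords: Callable[[str], List[str]],
    render_image: Callable[[str], Any],
) -> Tuple[str, str, Any]:
    """Chain the stages: phrase -> story -> keywords -> image."""
    story = generate_story(phrase)
    keywords = extract_keywords(story)
    image_prompt = ", ".join(keywords)
    return story, image_prompt, render_image(image_prompt)
```

Passing each stage in as a callable makes it straightforward to swap one model for another, or to try the flow with stub functions before loading any model on a GPU.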
Challenges we ran into
- Selecting an appropriate model to generate images across various genres.
- The image quality was not as good as expected.
- The models required a GPU and CUDA, which made them difficult to run on a local machine, so we used Google Colab.
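For the GPU point above, a small check like the following can confirm whether CUDA is visible before loading any model. It is a hedged sketch: `pick_device` is a hypothetical helper, not part of the project, and it falls back to the CPU when PyTorch is missing or no GPU is found.

```python
def pick_device() -> str:
    """Return "cuda" when PyTorch can see a GPU, otherwise "cpu"."""
    try:
        import torch
    except ImportError:
        # PyTorch is not installed, so nothing can run on a GPU here.
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


if __name__ == "__main__":
    print("Running on:", pick_device())
```

On Google Colab, this is expected to report `cuda` only after a GPU runtime has been selected for the notebook.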
Accomplishments that we're proud of
To the best of our knowledge, this is the first attempt to create a visual AI story generator model.
What we learned
GANs, neural networks, and the BERT NLP model.
What's next for AI stories generator
- Decrease the number of iterations needed to generate an image.
- Try other text-to-image generators such as DALL-E.
Built With
- bert
- python
- spacy
- torch
- torchvision
- transformers