Inspiration
Inspired by the way people on social media and forums share stories and ideas in picture format. The 4 panel comic has become a staple on the internet as a hybrid between the storytelling of a comic strip and the immediate, simple design of memes. A tool that could allow people to share complex ideas, emotions, and stories using generative AI would enhance peoples abilities to create these posts with less time investment.
What it does
It takes a simple story prompt such as "Superman saves the world", creates a 4 part story for that prompt, creates a comic panel description for each part and then generate an image using said description. This allows user to easily create a complex story from a simple sentence, or watch a more complicated story prompt be brought to life before their eyes.
How we built it
We used PartyRock to string together multiple text generator applications each with specific instructions to turn the initial story prompt into a full story that fit our specification and described each scene. We then used image generation to turn those descriptions into an image that should form part of a cohesive 4 part story.
Challenges we ran into
The first challenge was finding the correct prompts at each stage to give us the desired output. This required us iterating on the wording and structure of the text generation pipeline to give us repeatable desired responses.
The second challenge was finding the correct parameter values for the image generation widgets. We experimented with many different formats until settling on a Digital Art style preset as it gave us the correct art style and was more consistent than the comic book preset. We also decided on a CFG of 18 to find a balance between make the model to follow the prompt as strictly as we could without sacrificing creativity.
Accomplishments that we're proud of
It took a lot of work to create an app that will produce a repeatable product given the same prompt. It required a lot of experimentation with prompts and learning how widgets can be strung together to create pipelines with repeatable outcomes.
What we learned
That the wording of a prompt is very important to ensure that the AI generator creates the desired output.
What's next for 4 Panel Comic Generator
As Stable diffusion improves the quality of the comics will improve and will allow for complete comic strips to be created. An addition of a comic text panel would be a first step toward creating a full comic generator, although it goes against the spirit of this project. The same methodology used here could also be used for AI video generation for short films, which would allow normal people without access to animation tools or skills to create videos and films, thus democratising the creation of animated film.
Built With
- partyrock
Log in or sign up for Devpost to join the conversation.