Inspiration
Every day, millions of small business owners face the same challenge: they need fresh, professional social media content to stay relevant, but they can't afford dedicated designers or expensive agency retainers. I watched local businesses struggle with inconsistent branding, poorly formatted graphics, and the exhausting cycle of content creation that takes hours away from running their actual business.
Aetheria was born from a simple question: What if any business owner could get agency-quality branded content in few minutes?
The vision is democratizing professional design, making it possible for a local bakery to have the same visual presence as a Fortune 500 company.
What it does
Aetheria is an AI-powered brand content generator that creates professional social media graphics and captions tailored to your brand identity.
The workflow is elegantly simple:
Brand Setup — Input your company name, brand color, logo, and optionally a product image
Smart Refinement — AI-generated questions help capture your brand voice, target audience, and content goals
Intelligent Generation — A multi-step AI agent builds a creative brief, designs the visual concept, generates the image, and integrates your logo
Multi-Platform Export — Generate variants in different aspect ratios (1:1 for LinkedIn, 4:5 for Instagram, 9:16 for TikTok/Stories, etc.)
Key Features:
Brand-aware design — Colors, typography, and style match your brand identity
Logo integration — Automatically places your logo appropriately on generated graphics
Product photography — Can incorporate actual product images into marketing materials
Multi-format output — One design, multiple platform-optimized ratios
Caption generation — Platform-appropriate copy with relevant hashtags
Content type flexibility — Social posts or advertisement graphics with CTA overlays
How we built it
Technical Stack: Frontend: React + TypeScript + Tailwind CSS + Framer Motion
AI Orchestration: Gemini 3 Flash with function calling (agentic workflow)
Image Generation: Gemini 3 pro image API
The core innovation is using Gemini as an autonomous creative director. Rather than a simple prompt-to-image pipeline, Aetheria employs a multi-tool agent that:
const agentTools = [ { name: "analyze_brand_identity" }, // Understands brand from inputs { name: "create_creative_brief" }, // Builds structured design direction { name: "generate_background_image" }, // Creates base visual { name: "add_logo_to_image" }, // Composites brand elements { name: "finalize_content" } // Outputs final package ];
The agent reasons through each step, making decisions about:
Visual style (minimalist, bold, elegant, playful)
Color harmony with brand colors
Logo placement (corner, centered, watermark)
Typography style for any text overlays
Composition that leaves space for captions on social platforms
Dynamic Question Generation
Before generation, Gemini analyzes the brand inputs this ensures the AI understands whether you're B2B or B2C, your tone of voice, and specific campaign goals.
Challenges we ran into
- Target Audience Ambiguity Problem: The agent couldn't determine if a brand was B2B or B2C, leading to mismatched content tone.
Solution: Implemented dynamic question generation that specifically probes for audience type, industry context, and communication style before generation begins.
Logo Integration Quality Problem: Logos sometimes appeared awkward—wrong size, poor placement, or clashing with the background.
Schema Validation Errors Problem: AI responses occasionally didn't match expected JSON structure, causing runtime errors.
Model Selection Paradox Problem: Initially assumed Gemini 3 Pro would produce better results than 3 Flash.
Discovery: Flash actually generated superior design directions! Pro's verbosity created overly complex image prompts that confused image model, resulting in cluttered outputs. Flash's concise, focused prompts led to cleaner, more professional designs.
Accomplishments that we're proud of
End-to-end generation in mins — From brand input to finished graphic
Truly brand-aware outputs — Not just slapping a logo on stock images
Seamless aspect ratio variants — One generation, infinite formats
Autonomous creative agent — AI makes design decisions, not just follows instructions
Production-ready quality — Outputs businesses can actually post without embarrassment
What we learned
Model size ≠ better results — Gemini Flash outperformed Pro for creative direction. Conciseness in prompts leads to better image generation.
Agentic workflows unlock creativity — Letting the AI orchestrate multiple tools produces more coherent results than chaining isolated API calls.
Context is everything — The questionnaire dramatically improved output relevance. Asking the right questions matters more than having the most powerful model.
Multimodal is the future — Combining text understanding, image generation, and image analysis (for logo placement) in one flow creates experiences impossible with single-modality AI.
What's next for Aetheria Content
Planned Features:
Daily Content Automation — Brands set up their profile once, receive fresh graphics every morning via email
Content Calendar Integration — Connect to Buffer, Hootsuite, or native platform schedulers
Brand Asset Library — Save multiple logos, product images, and brand guidelines
A/B Variant Generation — Generate multiple creative directions to test
Analytics Feedback Loop — Learn from engagement metrics to improve future generations
The vision is transforming Aetheria from a generation tool into a complete AI creative department that handles the entire content lifecycle, from ideation to posting to optimization.
Built With
- aistudio
- gemini
- react
- tailwind
- typescript
Log in or sign up for Devpost to join the conversation.