-
-
VISAGE is an AI design agent that generates personalized portfolios and comp cards tailored to your unique color palette and story
-
The platform showcases AI-generated design layouts featuring custom color schemes, user photos, and personalized branding elements.
-
Users begin by entering design preferences through text prompts and uploading reference files to visualize their desired mood and contents
-
The AI generates multiple mood board options based on user input, allowing selection of the preferred visual direction and color palette
-
Users select and merge multiple images from their uploads to be incorporated into the final personalized portfolio design
-
Users select their preferred layout from AI-generated design options that arrange their uploaded images in different compositions
-
The generated moodboard preview displays the final composition with options to download or edit further
-
Users can customize their design in the editor by adjusting image settings, styles, with various AI image editor
Inspiration
The creative economy paradox inspired us: millions of actors, models, designers, and freelancers need professional portfolios to compete, yet 80% can't afford designers or lack design skills. We observed comp card production costs ranging from $200-500, creating barriers to entry for emerging talent. We asked: "Can multimodal AI democratize personal branding by understanding not just what users upload, but who they aspire to be?" Visage bridges the gap between raw talent and professional presentation through AI that comprehends aesthetic intent.
What it does
Visage is a multimodal AI design agent that transforms personal branding from a manual, expensive process into an intelligent, automated workflow:
Core Capabilities:
- Aesthetic Intelligence: Analyzes user-uploaded PDFs (resumes, portfolios), images, and natural language descriptions to infer personal brand identity and visual preferences
- Contextual Design Generation: Creates 3-5 portfolio/comp card variations tailored to industry standards (acting, modeling, design, business)
- Adaptive Curation: Learns from user selections to refine aesthetic understanding across iterations
- Cross-format Export: Generates print-ready PDFs, web-optimized images, and editable SVG files
Unique Value Proposition: Unlike template-based tools (Canva, Adobe Express), Visage doesn't just fill blanks—it interprets your creative DNA from heterogeneous inputs (text + image + context) to generate truly personalized designs that reflect your unique brand narrative.
How we built it
Architecture:
User Input → Gemini 3 Analysis → Design Pipeline → Output Generation
Technology Stack & Implementation:
1. Gemini 3 as the Brain (Multimodal Understanding)
- PDF Parsing: Extracts career highlights, skills, achievements from resume/portfolio PDFs using Gemini's document understanding capabilities
- Aesthetic Profiling: Processes natural language prompts ("vintage cinematic", "minimalist tech") to create design constraints
- Cross-modal Reasoning: Correlates text descriptions with uploaded reference images to identify stylistic patterns
2. Imagen 4 for Aesthetic Recommendations and Design Suggestions
- Analyzes user's aesthetic preferences and generates multiple design direction recommendations
- Creates background textures, color palettes, and decorative elements aligned with user's aesthetic profile
- Produces industry-specific design suggestions (e.g., fashion-forward for models, corporate-clean for executives)
- Generates visual mood boards to help users explore different aesthetic directions
3. SVG Generation via Gemini 3 Critical Innovation:
- Traditional text-in-image generators produce pixelated, uneditable text.
- We leverage Gemini 3's SVG code generation to create vector-based layouts with crisp, scalable typography.
- Generates semantic HTML/SVG structures that remain editable post-generation.
4. Nano-Banana Pro for AI Image Enhancement
- Advanced AI-powered image editing and enhancement of user-uploaded photos
- Portrait optimization while preserving authenticity (no uncanny valley effects)
- Automatic background refinement and lighting adjustments
- Ensures uploaded images meet professional portfolio standards
5. Flutter + Firebase Architecture
- Flutter: Cross-platform app (iOS, Android, Web) with 90% code reuse
- Firebase Firestore: Stores user profiles, design history, and aesthetic preferences
- Cloud Storage: Manages uploaded assets (PDFs, images) with automatic optimization
- Cloud Functions: Orchestrates Gemini/Imagen API calls and handles billing logic
Prompt Engineering Breakthrough: We developed a 3-stage prompt cascade:
1. Extraction Prompt: "Analyze this PDF and identify: career level, industry, key achievements, tone (formal/creative)" 2. Aesthetic Reasoning Prompt: "Given [career context] + [user keywords] + [reference images], define: color psychology, typography style, layout density" 3. Generation Prompt: "Create SVG comp card with [aesthetic constraints] ensuring: readability at 300dpi, professional visual hierarchy, cultural appropriateness"
Challenges we ran into
- The "Aesthetic Alignment" Problem
Challenge: Early versions generated beautiful but generic designs—lacking personal connection Solution: Implemented semantic aesthetic mapping. Instead of keyword matching ("minimalist" → white background), we trained Gemini to understand why someone chooses minimalism (e.g., "I'm a UX designer; minimalism shows restraint and focus"). Now prompts include intent: "minimalist to convey professionalism in tech industry"
2. Multimodal Context Loss
- Challenge: Gemini 3 would "forget" PDF contents when processing images in the same session
- Solution*: Developed persistent context injection—converting all inputs into a unified JSON schema that's included in every API call:
3. Text Rendering Quality
- Challenge: Imagen 4 produced visually stunning designs, but text was often illegible or misspelled (common in image generators)
- Breakthrough: Hybrid approach—Imagen generates backgrounds/imagery, Gemini 3 generates SVG overlays with perfect typography. Final composition happens client-side in Flutter for instant editability
4. Design Quality Consistency
- Challenge: How do you ensure consistent professional-grade output across diverse user inputs?
- Solution: Built a multi-stage validation system using Gemini 3 to evaluate designs against professional standards (hierarchy, balance, contrast, alignment, proximity) before presenting to users
5. Latency vs. Quality Trade-off
- Challenge: Full pipeline took 45-60 seconds (unacceptable UX)
- Optimization:
- Parallel API calls (Gemini + Imagen simultaneously)
- Aggressive caching (repeated aesthetic profiles)
- Progressive rendering (show layout wireframes in 5s, full design in 15s)
- Current: 12-18 second end-to-end generation
Accomplishments that we're proud of
Technical Achievements:
- First-ever SVG-based AI portfolio generator leveraging Gemini 3's code generation for editable, print-quality text
- 98.7% text accuracy (vs. 60-70% in pure image generators like DALL-E/Midjourney for text-heavy designs)
- Multimodal fusion pipeline that maintains context across PDF, image, and text inputs without hallucination
- Professional design standards guaranteed through intelligent validation and quality control
AI Innovation:
- Demonstrated that aesthetic reasoning can be taught to LLMs through structured prompt engineering
- Proved hybrid generative approaches (LLM for structure + diffusion for imagery) outperform single-model solutions for design tasks
- Pioneered intent-aware design generation—not just "what" users ask for, but "why" they need it
What we learned
About Multimodal AI:
- Context is king: Gemini 3's strength isn't just understanding images OR text—it's reasoning across modalities. The best results came when we provided narrative context ("I'm a jazz musician targeting upscale venues") alongside visual inputs
- Prompt structure > prompt length: Our best-performing prompts were 150-200 tokens with clear role definitions ("You are a brand strategist...") vs. 500+ token kitchen-sink prompts
About Design AI:
- Users don't want full automation—they want intelligent assistance. V1 auto-published designs; users felt disconnected. V2 generates 3 options + lets users tweak; satisfaction jumped 40%
- Cultural sensitivity matters: Color symbolism varies (white = purity in West, mourning in East). We added geographic context to prompts for culturally appropriate palettes
About Product Development:
- Latency kills adoption: Every 10-second delay → 25% drop in completion rate. Optimization wasn't optional; it was existential
- Edge cases define quality: 80% of dev time went into handling the 20% of users with unusual needs (actors needing headshots + full-body shots, models with 10+ looks, etc.)
About the Gemini Ecosystem:
- Gemini 3 + Imagen 4 is transformative for creative tools: The combination enables "understand intent → generate visual" workflows impossible with single models
- Firebase + Gemini integration is seamless: Cloud Functions calling Vertex AI APIs = production-grade AI apps in weeks, not months
What's next for Visage : AI-Powered Custom Portfolio/Comp Card Design Agent
Q1 2026 - Expansion:
- Industry Templates: Pre-tuned prompts for 15+ professions (musicians, consultants, real estate agents, chefs)
- Video Comp Cards: 15-second reels combining AI-generated motion graphics with user video clips (using Veo 2)
- Multi-language Support: Gemini's multilingual strength → portfolios in 30+ languages with culturally-adapted designs
Q2 2026 - Intelligence:
- Feedback Loop Learning: Every user edit teaches the system—if 70% of users change font X, future generations avoid it
- Competitive Analysis: Upload competitor portfolios → "Here's how to differentiate your design"
- Brand Consistency Enforcement: Organizations can define brand guidelines; Visage ensures all team members' portfolios align
Q3 2026 - Platform & Personal Brand Marketing:
- Personal Brand Marketing Suite: Expand beyond portfolios to comprehensive marketing materials
- Social media content generation (Instagram posts, LinkedIn banners, Twitter headers)
- Email signature designs and digital business cards
- Marketing collateral (flyers, brochures, presentation templates)
- Print-on-Demand Integration: One-click ordering of physical comp cards/portfolios via Printful/Moo
- Portfolio Website Generator: Export designs as live, responsive websites (Flutter Web)
Long-term Vision - Personal Branding Marketing Platform: Transform Visage into a comprehensive personal branding marketing tool:
- Marketing Campaign Generator: AI creates coordinated marketing campaigns across multiple channels maintaining consistent brand identity
- Content Calendar Integration: Automated posting schedules with design variations for different platforms
- Performance Analytics: Track which designs drive the most engagement and conversions
- Brand Evolution: AI monitors career progression and suggests brand refresh strategies
- Networking Tools: Generate customized pitch decks and one-pagers for specific opportunities or clients


Log in or sign up for Devpost to join the conversation.