Plaroom Diet - AI supported parenting
Upload a photo of your child's playroom
Works on desktop and mobile
A multi-agent AI system analyzes the toys
Gemini vision analyzes the image and identifies every toy
Personalized six-month development roadmap and skill radar with projected growth after roadmap
Theme support
Safety-checked recommendations with shopping links
Maps toys to O*NET abilities by the US Department of Labor, includes career prediction
Play Quest based on your current toys, with parenting tips for creative learning using what you already own
Interactive system architecture
The vision behind the project
Playroom Diet

Playroom Diet

Multi-agent AI with Apache Airflow and Gemini meets child development. Just like a diet needs balance, so does play. With Gemini 3, O*NET data, and an intuitive UI, you get a personalized roadmap, because parenting is hard enough.

Playroom Diet is an AI-powered child development tool that transforms chaotic toy piles into science-backed growth plans. Upload a photo of your child's playroom, and a multi-agent AI system analyzes the toys, maps them to professional skill frameworks, and creates a personalized six-month development roadmap. It also delivers play quests that encourage creative use of the toys you already own. Think of it as a nutritionist for your child’s play, ensuring a balanced diet of developmental stimulation.

Next time your kids make a mess in their playroom, don't ask them to clean up right away. Take a photo and use that data.

Try it yourself: playroom-diet.app

Keep in mind: this is a hackathon prototype. Daily limits apply to control costs.

Playroom Mockup

📌 tl;dr

This project demonstrates a multi-agent AI pipeline using Gemini 3 and Apache Airflow:

Agent 1: Vision agent extracts toy inventory with bounding box coordinates for visualization.
Agent 2: Maps toys to the O*NET professional skills framework to identify which cognitive and physical abilities the toys develop and which require further development. Create recommendations of toys to add. Also uses a custom tool function to access an O*NET abilities and occupations database to inform parents about which abilities are important for specific careers.
Agent 3: Safety-checks recommendations against CPSC guidelines and generates search queries to find the suggested toys on platforms like Amazon. Uses a tool function to retrieve up-to-date product data.
Agent 4: Generates a Play Quest using existing toys to learn and enjoy until new toys arrive.

🧸 The result: Open Playroom Diet on any device, snap a photo of the chaos, and instantly get a science-backed development roadmap, verified shopping links, and a creative Play Quest you can start right now with the toys you already have.

Playroom Screenshots

🧠 Gemini 3 integration

Playroom Diet uses both Gemini 3 Pro and Gemini 3 Flash, choosing the right model for each agent's task complexity.

Vision and multimodal understanding is the entry point: the first agent receives raw playroom photographs and identifies every visible toy, including bounding box coordinates for visual feedback. Gemini 3 Flash's state-of-the-art vision capabilities make this reliable even with cluttered, real-world scenes.

Structured output ensures every agent returns data conforming to strict Pydantic schemas, skill scores, roadmap items, safety decisions, making the results directly usable by the frontend without any post-processing.

Function calling enables agents to interact with external systems in real time. The analysis agent calls a custom tool that queries an O*NET database to map toy-based skills to professional careers. The safety agent uses a web search tool to validate toy recommendations against current CPSC guidelines.

Dynamic thinking is tuned per agent using Gemini 3's thinking_level parameter. The vision agent runs at low for fast object detection, the Play Quest agent at medium for creative generation, and the analysis agent at high for deep multi-step reasoning with tool use. The vision agent also uses media_resolution: HIGH for detailed image analysis of cluttered playroom scenes.

Advanced reasoning is where Gemini 3 Pro is being used: the analysis agent performs multi-step reasoning, scoring six developmental categories, identifying gaps, calling the O*NET career tool, and synthesizing scientific recommendations. Pro's superior reasoning depth is reserved for this critical agent, while Flash handles the remaining agents for speed and cost efficiency.

🌟 Potential impact

Parenting is arguably the most important job in the world, yet it is one of the few fields that has remained largely untouched by advanced AI.

As a parent of two, I realized that we often manage our health, finances, and work with sophisticated AI tools, but we manage our children's development with guesswork and by succumbing to toy marketing. Playroom Diet addresses this gap by shifting the paradigm from consuming toys to curating development.

This can have a real impact on parenting:

Democratizing child development expertise
Reducing parental mental load
Promoting sustainable and intentional play
Promoting AI for parenting

💡 Inspiration

Every parent has been there: a room overflowing with toys. At the same time, during nearly every grocery trip, the kids notice new toys they want to add to their playroom. We keep adding toys without really considering a healthy mix that supports cognitive development.

👦👧 Having two kids myself, I've been there, many times.

The problem isn't too few toys, it's the wrong mix.

Just like a diet needs balance across food groups, a child's play environment needs balance across developmental domains. But most parents aren't child development experts, and the toy industry's marketing doesn't help.

Playroom Diet bridges this gap by treating play as a child's job and mapping toy interactions to the same professional skill frameworks used by the US Department of Labor (O*NET). The result? Parents get clear, actionable insights instead of vague advice.

Imagine a world where every playroom is optimized not for quantity, but for developmental quality.

Next time your kids make a mess in their playroom, don't ask them to clean up right away. Take a photo and use that data.

🤖 What it does

Simply upload a photo of your playroom, enter your child's age, and Playroom Diet delivers:

1. Toy inventory with visual heatmap

Gemini vision analyzes the image and identifies every toy, categorizing them by type and play mode. Each toy is mapped with bounding box coordinates for visual feedback. The result shows an image with detected toy clusters.

Toy inventory

2. Ability assessment based on the O*NET framework

The US Department of Labor's O*NET database defines the cognitive and physical abilities that predict success across 1,000+ occupations, from surgeons to software engineers. These are measurable capacities like Visualization, Finger Dexterity, and Oral Expression that form the foundation of career readiness.

These abilities don't suddenly appear in adulthood. They're built through play. A child stacking blocks is training Visualization. Finger painting develops Fine Motor Control. Pretend play builds Social Perceptiveness.

Playroom Diet maps your toys to these O*NET abilities, scoring your playroom across 6 developmental categories:

Cognitive: Problem solving, reasoning, memory.
Fine motor: Finger dexterity, precision, hand-eye coordination.
Gross motor: Body coordination, balance, strength.
Social-emotional: Empathy, cooperation, emotional expression.
Creative: Imagination, artistic expression, open-ended play.
Language: Communication, vocabulary, storytelling.

Skill assessment and development

3. Personalized 6-month roadmap

Based on skill gaps, Playroom Diet generates a prioritized development plan:

Now: Most critical gap to address immediately.
3 months: Second priority for near-term growth.
6 months: Third priority for longer-term development.

For each of the 3 steps, it generates a recommendation on which toy to buy, including the specific O*NET ability being developed and scientific reasoning. It also includes links to matching products on Amazon, Target, and Walmart, for one-click shopping along with a clear explanation of how the toy supports the child’s ability development. It also gives parents an idea of which occupations these abilities may be valuable for.

4. Safety verification based on CPSC guidelines

Every toy recommendation passes through a safety audit against Consumer Product Safety Commission (CPSC) guidelines for the child's age. Unsafe recommendations are automatically substituted with safer alternatives.

Safety adjusted recommendation

5. Play quest

Don't wait for new toys to arrive. A dedicated AI agent creates a fun, structured play activity using toys you already own. Each quest targets a specific O*NET skill, includes step-by-step instructions, and gives parents tips to maximize engagement, to start improving development today.

Play quest

6. Integrations

Google Calendar: Schedule Play Quests directly to your calendar with one click.
Google Shopping: Quick-access links to find recommended toys across retailers.

⚙️ How it was built

Terminology: The analysis of a playroom is called a scan throughout the project. A scan can have multiple states which are explained later.

Tech stack

Backend

Python 3.12 + uv
FastAPI + Uvicorn
Supabase (Postgres + Storage)
APScheduler (periodic background cleanup)
Pydantic (data validation)

Frontend

Vue.js 3 (Composition API)
Vite
Tailwind CSS + daisyUI 5
Vue Router 4
Vue Flow (system visualization)
Three.js (3D background)
canvas-confetti (🎉)

Airflow (AI pipeline orchestration)

Apache Airflow 3.2
Airflow Common AI provider
Pydantic AI
Gemini API with Gemini 3
O*NET abilities and occupations database (data source)

Infrastructure

Backend: Google Cloud Run
Frontend: Firebase Hosting
Airflow: Astronomer Astro
Database: Supabase
Storage: Supabase

All components are implemented in a platform-independent manner. For example, the database can be any PostgreSQL instance, and Airflow can run on any hosting provider, as only open-source features are used. This approach maximizes flexibility, and platforms are chosen solely for cost optimization.

System Architecture

System architecture

The multi-agent pipeline

The heart of Playroom Diet is the orchestrated multi-agent architecture with Gemini and Apache Airflow.

Airflow pipeline

The Airflow project contains one Dag (process_scans), which orchestrates 4 AI agents that transform a playroom photo into a personalized child development plan. Each agent has a focused task with structured Pydantic outputs.

The Dag has no schedule and is meant to be triggered via the Airflow REST API by the Playroom Diet backend every time a playroom picture is uploaded.

Once started, it will claim pending scans from the underlying Postgres scans table using atomic locking. All agent tasks use dynamic task mapping to process scans in parallel, with runtime generated tasks (one per scan). To ensure system stability, the parallelism per task is limited via max_active_tis_per_dag and the overall Dag parallelism is limited via max_active_runs.

Race condition handling

The Dag uses atomic scan claiming via UPDATE ... FOR UPDATE SKIP LOCKED to prevent race conditions:

When get_new_scans runs, it atomically claims open scans by setting their status from processing to in_flight in one atomic operation.
The FOR UPDATE SKIP LOCKED clause ensures concurrent Dag runs claim different scans.
Multiple Dag runs can process in parallel, each working on its own set of scans.
This eliminates wait times in case a user submits a playroom picture while the Dag is running.

Status flow

Each scan is moving through 3 distinct states. The backend creates each scan with the processing status, while the Airflow pipeline updates the status accordingly.

Status flow: processing (new) -> in_flight (claimed) -> done (complete).

Scan status flow

AI agents

The Airflow Common AI provider is an open-source provider package, that facilitates the integration and orchestration of LLMs and AI agents directly within Apache Airflow pipelines. It provides decorator-based tasks to seamlessly incorporate AI functionality into traditional data and machine learning workflows. It is based on PydanticAI so that many of the features can be used.

The pipeline uses two models: gemini-3-pro-preview for the analysis agent (complex reasoning with tool use) and gemini-3-flash-preview for the remaining agents (vision, safety, play quest). Each agent is configured with per-task thinking_level and the vision agent uses media_resolution: HIGH for detailed image analysis.

For Playroom Diet, all agents are defined with a model, a strict Pydantic output type, a system prompt based on the role in the multi-agent system, GoogleModelSettings for model-specific tuning, and some are provided with tool functions.

Example:

@task.agent(
    agent=Agent(
        model="gemini-3-pro-preview",
        output_type=AnalysisResult,
        system_prompt="...",
        tools=[get_careers_for_skill],
        model_settings=GoogleModelSettings(
            google_thinking_config={"thinking_level": "high"}
        )
    )
)

🤖 Agent 1: Vision Agent analyze_image

Model: Gemini 3 Flash (thinking_level: low, media_resolution: HIGH)
Input: Playroom photo from the image storage
Output: ToyInventory with bounding boxes
Purpose: Extract every visible toy with normalized coordinates (0-1 range)

🤖 Agent 2: Room Analyze Agent analyze_playroom

Model: Gemini 3 Pro (thinking_level: high)
Input: Toy inventory + child's age
Tools: get_careers_for_skill (database lookup)
Output: AnalysisResult with skill scores and 3-item roadmap
Purpose: Map toys to O*NET abilities, score 6 development categories, identify gaps, and forecast future careers.

Career forecasting This agent uses a custom tool to query the O*NET database. It connects identified toy-based skills (like "Manual Dexterity") to high-value future careers.

"While magnetic tiles develop basic spatial awareness... High levels of Manual Dexterity are critical for careers like Oral and Maxillofacial Surgeons and General Dentists."

The tool is backed by a custom SQL function that joins the O*NET abilities and occupations tables to find the top 5 careers requiring high levels of a given skill:

create or replace function get_careers_for_skill(skill_name text)
returns table (job_title text)
language sql
set search_path = public
as $$
  select o.title
  from occupations o
  join abilities a on o.onetsoc_code = a.onetsoc_code
  where a.element_name = skill_name
    and a.scale_id = 'LV'
    and (a.data_value::numeric) > 4.5
  order by (a.data_value::numeric) desc
  limit 5;
$$;

🤖 Agent 3: CPSC Child Safety Agent safety_check

Model: Gemini 3 Flash (thinking_level: low)
Input: Development roadmap + child's age
Tools: duckduckgo_search_tool (to search current data about products)
Output: ToyRecommendation with safety decisions
Purpose: Validate against CPSC guidelines, substitute unsafe toys, generate shopping queries

🤖 Agent 4: Play Quest Agent generate_play_quest

Model: Gemini 3 Flash (thinking_level: medium)
Input: Toy inventory + child's age
Output: PlayQuest activity details
Purpose: Create an immediate play activity using existing toys
Note: Runs in parallel with Agent 2

Agent overview

💾 Database

The database (Postgres) is using 3 main tables:

scans: Stores the job state, image path, and final JSON result.
occupations: O*NET occupation data (Imported from Occupation Data.txt).
abilities: O*NET ability scores (Imported from Abilities.txt).

Source:

Database layout

📁 Project Structure

gemini-playroom-diet/
├── frontend/                 # Frontend application
├── backend/                  # Backend server
├── airflow/                  # Airflow implementation
│   └── dags/
│       └── playroom_diet.py  # The multi-agent AI pipeline
├── bin/start                 # Local development startup script
├── DEPLOYMENT.md             # Notes on deployment routine
└── README.md                 # This README :)

📖 What I learned

I learned how to productionize multi-agent AI systems, and why using Apache Airflow is a powerful solution for this.

Using the Airflow Common AI provider for multi-agent pipelines transforms the development of agentic systems from fragile Python scripts into robust, production-grade workflows. By defining agents as tasks within an Airflow Dag, you gain built-in observability. This architecture treats agents not as isolated scripts but as integrated components of a larger data ecosystem, enabling features like automatic retries, caching, and explicit dependency management between agents.

I also learned that with Gemini 3, vision and reasoning capabilities have reached a level where even critical everyday challenges, like parenting, can be supported safely and effectively.

Building Playroom Diet pushed me to think beyond single-prompt AI demos:

Multi-agent orchestration with Airflow creates more robust, maintainable AI systems.
Structured outputs (Pydantic + JSON mode) are essential for production AI.
Professional frameworks (O*NET, CPSC) add credibility and real-world value.
Visual feedback (heatmaps, detection boxes) builds user trust in AI decisions.

Most important, I was able to solve a real problem that I, as a parent, faced every day.

➡️ What's next

Toy donation recommendation: Add a feature to suggest toys for donations for other kids, and directly link toy donation websites based on over-developed abilities.
Wish list: Include an optional wish list where parents can list toys their kids want, allowing these preferences to be incorporated into recommendations. This can also involve suggesting variations of wishes to create a better balance of play. This could be an interesting feature for Christmas 🎄.
Toy variations: Generate more toy recommendations per roadmap stage, so that parents can dismiss items and have alternatives prepared.
Progress visualization: Use Airflow callback functions to update the status of scans after each step, and use this data to visualize the progress at the loading screen.
Scaling: To keep costs near zero, all components are scaled to the minimum. A playroom scan takes a couple of minutes, mainly because the backend and Airflow infrastructure automatically scale to zero and often require a warm-up phase. Depending on how well the project is received, this can be improved by scaling the infrastructure accordingly.
More parenting support: Toy management is just the beginning. Parenting is an underexplored AI use case, and many features in this area can be added.

❤️ Vision

Every parent feels overwhelmed. The challenges are constant, the decisions endless. Playroom Diet is a step toward a future where AI helps us with the most important job of all, raising our children.

But this is just the beginning.

As a father, I see opportunities everywhere in my daily life. My daughter is learning to write, but she struggles with specific letters, sometimes writes them backwards, sometimes just wrong. What if a service like Playroom Diet could analyze a photo of her homework, identify the patterns, and generate playful, research-backed activities to help us work through them together?

The possibilities of Gemini in parenting are endless. And they're worth exploring.

The skills they build today become the strengths they carry forever.

Built With

airflow
fastapi
firebase
gcp
gemini
postgresql
pydantic
pydanticai
python
supabase
tailwind
three.js
uv
vite
vue.js