NAVI — An end-to-end Coding Agent for building, testing, and deploying Apps in ChatGPT

One-liner: NAVI is a specialized, autonomous coding agent that scaffolds, iterates, evaluates, and ships Apps in ChatGPT using OpenAI’s new Apps SDK—from the first prompt to production. ([OpenAI][1])

Why NAVI

Building Apps in ChatGPT unlocks interactive, in-chat experiences that can call tools, render UIs, transact, and connect to your data. But stitching together specs, code, tests, evals, and deployment is still tedious. NAVI compresses that entire loop into a single conversational workflow. ([OpenAI][1])

What NAVI does

Scaffold an App Turns a natural-language spec into a working ChatGPT App project (routes, UI surfaces, tool definitions, and server functions) aligned with the Apps SDK. ([OpenAI][1])
Code + Refactor Implements features, manages prompts, and keeps the codebase consistent as requirements change.
Test & Evaluate Generates unit/integration tests and agentic evals; runs them locally and in CI. (Pairs well with OpenAI’s emerging agent tooling and evaluation patterns.) ([OpenAI][2])
Ship to ChatGPT Prepares manifests, environment variables, and submits updates for your App’s preview and release channels inside ChatGPT. ([OpenAI][1])
Operate & Improve Watches logs/telemetry, proposes fixes, and applies safe migrations with rollbacks.

How it works (high level)

Describe your App in plain language (use cases, data sources, tools, UI needs).
NAVI proposes a plan: architecture, SDK surface, tool contracts, evals.
Autonomous build: code generation + dependency install + lint/format.
Run evals/tests: produce a quality gate and diffs.
Preview deploy: spin up a preview and register with ChatGPT Apps.
Promote to prod with semantic versioning, changelog, and rollback hooks. ([OpenAI][1])

Key capabilities

Apps SDK–native: Targets the official Apps in ChatGPT model—NAVI outputs projects that plug directly into ChatGPT’s in-conversation app surfaces and tool calling. ([OpenAI][1])
Agent-first architecture: Uses OpenAI’s latest agent patterns (planning, tool use, retries, reflection) to complete multi-step tasks with minimal hand-holding. ([OpenAI][3])
AgentKit alignment (optional): If your stack uses OpenAI’s AgentKit building blocks (workflows, connectors, evals), NAVI maps to them automatically. ([OpenAI][2])

Architecture (conceptual)

User ↔ NAVI (planner/critic loops)
       ├─ Apps SDK Project Generator  ──► /app (UI, routes, manifests)
       ├─ Tool/Connector Builder      ──► /tools (schemas, auth, SDK calls)
       ├─ Test & Eval Orchestrator    ──► /tests, /evals (gates, reports)
       ├─ Deployer                    ──► ChatGPT App preview/prod
       └─ Ops Monitor                 ──► logs, metrics, remediation PRs

Example conversation → outcome

You: “Build a travel-planner App in ChatGPT that searches flights and saves itineraries to Notion.”
NAVI:
- drafts the project plan and SDK surfaces
- generates UI and tool handlers (flight search, Notion API)
- writes tests/evals, runs them, fixes regressions
- deploys a preview inside ChatGPT and returns the App link + usage notes. ([OpenAI][1])

Who is NAVI for?

Teams launching Apps in ChatGPT fast (POCs → production).
Product engineers who want agentic automation across build/test/deploy.
Startups and enterprises standardizing on OpenAI’s newest Apps + agent stack. ([OpenAI][1])

Built With

love
vibeco.ai

Updates

Andres Niño started this project — Oct 18, 2025 06:50 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.