FixVision AI

A premium, glassmorphic landing page featuring an intuitive 'Scan & Analyze' interface for instant device diagnostics.
Fully responsive, vertically scrollable mobile interface designed for precision repair guidance while you're on the go.
Comprehensive 3-column analysis: Visual annotations identifying parts, step-by-step repair guides, and a context-aware AI chat assistant.

Inspiration

Every year, millions of devices end up in landfills simply because users don't know how to fix them. Professional repair services are expensive, and online tutorials are often confusing or irrelevant to the specific problem at hand.

We asked: What if AI could look at your broken device and tell you exactly how to fix it?

What it does

FixVision AI is an agentic AI-powered repair assistant that:

Sees the Problem — Upload a photo of your broken device (phone, laptop, appliance).
Analyzes with Vision AI — Our agentic system uses Gemini's multimodal capabilities to identify the issue.
Searches for Solutions — Uses Google Search Grounding to find the latest repair guides, error codes, and troubleshooting steps.
Generates Step-by-Step Guides — Receive clear, actionable repair instructions with cited sources.
Annotates Visually — Get an annotated image highlighting exactly where to focus.
Chat for Follow-ups — Ask follow-up questions using our AI chat powered by Gemini 2.5 Flash Lite with real-time web search.

It's like having a professional repair technician in your pocket — available 24/7, with access to the latest repair knowledge from the web.

How we built it

Layer	Technology
Frontend	Next.js 16 (App Router), React 19, Framer Motion
Styling	Vanilla CSS with Glassmorphism 3.0 design system
AI Engine	Google Gemini 3 Flash (Agentic Vision + Google Search Grounding)
Chat AI	Gemini 2.5 Flash Lite with Google Search Grounding
Image Processing	Sharp.js for server-side annotation
Deployment	Google Cloud Run, Artifact Registry, Cloud Build
Runtime	Bun (ultra-fast builds) + Node.js Alpine (production)

Agentic Architecture

Our AI doesn't just respond — it thinks, searches, acts, and observes in a loop:

Think: Analyze the image and identify potential issues.
Search: Query Google for the latest repair documentation and solutions.
Act: Generate repair steps and bounding box coordinates with cited sources.
Observe: Validate the output and refine if needed.

This agentic approach ensures high-quality, contextually accurate results every time.

Challenges we ran into

Bounding Box Precision: Getting Gemini to return pixel-accurate coordinates required careful prompt engineering.
Mobile-First Design: Fitting a rich experience into a "fit-to-viewport" constraint without scrolling was a UI engineering challenge.
Cold Start Optimization: We implemented a multi-stage Docker build with Bun to achieve sub-4-second dependency installs and tiny production images.

Accomplishments we're proud of

Sub-2-minute Full Repair Analysis — From upload to annotated step-by-step guide.
Premium UI/UX — Inspired by high-end Framer designs with glassmorphism, motion, and responsive layouts.
Production-Ready Deployment — Fully containerized and deployed on Google Cloud Run with CI/CD via Cloud Build.
Zero External Dependencies — All processing happens server-side; no client-side API keys exposed.

What we learned

Agentic AI Patterns: How to structure prompts for multi-step reasoning and tool use.
Multimodal Vision: Leveraging Gemini's vision capabilities for real-world object detection.
Cloud-Native Development: End-to-end deployment with Artifact Registry, Cloud Build, and Cloud Run.
Performance Engineering: Optimizing Docker builds with Bun and Next.js standalone mode.

What's next for FixVision AI

Video Analysis — Upload a video of the issue for even more context.
Parts Marketplace Integration — Automatically suggest replacement parts with purchase links.
AR Overlay — Use your phone camera to overlay repair instructions in real-time.
Community Repair Database — Crowdsourced repair tips and success stories.

Built With

bun
cloud-build
docker
framer-motion
google-cloud-run
google-gemini-ai
next.js
node.js
react
sharp.js
typescript

Updates

Dzakwan Alifi started this project — Jan 28, 2026 07:29 AM EST

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.