About the Project

FooCam is a playful, AI-powered mobile application designed to turn anyone from a "potato-camera" shooter into a confident photographer. Unlike typical editing apps that use filters to mask poor technique, FooCam acts as a real-time coach. It focuses on teaching the "why" behind a great photo—composition, lighting, and storytelling—so that the user’s skill becomes their best editor.


Inspiration: From Automation to Augmentation

The inspiration for FooCam was born out of frustration with the "filter culture" of the early 2020s. I realized that while AI was getting better at fixing bad photos, it was doing nothing to make people better photographers. Users were becoming dependent on algorithms to hide a lack of skill, leading to a sea of identical, over-processed images.

The turning point came when I started working with the Gemini 3 Flash and Nano Banana ecosystem in late 2025. Four specific breakthroughs shifted my vision:

  1. The "Agentic Coach" Paradigm: Watching Gemini 3’s ability to analyze live video of sports, providing posture corrections and tactical drills in real time, made me ask: "Why can't we do this for the lens?" I was inspired to move away from reactive chatbots and toward a proactive "Photography Agent" that understands artistic intent, not just pixels.

  2. Nano Banana’s Visual North Star: Previously, AI image generation felt like a "black box." But Nano Banana Pro’s ability to maintain character and style consistency across edits showed me that AI could be used to create a "Visual Target." I realized I could use this to show users a professional-grade version of their own shot—not as a replacement, but as a teaching tool to illustrate better framing and lighting.

  3. The Persistence of Skill: I was deeply moved by the philosophical shift in 2026 toward Augmented Intelligence. While an AI can generate an image, it cannot experience the "decisive moment." My goal was to use Gemini’s multimodal reasoning to bridge the gap between human intuition and technical mastery.

  4. The Collaborative Codebase: Perhaps the most meta realization was the efficiency of the build itself. Gemini generated 90% of FooCam’s codebase, acting as a lead engineer that handled the heavy lifting of complex computer vision boilerplate and UI logic. This allowed me to focus entirely on the "soul" of the app (the creative pedagogy and user experience), proving that AI doesn't just augment the artist; it also empowers the architect.

I wanted to prove that Skill is Forever. By using AI to explain the physics of light and the math of composition, FooCam ensures that when the phone goes away, the photographer’s eye remains. As we say in the dev docs:

"We don't want to build the photographer you wish AI could be; we want to help you become the photographer that AI can only simulate."
