Inspiration
Traditional construction estimation is a time-consuming, error-prone process requiring specialized expertise. We were inspired by the potential to democratize this knowledge using AI, making professional-grade estimation accessible to contractors, builders, and homeowners worldwide—especially in emerging markets where technical resources are limited.
What it does
BuildEstimate AI leverages Gemini 3's multimodal capabilities to transform construction workflows: Image analysis: Processes photos of building sites or blueprints to identify materials and dimensions, meaning that you simply upload any image or drawing blueprint in pdf of a building structure → and see the instant estimate
How we built it
We integrated Gemini 3's vision-language model as our core engine, feeding it: Construction images with geolocation metadata User-specified regional parameters The model processes these inputs through a specialized prompt engineering framework that: Extracts structural elements from visual data Cross-references materials with local availability databases Generates technical instructions following industry best practices Frontend implementation uses React with TypeScript, while camera functionality leverages the MediaDevices API with careful stream management to ensure hardware resources are properly released.
Challenges we ran into
Multimodal alignment: Ensuring consistent interpretation between visual inputs and textual outputs required extensive prompt tuning Regional specificity: Building accurate supplier databases for diverse countries (Nigeria, India, UK, USA) demanded significant research Camera resource management: Preventing persistent camera activation required implementing robust cleanup protocols across all navigation paths Technical instruction quality: Generating actionable "Setting Out" guidance that meets professional standards necessitated iterative refinement of the AI prompt structure.
Accomplishments that we're proud of
Successfully implemented location-aware material recommendations that reference specific regional requirements (e.g., coastal corrosion protection) Created a comprehensive country-specific supplier network covering major construction markets globally
What we learned
Through this project, we discovered that effective AI integration in specialized domains requires: Contextual precision: Generic responses are insufficient; outputs must reflect domain-specific constraints Multimodal synergy: Combining visual analysis with geographic context yields significantly more valuable insights than either modality alone Resource responsibility: Hardware access (camera/microphone) demands meticulous lifecycle management to maintain user trust Cultural adaptation: Technical solutions must account for regional variations in both materials and practices
What's next for BuildEstimate-AI
Enhanced AR integration: Real-time overlay of material quantities and costs onto live camera views. Expanded regional coverage: Adding detailed supplier networks Collaborative features: Multi-user project sharing with version-controlled estimate comparisons. Regulatory compliance: Direct integration with local building code databases for automatic compliance verification.
Built With
- github
- google-ai
Log in or sign up for Devpost to join the conversation.