Idea Navigator

Demo 1
Demo 2

💡 About the Project: Idea Navigator

🚀 Inspiration

Every startup begins with a spark—but too often, that spark fades due to unvalidated assumptions, misaligned market targeting, or lack of clarity on competitive positioning. Our team at AgentHacks 2025 wanted to solve this problem for early-stage founders and builders.

We asked ourselves: "What if founders had an intelligent partner that could break down their startup idea across strategic layers and provide actionable feedback?" That’s how IdeaForge AI was born—a system designed to simulate a product strategist, market analyst, and investor rolled into one AI-powered evaluation engine.

🧠 What It Does

IdeaForge AI is a multi-layered agentic evaluation framework that takes a raw startup idea and analyzes it across 5 core dimensions:

Vision & Macro Trends – Jobs-to-be-Done, Blue/Red Ocean strategy, emerging shifts
Customer Fit & Validation – Painkiller vs. Vitamin, ICP definition, adoption curve
Market & Competitive Analysis – Real-time competitor search with Serper + Claude synthesis
Business Model Viability – Revenue streams, unit economics, defensibility
Execution & KPIs – MVP, validation loops, GTM strategy, and traction metrics

The result is an interactive evaluation report, with confidence scores, extracted insights, and suggested next steps—generated completely autonomously by a crew of AI agents.

🛠️ How We Built It

We built IdeaForge AI as a fully agentic, modular system using the following stack:

Language Model: Claude 3 (Haiku/Sonnet) via anthropic SDK, orchestrated through crewai
Tool Integration:
- Serper.dev for real-time Google search-based competitive intelligence
- Claude agents for market trend synthesis and vision analysis
Crew Architecture:
- Built custom agents (strategist, analyst) with separate roles and tools
- Defined task flows across 5 analysis layers using crewai.Task + Crew(process=sequential)
Backend: Flask REST API (/start, /analyze, /summary) with CORS-enabled endpoints
Frontend: React-based UI with step-by-step walkthroughs (in progress)
Key Libraries: anthropic, crewai, flask, serper, pydantic, openai, chroma, uvicorn

Environment managed via a comprehensive requirements.txt with explicit version control for reproducibility.

⚔️ Challenges We Faced

LLM Rate Limiting: Integrated custom retry logic for Claude API with exponential backoff and error parsing to gracefully handle 429 and connection errors
Multi-Agent Orchestration: Synchronizing tasks across strategist and analyst agents while maintaining session state and shared context
Tool Routing: Balancing LLM synthesis with live search (Serper) required intelligent delegation per agent role
Model Compatibility: Adapting prompt engineering and output parsing for Claude’s structured response format (especially for insight and confidence extraction)

🎓 What We Learned

How to design agentic systems using CrewAI with clear goal-role-tool separation
Best practices in modular LLM toolchain design, including retry logic, confidence scoring, and insight distillation
Combining real-time search APIs with LLM reasoning for grounded, up-to-date evaluations
How to build a system that moves beyond “chat” to structured, layered decision support

🏆 Accomplishments We’re Proud Of

🧠 Built a multi-agent system that autonomously evaluates startup ideas across 5 strategic dimensions
🛠️ Implemented real-time search + LLM synthesis hybrid tools, ensuring outputs are grounded in live data
🧩 Created a fully functional Flask-based API backend that powers an LLM-driven evaluation engine
⏱️ Designed resilient retry logic to handle rate limits and keep analysis flowing under pressure
🏁 Successfully ran end-to-end evaluations for 10+ startup ideas with structured reports and confidence metrics
🥇 Developed and deployed within 24 hours at AgentHacks 2025 under the “Agentic Web Assistants” track

🔮 What’s Next

Build a founder-facing UX for non-technical users to submit ideas and track revisions
Add PDF export & pitch deck generation based on the insights
Integrate a feedback loop where human mentors can refine or score AI-generated outputs
Train fine-tuned LLMs for specific verticals like healthtech, edtech, and climate startups

Built With

analyze`
and-`/summary`-endpoints-??-**python**-?-core-language-for-agent-logic
and-insight-extraction-?-**flask**-?-backend-api-server-powering-`/start`
anthropic
chromadb
claude3
codespaces
crewai
css
curl
dataclasses
fastapi
flask
html
javascript
json
litellm
pydantic
python
requests
serper
tool-definitions
trend-synthesis

Updates

Private user started this project — May 24, 2025 07:10 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.