Inspiration
Millions of people spend hours filling out government forms—repeatedly copying their identity details from documents into forms across different agencies. Governments struggle with manual verification delays. HR teams waste time on employee onboarding paperwork. We saw an opportunity to apply agentic AI to this real-world problem: automate the cognitive work of document extraction, compliance validation, and intelligent form mapping.
What it does
FormPilot is an active agent orchestration system that:
- Extracts structured identity data from government ID documents (Aadhaar, Passport, PAN, Driving Licence) using Gemini Pro Vision
- Validates eligibility rules deterministically (UIDAI/GST consistency, age requirements, document validity) with optional human review
- Maps extracted data semantically to target form fields across different application schemas
- Generates production-ready PDFs with embedded confidence scores and audit trails
- Notifies via Slack and uploads to SharePoint—integration-ready for enterprise workflows
Reduces form filling from 30-60 minutes to under 3 seconds. Audit trail and HITL governance built in.
How we built it
- Backend: Python 3.12 FastAPI microservices with a deterministic compliance rule engine
- AI orchestration: Airia platform integration for multi-agent pipeline execution and HITL checkpoints
- Vision AI: Google Gemini 1.5 Flash for document OCR and field extraction
- Compliance: Custom rule engine supporting India (Aadhaar/GST), USA, UK, and Canada eligibility checks
- PDF generation: ReportLab with metadata embedding and confidence scoring
- Integrations: Slack Block Kit messaging + Microsoft Graph API for SharePoint uploads
- Frontend: Vanilla HTML/CSS/JavaScript dashboard (no build tooling) showcasing real-time pipeline visualization
- Testing: Pytest with synthetic test assets and benchmark case studies (100-run validation suite)
Challenges we ran into
- Document OCR accuracy variance: Different document orientations, lighting conditions, and wear patterns required fine-tuning Gemini prompts and confidence thresholding
- Field mapping ambiguity: Government forms are inconsistent in field naming and structure. Solved with semantic similarity + FuzzyWuzzy fallback
- Compliance rule complexity: Eligibility rules vary significantly by country and application type. Implemented extensible rule engine with operator precedence
- HITL workflow timeout: Users could disappear mid-approval. Implemented configurable timeouts with rejection-on-timeout safety
- PDF generation performance: ReportLab rendering was slow at scale. Optimized with template caching and streaming
Accomplishments that we're proud of
✅ Production-grade system: Not a prototype—includes audit trails, error handling, and enterprise integrations (Slack, SharePoint, Graph API)
✅ Deterministic compliance: All validation rules are transparent, reproducible, and explainable—critical for government auditability
✅ Human governance: HITL approval workflow with configurable triggers ensures accountability in high-stakes scenarios
✅ Multi-country support: Supports India, USA, UK, and Canada with document type detection and country-specific rule sets
✅ Sub-3-second pipeline: End-to-end processing (extract → validate → map → PDF) completes in milliseconds
✅ Agentic architecture: Designed for orchestration platforms (Airia) with pluggable tools and deterministic state transitions
✅ Full feature coverage: Dashboard, API docs, persistent workflows, benchmark case studies, and automated test suite all shipping
What we learned
- AI isn't enough: Confidence scoring from LLMs needs deterministic fallback logic to be enterprise-ready
- Compliance automation requires explainability: Government workflows need transparent audit trails of why decisions were made, not just what was decided
- HITL is critical infrastructure: Humans should govern risky decisions, not be replaced by AI; good UX for approval workflows is essential
- Document extraction is domain-specific: Generic OCR doesn't cut it; domain-adapted prompts (with sample documents as context) dramatically improve accuracy
- Enterprise wins on integration: The real value is Slack notifications, SharePoint uploads, and audit trails—not just the core algorithm
What's next for FormPilot
- Expansion to 20+ countries: Scale compliance rules to ASEAN, EU, and Middle East
- Multi-document fusion: Extract and validate across multiple documents (e.g., ID + proof of residence)
- Real-time document liveness: Add face-match verification for identity document authentication
- Mobile app: Native iOS/Android for field workers and remote applicants
- API marketplace: Publish FormPilot agents to Airia Community with managed compliance rule packs by vertical (HR, Finance, Government)
- Batch processing: Support bulk form generation for organizations processing thousands of applications
Log in or sign up for Devpost to join the conversation.