💡 Inspiration
In financial institutions, cheque verification and processing still depend on manual effort or costly legacy OCR systems. Even modern cloud-based AI tools introduce data privacy risks, as sensitive financial information leaves the organization's secure environment.
We were inspired by the potential of Chrome’s Built-in AI APIs — especially Gemini Nano — to move AI computation on-device. This lets banks and financial teams analyze cheques instantly, securely, and cost-free — all within the browser, no data ever leaves the device.
⚙️ What It Does
ChequeAI is a fully client-side web application that:
- Analyzes cheque images locally using Chrome’s Built-in AI (Gemini Nano via
LanguageModel.prompt()). - Extracts structured data—including Payee, Amount (words & figures), IFSC, Date, and Cheque Number etc—using on-device multimodal - Prompt API Gemini Nano + Zod Schema (no cloud OCR).
- Smart Review Interface: Editable form synced with image preview (zoom/rotate/mirror) so users can verify or correct extracted fields.
- Validates data integrity by detecting inconsistencies like amount mismatches or missing fields.
- Supports voice and text remarks that feed into the AI analysis.
- Generates a forensic AI report (via
Prompt API Multimodel) - Generates Summary via Summarizer API
- Translates the Summary on-device into multiple languages (via
Translator.translate()). - Runs 100% offline—no data leaves the browser, aligning with Zero-Trust and Privacy-by-Design principles.
🛠️ How We Built It
- Frontend Framework: Next.js 15+ (App Router) + TypeScript
- Styling: Tailwind CSS v4 + shadcn/ui
- AI Engine: Chrome Built-in AI (
LanguageModel - Prompt API,LanguageDetector API,Translator API,Summarizer API) Prompt APIs Used:
prompt()→ field extraction (OCR + responseConstraint Schema) & analysissummarize()→ generate summarize of final reportstranslate()→ instant translation of summary
Architecture Overview
[ User Uploads Cheque ]
↓
[ Next.js Frontend (Client Only) ]
↓
[ Chrome Built-in AI (Gemini Nano) ]
↓
[ Structured Fields]
↓
[Editable Form & Preview]
↓
[Remarks - Text & Audio Input (Optional)]
↓
[Analysis - AI Report → Summary (on Demand) → Translation (On Demand) ]
- Zero Backend: No server or cloud components.
- No API Keys: Uses only built-in browser AI models.
- All Processing: Performed securely within Chrome’s sandbox.
🚧 Challenges We Ran Into
- Managing varied cheque formats and handwriting styles using prompt-based extraction — Gemini Nano often produced inconsistent data, leading to integrity issues during extraction, summarization, and translation (both requiring user interaction, limiting automation).
- Dealing with the large model size, which increases first-time load time.
- Ensuring a fully offline experience while keeping the UI fast, responsive, and modern.
- Designing a privacy-first architecture — no cloud, no telemetry, no cookies.
- Integrating multiple Built-in AI APIs (
Prompt API - LanguageModel,SummarizerTranslator) seamlessly within the browser environment. - Debugging and testing with experimental Chrome flags for Gemini Nano support.
- Handling undocumented TypeScript types, requiring manual exploration and experimentation.
- On Summary Markdown in Indian Language not generate correct format
🏅 Accomplishments We’re Proud Of
- First working prototype of a fully offline financial document analyzer.
- Achieved complete on-device AI pipeline — no network calls at all.
- Demonstrated real-time field extraction accuracy > 85% for standard cheque formats.
- Built a modern UI with smooth step-based workflow (Upload → Review → Remarks → Analyze → Summary → Translate in Local Language).
- Proved enterprise-ready compliance through Zero-Trust architecture.
- Successfully integrated experimental Chrome AI APIs.
📚 What We Learned
- On-device AI is ready for enterprise-grade privacy use cases.
- Building with Built-in AI (Gemini Nano, Prompt API, Translator) is powerful but still evolving — documentation gaps require deep experimentation.
- Prompt design and schema validation with structured-outputs are key to maintaining data accuracy in multimodal extraction.
- Offline-first web apps can be powerful and secure when paired with browser-native AI.
- Chrome’s Built-in AI APIs offer a new paradigm — lightweight AI without infrastructure.
- Strong documentation and developer ergonomics make adoption easier for teams.
🚀 What’s Next for ChequeAI
- Enhanced On-Device AI Models – Optimize Gemini Nano pipelines for faster, smaller, and more accurate cheque data extraction.
- Add signature & handwriting verification modules.
- Enhance forensic analysis with MICR line validation, signature presence detection, and tamper-heuristic rules.
- Support batch upload and analysis dashboard for auditors.
- Collaborate with Chrome AI & fintech partners to promote secure, privacy-first AI adoption.
Built With
- chrome
- gemini-nano
- languagemodel
- nextjs
- pnpm
- react
- tailwind
- typescript
- vercel
- zod
Log in or sign up for Devpost to join the conversation.