Overtab

Your on-device AI browsing companion - private, fast, and built with Gemini Nano!
Highlight text to get instant, privacy-safe explanations and summaries.
Ask questions by voice and get instant page insights: no typing needed.
Right-click any image to get AI-powered descriptions, fully on-device.
All processing happens locally. No cloud, no data collection (ever).

💡Inspiration:

The web is overflowing with information, but understanding it shouldn't require copy-pasting into AI tools or leaving your browsing flow. When Google announced the Chrome Built-in AI Challenge with Gemini Nano, I saw an opportunity to build something truly privacy-first: an AI assistant that lives in your browser, processes everything on-device, and never breaks your reading flow :)

I wanted to solve a simple problem: "What if you could highlight any text, right-click any image, or speak any question and get instant, intelligent help - without data ever leaving your device?" That's how Overtab was born!

🎯 What it does:

Overtab is your privacy-first AI browsing companion that makes understanding the web effortless:

Highlight text → Get instant explanations, simplifications, or translations (6 languages)
Right-click images → AI describes them using page context
Speak a question → Voice command answers using the current page
Generate research links → Find similar resources on any topic
Chat with context → Ask follow-up questions about anything

Everything appears in a clean sidebar. Your browsing never gets interrupted.

🛠️ How I built it:

Chrome Extension (Manifest V3) with a smart AI fallback system:

Primary: Chrome Built-in AI (Gemini Nano)
- 100% on-device, zero network requests
- Uses 5 different APIs: LanguageModel, Summarizer, Rewriter, Translator, Proofreader
Optional: OpenAI fallback
- Users provide their own API key (encrypted client-side with AES-GCM)
- Only used if Gemini unavailable or user chooses

Key Technical Features:

Web Speech Recognition for voice commands
Smart image context extraction (11 metadata sources)
URL verification system (catches AI hallucinations)
Client-side encryption (PBKDF2 + AES-GCM)
Conversation state management

🧩 Challenges I ran into:

1. Chrome's AI has no vision capabilities - can't actually "see" images
Created a context extraction system that scrapes 11 metadata sources (captions, alt text, filenames, aria-labels) and generates descriptions from what the page says about the image.

2. AI hallucinates non-existent URLs when generating research links
Built an async verification pipeline that fetches each URL, checks status codes, inspects content for hidden 404s, normalizes redirects, and deduplicates - only showing verified links.

3. Storing API keys securely without a backend server
Implemented client-side encryption using Web Crypto API: PBKDF2 key derivation (100K iterations) + AES-GCM encryption. Keys stay encrypted in local storage; passcode lives in session storage and clears on browser close.

🏆 Accomplishments I am proud of:

✅ Live on Chrome Web Store! - Published, approved, and ready for real users.
✅ Zero-server architecture - Literally no backend. Everything runs in-browser.
✅ 5 Chrome AI APIs integrated - LanguageModel, Summarizer, Rewriter, Translator, Proofreader all working together seamlessly.
✅ URL verification prevents hallucinations - 60% of AI-generated links were broken before verification. Now 0%.
✅ Beautiful UX that feels native - Tooltips, sidebar, voice UI: everything feels like it's part of Chrome.
✅ Military-grade encryption, zero backend - AES-GCM + PBKDF2 protecting API keys entirely client-side.

📚 What I learned:

Prompt engineering is everything with Gemini Nano - be explicit, directive, structured
Privacy is a killer feature: users love "no servers, no tracking"
AI-generated URLs need verification - 60% were broken without it
Sidebar > New tabs for AI results - never interrupt the user's flow!

🚀 What's next for Overtab

Short-term:

Vision API integration (when Chrome releases it)
More languages + inline translation
Summarization presets (TL;DR, academic, bullet points)

Long-term:

Writing assistant with tone control
Multi-page research mode
Text-to-speech for accessibility
Browser-to-browser sharing (WebRTC, no servers)

So here is:

Overtab: Highlight. Understand. Move on.

🔒 Privacy-first • 🤖 On-device AI • ⚡ Zero interruptions

Built With

aes-encryption
chrome-ai
chrome-extensions-api
chrome-scripting
chrome-storage
css3
fetch-api
gemini-nano
html5
javascript
manifest
openai-api
pbkdf2
web-crypto
web-speech-api

Updates

Riyanshi Bohra started this project — Oct 15, 2025 03:40 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.