🌍 Global Pathfinder
The AI Immigration Lawyer that Reads, Listens, and Teaches
"Mistral Eyes • Upstash Memory • Gemini Brain • ElevenLabs Voice"
💡 Inspiration
Moving to a new country is terrifying. It's not just culture shock—it's the paperwork.
Official immigration documents—like the 116-page US Citizenship Guide—are dense, legalistic, and unforgiving. A single missed detail (wrong ink color, deadline missed by one day) can mean visa rejection.
The reality:
- Immigration lawyers charge $300+/hour
- Most applicants rely on forums and rumors—often wrong
- 40% of visa rejections are due to preventable errors
I wanted to build something different. Not a system that "summarizes" text—but one that understands legal nuance and acts as a proactive companion.
The goal: Turn that scary 116-page PDF into a friendly, intelligent voice that guides you home.
🚀 What It Does
Global Pathfinder is an Agentic AI platform for Global Mobility. It transforms complex PDF documents into interactive, voice-enabled study sessions.
| Feature | Description |
|---|---|
| 📄 Deep Document Understanding | Drag-and-drop massive handbooks. Multimodal OCR reads every table, footnote, and checkbox. |
| 🎙️ Real-Time Voice Consultation | Talk naturally: "Does the age exemption apply to me?" Get instant answers with citations. |
| 🧠 Proactive Teaching | The "Smart Memory" predicts when you'll forget critical rules and quizzes you before your interview. |
⚙️ How We Built It (The Architecture)
We moved beyond simple RAG to build a fully Agentic Orchestrator—five specialized engines working in harmony:
1. 👁️ The "Eyes" — Mistral OCR + Upstash Redis
Standard text extraction fails on government forms (tables, checkboxes, footnotes). We use Mistral OCR for high-fidelity markdown extraction.
Optimization: Raw extracted data is cached immediately in Upstash Redis. Re-uploads hit cache instead of API → 90% cost reduction on repeat processing.
2. 🧩 The "Memory" — Parent-Child Chunking (Upstash Vector)
We implemented a Parent-Child Indexing Strategy to solve the "lost context" problem:
┌─────────────────────────────────────────┐
│ PARENT CHUNK │
│ (Full context in Redis) │
│ ┌─────────┐ ┌─────────┐ ┌─────────┐ │
│ │ Child 1 │ │ Child 2 │ │ Child 3 │ │
│ │(Vector) │ │(Vector) │ │(Vector) │ │
│ └─────────┘ └─────────┘ └─────────┘ │
└─────────────────────────────────────────┘
- Child Chunks: Small, dense snippets embedded in Upstash Vector for precise retrieval
- Parent Chunks: Full surrounding context stored in Upstash Redis
Result: Vector search finds the match → retrieves full parent context → AI gets complete picture → zero hallucinations from missing context
3. 🧠 The "Brain" — Gemini 3 Flash (Agentic Orchestrator)
This is where the magic happens. Gemini 3 Flash serves as the central decision-maker:
- Dynamic Reranking: Specific query? → Tighten
top-k, rerank for precision - Legal Reasoning: Massive context window holds multiple laws simultaneously, reasons through conflicts ("Rule A says X, but Exception B applies here...")
4. 🔊 The "Voice" — ElevenLabs Streaming
For stressful topics like immigration, robotic voices add anxiety. We use ElevenLabs Streaming API via WebSockets for human-quality, empathetic responses.
Latency target: <400ms (feels like a phone call, not a command line)
🌐 Multi-Language & Voice Cloning Support
ElevenLabs natively provides multi-language capabilities and voice cloning options—all built into the platform:
| Feature | What It Offers |
|---|---|
| 🗣️ Multi Language | Spanish, Mandarin, Hindi, Arabic, French, Portuguese, and more—users can select their preferred language |
| 🎭 Voice Cloning | Create custom voice profiles or choose from diverse voice styles (warm, professional, youthful, etc.) |
| ⚙️ Easy Configuration | Simply specify the desired language and voice in the ElevenLabs dashboard or API parameters |
For users: Select your language preference and voice style directly through ElevenLabs configuration. Whether you need guidance in Hindi, Spanish, or Mandarin—with a voice that feels familiar and comforting—it's just a setting away.
5. 📚 The "Tutor" — Adaptive Logic Engine
We implemented the Ebbinghaus Forgetting Curve algorithm to track mastery:
Effective Mastery (M) = S × e^(-t/h)
Where:
S = stored memory strength (0-1)
t = time since last review (hours)
h = retention half-life (varies per topic)
In practice:
- User learns "Form N-648 requires medical certification"
- System stores:
{fact_id: "n648-medical", strength: 0.9, last_review: timestamp} - After 48 hours, mastery decays below threshold (0.6)
- System triggers: "Quick quiz! What document proves a disability exemption?"
🛠️ Tech Stack
| Layer | Technology |
|---|---|
| Frontend | Tankstack start, TailwindCSS, Shadcn/ui |
| Voice I/O | ElevenLabs WebSocket Streaming |
| OCR | Mistral OCR API |
| Vector DB | Upstash Vector (with native embeddings) |
| Cache/KV | Upstash Redis |
| LLM | Google Gemini 3 Flash |
| Framework | Vercel AI SDK |
🚧 Challenges We Faced
⏱️ Latency vs. Accuracy
Voice interfaces need <500ms response, but OCR + Reranking take time.
Solution: Upstash Native Vector Embeddings (skip external API hop) + aggressive Redis caching
📄 PDF Complexity
Government PDFs are messy. PyPDF failed on tables completely.
Solution: Mistral OCR—game-changer for tables, checkboxes, footnotes
⚠️ Hallucinations
In law, "mostly right" = completely wrong.
Solution: Parent-Child retrieval forces LLM to read full context before answering. Hallucination rate dropped from ~15% to <2%
🏅 Accomplishments We're Proud Of
✅ Successfully ingesting 116-page dense legal PDF with accurate answers about specific form numbers (N-648, I-485, etc.)
✅ Building a truly Multimodal Pipeline: Vision (OCR) → Audio (ElevenLabs) → Text (Gemini) flowing seamlessly
✅ The "Smart Memory Cluster": Transforming a passive chatbot into an active tutor that tracks your learning curve over time
✅ Sub-400ms voice response latency despite complex retrieval pipeline
📚 What We Learned
"Context is King" Gemini's large context window enables patterns that weren't possible before. Sometimes letting the model read the whole "Parent" chunk yields better reasoning than atomic chunking.
"The Power of Agentic RAG" Giving the LLM control over its own retrieval parameters (dynamic k, reranking strategies) is vastly superior to hard-coded pipelines.
"Cache Everything" Upstash Redis isn't just for sessions—it's the glue that makes real-time AI affordable.
🏢 Beyond Immigration: Industry Applications
The same Agentic AI architecture that powers immigration guidance can transform any industry drowning in complex documentation. Here's how:
💼 Enterprise Use Cases
| Industry | Use Case | Pain Point Solved |
|---|---|---|
| 🛒 Sales Enablement | New reps learn 500-page product catalogs in days, not months | "What's the difference between Enterprise and Pro tier?" → Instant answer with pricing context |
| 📈 Marketing Onboarding | Brand guidelines, competitor analysis, campaign playbooks | Onboard new marketers without 20 hours of reading; voice-query competitor positioning |
| 🏦 Banking & Finance | Regulatory compliance (Basel III, SOX, AML) | Compliance officers quiz themselves before audits; reduce violation risk |
| ⚕️ Healthcare | Clinical protocols, drug interactions, HIPAA training | Nurses voice-query treatment guidelines mid-shift; zero time wasted searching PDFs |
| 🏭 Manufacturing | Safety manuals, ISO standards, equipment SOPs | Floor workers ask "What's the lockout procedure for Machine 7?" hands-free |
| 🏠 Real Estate | Property laws, contract clauses, market reports | Agents explain complex clauses to buyers in plain language, backed by legal source |
| 📖 Education | Textbooks, course materials, exam prep | Students get voice-tutored through dense academic content with spaced repetition |
🎯 Deep Dive: Sales & Marketing Teams
The Problem: New sales reps face information overload:
- 200+ page product documentation
- Pricing matrices with dozens of variables
- Competitor battle cards updated monthly
- Objection handling scripts
Reality: Reps take 3-6 months to become productive. Meanwhile, they give wrong answers and lose deals.
The Global Pathfinder Solution:
┌─────────────────────────────────────────────────────────────┐
│ SALES ENABLEMENT MODE │
├─────────────────────────────────────────────────────────────┤
│ │
│ 📄 INGEST │
│ ├── Product Catalog (PDF) │
│ ├── Pricing Guide (Excel → PDF) │
│ ├── Competitor Analysis (Slides → PDF) │
│ └── Objection Handling Playbook │
│ │
│ 🎙️ REAL-TIME QUERIES │
│ ├── "What's our edge over Competitor X on security?" │
│ ├── "Can we offer 20% discount on 3-year Enterprise?" │
│ └── "How do I handle the 'too expensive' objection?" │
│ │
│ 🧠 SMART COACHING │
│ ├── Pre-call briefings: "You're meeting Acme Corp..." │
│ ├── Post-call quizzes: "What's our SLA guarantee?" │
│ └── Weakness detection: "You struggle with pricing Q's" │
│ │
└─────────────────────────────────────────────────────────────┘
Projected ROI:
| Metric | Before | After | Impact |
|---|---|---|---|
| Ramp Time | 4.5 months | 6 weeks | 📉 70% faster |
| Answer Accuracy | 67% | 94% | 📈 40% improvement |
| Deal Win Rate | 22% | 31% | 💰 41% increase |
🏥 Deep Dive: Healthcare Compliance
The Problem:
- Nurses reference 400+ clinical protocols
- HIPAA violations cost $50,000+ per incident
- Paper manuals are outdated before they're printed
The Solution:
| Feature | Application |
|---|---|
| Voice-First Interface | Query protocols hands-free during patient care |
| Smart Memory Quizzes | Automated compliance refreshers before certification expires |
| Citation Tracking | Every answer links to official protocol version + page |
Example Interaction:
👩⚕️ "What's the isolation protocol for suspected TB?"
🤖 "Per Protocol IC-2024-017, Section 4.2: Airborne precautions required. N95 respirator, negative pressure room, door closed. Would you like me to read the full checklist?"
🎯 Why This Architecture Scales Across Industries
The same 5-engine system adapts to any domain:
| Engine | Immigration | Sales | Healthcare |
|---|---|---|---|
| 👁️ Eyes (OCR) | Visa forms | Product specs | Clinical protocols |
| 🧩 Memory (Vectors) | Legal clauses | Pricing tiers | Drug interactions |
| 🧠 Brain (Gemini) | Exception analysis | Objection handling | Symptom reasoning |
| 🔊 Voice (ElevenLabs) | Interview prep | Call coaching | Hands-free queries |
| 📚 Tutor (Spaced Rep) | Citizenship test | Product certification | Compliance renewal |
🚀 The Bigger Vision
"Any organization with complex documentation and high stakes for getting it wrong is our customer."
From the immigrant studying for their citizenship test → to the sales rep closing a million-dollar deal → to the nurse administering critical care—everyone deserves an AI that truly understands their documents and teaches them proactively.
🔗 Try It
- Live Demo: https://www.youtube.com/watch?v=-cm42Ldveh8
- GitHub: https://github.com/jacksonkasi1/echo-learn
*Built with ❤️ for everyone chasing a better life in a new country—and beyond.*t
Built With
- bun
- elevenlabs
- gemini
- github
- markdown
- mistral
- rag
- react
- tailwindcss
- turborepo
- typescript
- upstash
Log in or sign up for Devpost to join the conversation.