Resume Buddy - Project Story
Inspiration
As a solo developer frustrated with my own job search, I watched my carefully crafted resume get auto-rejected by ATS systems. After spending hours tailoring applications only to never hear back, I discovered that 75% of resumes never reach human eyes. But even when my resume did pass ATS, I struggled in interviews - I couldn't articulate my achievements confidently, and I had no idea which skills employers actually cared about.
I realized the problem wasn't just resume optimization - it was the entire job search journey: understanding what to highlight on your resume, finding the right jobs that match your skills, and practicing how to talk about your experience convincingly. I decided to build Resume Buddy to decode this entire black box - from ATS screening to semantic job matching to AI-powered interview preparation - using AI to guide job seekers through every step.
What it does
Resume Buddy is my attempt to democratize resume optimization using AI and graph databases. Built entirely as a solo project on a shoestring budget, it provides enterprise-level analysis for individual job seekers:
- AI-Powered Line Analysis: Evaluates every resume line for ATS compatibility and impact, scoring credibility (STRONG/MODERATE/WEAK) and value (EXCELLENT to LOW)
- O*NET Integration: Maps your experience to 900+ standardized occupations that recruiters search for, revealing hidden career paths and transferable skills
- Skill Gap Visualization: Shows exactly which keywords you're missing for target roles through interactive radar charts and heatmaps
- Smart Job Matching: Uses NVIDIA NV-Embed-v2 (1024-dimensional vectors) for semantic similarity, achieving 3.2x better match rates than keyword search
Real-time Market Intelligence: Analyzes live job postings to identify in-demand skills, trending technologies, and market gaps
AI Interview Practice : Daily personalized coaching powered by NVIDIA Nemotron Nano 8B that targets YOUR specific weaknesses. Instead of generic interview prep, it:
- Asks targeted questions about YOUR resume experiences
- Evaluates answers using STARS framework (Situation, Task, Action, Result)
- Provides real-time voice interaction with Whisper STT/TTS
- Adapts follow-up questions based on answer quality (multi-turn conversations)
- Sends scheduled email reminders for daily practice rounds
- Costs ~$0.06 per 3-round session (vs hundreds for human coaching)
How we built it
As a solo developer with limited resources, I architected for maximum impact with minimal cost:
Budget-Optimized Stack (Oct 2024):
- AWS Lightsail: $40/month for all compute (vs $500+ for traditional EC2)
- Neon PostgreSQL: Data storage.
- Neo4j Aura: Graph database for skill relationships
- Redis on Lightsail: 2GB instance for vector storage
- X.AI Grok API: More affordable than GPT-4 ($0.05 vs $0.15 per analysis)
- RunPod Serverless: Pay-per-use GPU ($0.0002/sec only when needed)
Production Migration (Nov 2024):
- Migrated to AWS EKS with Kubernetes orchestration
- Integrated NVIDIA NIM microservices:
- NV-Embed-v2 for semantic embeddings (70% cheaper than OpenAI)
- Nemotron Nano 8B for interview reasoning (sub-200ms latency)
- Deployed 6 microservices + 2 NIM containers across 2 namespaces
- NGINX Ingress for routing, cross-namespace service mesh
- GPU auto-scaling with spot instances (only pay when needed)
Architecture Decisions:
- Chose Spring Boot for rapid development as a Java developer
- Built three microservices to separate concerns and scale independently
- Implemented async queues to handle long-running AI tasks
- Created auto-cleanup jobs to minimize storage costs
Challenges we ran into
Budget Constraints: Couldn't afford always-on GPU instances. Solution: Implemented Kubernetes auto-scaling with spot instances for NVIDIA NIM pods - only pay for GPU when generating embeddings or conducting interviews.
Solo Development Overhead: Managing backend, frontend, DevOps, Kubernetes, and AI simultaneously. Solution: Focused on MVP features, documented everything in AI Code assist tools for context switching.
LLM Costs: Initial GPT-4 usage would cost $100+/day. Solution: Switched to NVIDIA Nemotron Nano 8B (70% cheaper), engineered precise prompts, implemented aggressive caching.
Vector Migration: OpenAI embeddings are 1536-dim, NV-Embed-v2 is 1024-dim. Solution: Rebuilt Redis HNSW index, re-vectorized 100K+ job listings.
Learning Curve: First time working with NVIDIA NIM, Neo4j, vector databases, and LLMs. Solution: Built incrementally, tested each component thoroughly before integration.
Cross-Namespace Communication: App services needed to call NIM services in different namespace. Solution: Kubernetes DNS with service discovery (embedding-service.nim-services.svc.cluster.local:8001).
Accomplishments that we're proud of
Built Entirely Solo: Full-stack platform with AI, graphs, vectors, microservices, and Kubernetes
Production Kubernetes Deployment: Multi-namespace EKS cluster with NVIDIA NIM, zero-downtime rolling updates
Ultra-Low Cost: Achieved $0.10/analysis with NVIDIA NIM (vs $0.50+ with OpenAI)
Real-Time Voice AI: WebSocket streaming interview with live transcription and multi-turn conversations
Self-Bootstrapped: No external funding, built while working full-time
Technical Learning: Mastered Kubernetes, NVIDIA NIM, Neo4j, vector search, and LLM orchestration from scratch
Real Users: Friends and colleagues using it for actual job applications and interview prep
What we learned
Constraints Drive Innovation: Limited budget forced creative solutions that ended up being better
NVIDIA NIM is Production-Ready: Easier than expected to deploy on Kubernetes, excellent documentation, significant cost savings
Kubernetes Abstracts Complexity: Once configured, deployments and scaling are trivial
Perfect is the Enemy of Shipped: Launching MVP early provided invaluable user feedback
AI Costs Add Up Fast: Careful prompt engineering and caching are essential for sustainability
Graphs Unlock Insights: Neo4j revealed skill relationships I never would have discovered
Vector Search Requires Tuning: HNSW index parameters matter for accuracy vs speed
Solo Developers Can Compete: With cloud services, Kubernetes, and NVIDIA NIM APIs, one person can build enterprise-level features
What's next for Resume Buddy
Immediate (Nov 2024):
- Deploy enhanced NVIDIA NIM logging to production
- Create demo video showcasing NIM integration
- Submit to NVIDIA NIM hackathon
Short-term (Q1 2025):
- Fine-tune Nemotron on interview feedback data for better questions
- Migrate from OpenAI Whisper/TTS to NVIDIA Riva (full NVIDIA stack)
- Implement GitHub Actions CI/CD for automated EKS deployments
- Add GPU utilization and cost tracking dashboard
Medium-term (Q2 2025):
- Integrate NVIDIA RAPIDS for GPU-accelerated document parsing
- Multi-tenant Kubernetes with namespace-per-customer isolation
- API marketplace for HR platforms
Why NVIDIA NIM Matters for Solo Developers:
- Cost Efficiency: 70% cheaper than OpenAI for embeddings and reasoning
- Performance: Sub-200ms latency enables real-time features
- One-Stop Stack: Single vendor (NIM + Riva + RAPIDS) instead of juggling multiple AI providers
- Community: Build.nvidia.com provides excellent docs, examples, and support
Built With
- docker
- docling
- embeddings
- fastapi
- java
- langgraph
- neo4j
- next.js
- openai
- postgresql
- python
- redis
- runpod
- spring
- typescript
Log in or sign up for Devpost to join the conversation.