Resume Buddy - Project Story

Inspiration

As a solo developer frustrated with my own job search, I watched my carefully crafted resume get auto-rejected by ATS systems. After spending hours tailoring applications only to never hear back, I discovered that 75% of resumes never reach human eyes. But even when my resume did pass ATS, I struggled in interviews - I couldn't articulate my achievements confidently, and I had no idea which skills employers actually cared about.

I realized the problem wasn't just resume optimization - it was the entire job search journey: understanding what to highlight on your resume, finding the right jobs that match your skills, and practicing how to talk about your experience convincingly. I decided to build Resume Buddy to decode this entire black box - from ATS screening to semantic job matching to AI-powered interview preparation - using AI to guide job seekers through every step.

What it does

Resume Buddy is my attempt to democratize resume optimization using AI and graph databases. Built entirely as a solo project on a shoestring budget, it provides enterprise-level analysis for individual job seekers:

AI-Powered Line Analysis: Evaluates every resume line for ATS compatibility and impact, scoring credibility (STRONG/MODERATE/WEAK) and value (EXCELLENT to LOW)
O*NET Integration: Maps your experience to 900+ standardized occupations that recruiters search for, revealing hidden career paths and transferable skills
Skill Gap Visualization: Shows exactly which keywords you're missing for target roles through interactive radar charts and heatmaps
Smart Job Matching: Uses NVIDIA NV-Embed-v2 (1024-dimensional vectors) for semantic similarity, achieving 3.2x better match rates than keyword search
Real-time Market Intelligence: Analyzes live job postings to identify in-demand skills, trending technologies, and market gaps
AI Interview Practice : Daily personalized coaching powered by NVIDIA Nemotron Nano 8B that targets YOUR specific weaknesses. Instead of generic interview prep, it:
- Asks targeted questions about YOUR resume experiences
- Evaluates answers using STARS framework (Situation, Task, Action, Result)
- Provides real-time voice interaction with Whisper STT/TTS
- Adapts follow-up questions based on answer quality (multi-turn conversations)
- Sends scheduled email reminders for daily practice rounds
- Costs ~$0.06 per 3-round session (vs hundreds for human coaching)

How we built it

As a solo developer with limited resources, I architected for maximum impact with minimal cost:

Budget-Optimized Stack (Oct 2024):

AWS Lightsail: $40/month for all compute (vs $500+ for traditional EC2)
Neon PostgreSQL: Data storage.
Neo4j Aura: Graph database for skill relationships
Redis on Lightsail: 2GB instance for vector storage
X.AI Grok API: More affordable than GPT-4 ($0.05 vs $0.15 per analysis)
RunPod Serverless: Pay-per-use GPU ($0.0002/sec only when needed)

Production Migration (Nov 2024):

Migrated to AWS EKS with Kubernetes orchestration
Integrated NVIDIA NIM microservices:
- NV-Embed-v2 for semantic embeddings (70% cheaper than OpenAI)
- Nemotron Nano 8B for interview reasoning (sub-200ms latency)
Deployed 6 microservices + 2 NIM containers across 2 namespaces
NGINX Ingress for routing, cross-namespace service mesh
GPU auto-scaling with spot instances (only pay when needed)

Architecture Decisions:

Chose Spring Boot for rapid development as a Java developer
Built three microservices to separate concerns and scale independently
Implemented async queues to handle long-running AI tasks
Created auto-cleanup jobs to minimize storage costs

Challenges we ran into

Budget Constraints: Couldn't afford always-on GPU instances. Solution: Implemented Kubernetes auto-scaling with spot instances for NVIDIA NIM pods - only pay for GPU when generating embeddings or conducting interviews.

Solo Development Overhead: Managing backend, frontend, DevOps, Kubernetes, and AI simultaneously. Solution: Focused on MVP features, documented everything in AI Code assist tools for context switching.

LLM Costs: Initial GPT-4 usage would cost $100+/day. Solution: Switched to NVIDIA Nemotron Nano 8B (70% cheaper), engineered precise prompts, implemented aggressive caching.

Vector Migration: OpenAI embeddings are 1536-dim, NV-Embed-v2 is 1024-dim. Solution: Rebuilt Redis HNSW index, re-vectorized 100K+ job listings.

Learning Curve: First time working with NVIDIA NIM, Neo4j, vector databases, and LLMs. Solution: Built incrementally, tested each component thoroughly before integration.

Cross-Namespace Communication: App services needed to call NIM services in different namespace. Solution: Kubernetes DNS with service discovery (embedding-service.nim-services.svc.cluster.local:8001).

Accomplishments that we're proud of

Built Entirely Solo: Full-stack platform with AI, graphs, vectors, microservices, and Kubernetes

Production Kubernetes Deployment: Multi-namespace EKS cluster with NVIDIA NIM, zero-downtime rolling updates

Ultra-Low Cost: Achieved $0.10/analysis with NVIDIA NIM (vs $0.50+ with OpenAI)

Real-Time Voice AI: WebSocket streaming interview with live transcription and multi-turn conversations

Self-Bootstrapped: No external funding, built while working full-time

Technical Learning: Mastered Kubernetes, NVIDIA NIM, Neo4j, vector search, and LLM orchestration from scratch

Real Users: Friends and colleagues using it for actual job applications and interview prep

What we learned

Constraints Drive Innovation: Limited budget forced creative solutions that ended up being better

NVIDIA NIM is Production-Ready: Easier than expected to deploy on Kubernetes, excellent documentation, significant cost savings

Kubernetes Abstracts Complexity: Once configured, deployments and scaling are trivial

Perfect is the Enemy of Shipped: Launching MVP early provided invaluable user feedback

AI Costs Add Up Fast: Careful prompt engineering and caching are essential for sustainability

Graphs Unlock Insights: Neo4j revealed skill relationships I never would have discovered

Vector Search Requires Tuning: HNSW index parameters matter for accuracy vs speed

Solo Developers Can Compete: With cloud services, Kubernetes, and NVIDIA NIM APIs, one person can build enterprise-level features

What's next for Resume Buddy

Immediate (Nov 2024):

Deploy enhanced NVIDIA NIM logging to production
Create demo video showcasing NIM integration
Submit to NVIDIA NIM hackathon

Short-term (Q1 2025):

Fine-tune Nemotron on interview feedback data for better questions
Migrate from OpenAI Whisper/TTS to NVIDIA Riva (full NVIDIA stack)
Implement GitHub Actions CI/CD for automated EKS deployments
Add GPU utilization and cost tracking dashboard

Medium-term (Q2 2025):

Integrate NVIDIA RAPIDS for GPU-accelerated document parsing
Multi-tenant Kubernetes with namespace-per-customer isolation
API marketplace for HR platforms

Why NVIDIA NIM Matters for Solo Developers:

Cost Efficiency: 70% cheaper than OpenAI for embeddings and reasoning
Performance: Sub-200ms latency enables real-time features
One-Stop Stack: Single vendor (NIM + Riva + RAPIDS) instead of juggling multiple AI providers
Community: Build.nvidia.com provides excellent docs, examples, and support

Built With

docker
docling
embeddings
fastapi
java
langgraph
neo4j
next.js
openai
postgresql
python
redis
runpod
spring
typescript

Updates

Wing chung YEUNG started this project — Nov 02, 2025 07:24 PM EST

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.