AyushAudit AI — The Story

Inspiration

India's PM-JAY insures 500M+ people yet loses an estimated ₹2,000 Cr/year to fraud. A rule-based auditor felt solvable with RAG.

$$\text{Audit} = \text{RAG}(\text{Guidelines}) \xrightarrow{\text{LLM}} P(\text{fraud}) \gtrless \tau$$

Guidelines → chunked → FAISS index → top-$k$ rules retrieved per claim → LLM judges.

Unity Catalog replaced DBFS silently — every path assumption broke
hive_metastore is deprecated; /tmp is ephemeral; Gradio beats Streamlit inside notebooks
Heuristics are a 必须 fallback when APIs fail

Problem	Fix
DBFS disabled	UC Volumes `/Volumes/...`
`main` catalog missing	Discovered workspace via `SHOW CATALOGS`
LLM unavailable on free tier	Sarvam AI + heuristic fallback
Scanned PDFs	Tesseract OCR at $300$ DPI