Inspiration

Accounts Payable teams across industries struggle with manual invoice reconciliation, spending 40+ hours every month comparing invoices, purchase orders, and payment records. Iโ€™ve personally seen finance teams jumping between email, ERP, Excel sheets, and PDF viewers, often resulting in delays, payment errors, compliance risk, and vendor dissatisfaction.


What It Does

DocSamajh AI automates the entire Invoice โ†’ PO โ†’ Compliance reconciliation workflow:

  • Extracts structured fields from invoices and purchase orders (line items, vendor info, totals, taxes)
  • Performs automated 3-way matching with risk scoring
  • Detects discrepancies like quantity mismatch, over-billing, wrong tax, or unauthorized items
  • Supports batch processing for hundreds of documents
  • Creates audit trails for finance and compliance teams

How We Built It

Agent Responsibilities
๐Ÿ“„ Document Processor Agent ADE Parse & Extract โ†’ Structured JSON
๐Ÿค Reconciliation Specialist 3-way matching, risk evaluation, discrepancy detection
๐Ÿ›ก Compliance Auditor GST/VAT/tax validation & policy checks
๐Ÿ“Š Audit Logger Captures metadata, timestamps, confidence, user history

Challenges We Ran Into

Challenge Solution
Wide variations in invoice layouts ADE schema tuning + AI fuzzy matching
Long PDFs with multi-page line items Page splitting + async parallel extraction
Compliance rules differ by country/company Policy definition layer for custom rules
Speed vs Accuracy Async pipelines + caching + validation guards
Enterprise-grade auditability Metadata logs + CSV exports + traceable IDs

Accomplishments That We're Proud Of

  • ๐Ÿš€ Built a production-ready, agentic AI platform capable of parsing, reconciling, and validating complex financial documents
  • ๐Ÿ“„ Achieved 98%+ extraction accuracy with LandingAI ADE on real-world invoice formats

What We Learned

  • ๐Ÿ” Deep understanding of Accounts Payable workflows, invoice structure, POs, taxation, vendor data, line items, and compliance checks
  • ๐Ÿง  Designing and orchestrating Agentic AI systems with reasoning, tool calling, and inter-agent communication
  • ๐Ÿ“ Practical document intelligence using LandingAI ADE Parse + Extract APIs
  • โš™๏ธ Robust backend development: async I/O, error handling, retries, metadata logging
  • ๐Ÿงพ Implementing reconciliation logic handling real-world complexities (rounding differences, tax mismatches, missing POs)
  • ๐Ÿงช Measuring LLM confidence, risk scoring, and compliance scoring
  • ๐Ÿ“ˆ Designing AI with business ROI, adoption, and auditability in mind

What's Next for Marvix

Feature Description
๐Ÿ”Œ ERP Integrations SAP, Oracle NetSuite, QuickBooks, Xero connectors
๐Ÿค– Dispute Resolution Agent Auto-generate vendor communication for mismatches
๐ŸŒ Multi-Currency + Global Tax GST, VAT, withholding tax, EU/US compliance rules

Built With

  • agentic-document-extraction
  • landingai
  • openai-agents
  • pandas
  • python
Share this project:

Updates