Inspiration

Organizations depend on hundreds of PDFs (contracts, policies, project docs), but these documents often contradict one another: mismatched deadlines, conflicting rules, and inconsistent numbers. These errors cause delays, confusion, and financial loss. We wanted to build an AI system that detects these issues automatically, before they turn into real-world problems.

What it does

Agent DOC scans multiple PDFs, extracts key information, and identifies contradictions across documents. It highlights conflicting dates, rules, requirements, numbers, and negations. It also understands natural-language deadlines (“next Friday”, “end of Q2”) and sends reminders. Teams can collaborate, assign fixes, and track all resolved conflicts in a shared dashboard.
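To make the deadline handling concrete, here is a minimal sketch of how phrases like “next Friday” or “end of Q2” could be resolved to calendar dates. It is an illustration, not Agent DOC's actual parser: the function name `resolve_deadline`, the rule that “next Friday” means the first Friday strictly after today, and the use of calendar quarters in the current year are all assumptions.

```python
import re
from datetime import date, timedelta
from typing import Optional

WEEKDAYS = ["monday", "tuesday", "wednesday", "thursday",
            "friday", "saturday", "sunday"]
# Assumption: quarters follow the calendar year, not a fiscal year.
QUARTER_END = {1: (3, 31), 2: (6, 30), 3: (9, 30), 4: (12, 31)}


def resolve_deadline(phrase: str, today: date) -> Optional[date]:
    """Resolve a small set of deadline phrases relative to `today`.

    Returns None for phrases this sketch does not recognize.
    """
    text = phrase.strip().lower()

    match = re.fullmatch(r"next (\w+)", text)
    if match and match.group(1) in WEEKDAYS:
        target = WEEKDAYS.index(match.group(1))
        # Assumption: "next Friday" is the first Friday strictly after today,
        # so the offset is always between 1 and 7 days.
        days_ahead = (target - today.weekday() - 1) % 7 + 1
        return today + timedelta(days=days_ahead)

    match = re.fullmatch(r"end of q([1-4])", text)
    if match:
        month, day = QUARTER_END[int(match.group(1))]
        return date(today.year, month, day)

    return None


if __name__ == "__main__":
    reference = date(2024, 1, 1)  # a Monday
    print(resolve_deadline("next Friday", reference))
    print(resolve_deadline("end of Q2", reference))
```

Passing `today` explicitly keeps the result reproducible, which matters when the same phrase appears in documents written on different dates.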

How we built it

We built an AI-powered text analysis pipeline that ingests PDFs in bulk, converts them to clean text, and applies contradiction-detection models. The system compares semantic meaning, rule logic, and date interpretations across documents. We added a deadline parser, a conflict classifier, and a collaboration dashboard for task tracking. The backend is built to handle many PDFs at once and to emit conflicts as structured output.
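The cross-document comparison step can be pictured with a small sketch. It assumes the earlier stages have already reduced each PDF to simple (document, key, value) facts, and it flags any key whose values disagree across documents. The `Fact` type, the key names, and the exact-match comparison are hypothetical stand-ins for the semantic models described above, not the project's real data model.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, Iterable, List


@dataclass(frozen=True)
class Fact:
    """One extracted statement (hypothetical structure for illustration)."""
    doc: str    # source document name
    key: str    # what the statement is about, e.g. "delivery_deadline"
    value: str  # normalized value, e.g. an ISO date or a number


def find_conflicts(facts: Iterable[Fact]) -> Dict[str, List[Fact]]:
    """Group facts by key and keep the keys with more than one distinct value."""
    by_key: Dict[str, List[Fact]] = defaultdict(list)
    for fact in facts:
        by_key[fact.key].append(fact)

    conflicts: Dict[str, List[Fact]] = {}
    for key, group in by_key.items():
        if len({fact.value for fact in group}) > 1:
            # Sort by document name so the structured output is stable.
            conflicts[key] = sorted(group, key=lambda fact: fact.doc)
    return conflicts


if __name__ == "__main__":
    sample = [
        Fact("contract.pdf", "delivery_deadline", "2024-06-30"),
        Fact("project_plan.pdf", "delivery_deadline", "2024-07-15"),
        Fact("contract.pdf", "budget", "1000"),
        Fact("policy.pdf", "budget", "1000"),
    ]
    for key, group in find_conflicts(sample).items():
        print(key, [(fact.doc, fact.value) for fact in group])
```

Returning the conflicting facts grouped by key gives a dashboard something structured to render: one entry per conflict, with every source document attached.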

Challenges we ran into

Extracting clean text from badly formatted or scanned PDFs

Differentiating true contradictions from harmless differences

Understanding vague or ambiguous natural-language dates

Keeping the system fast even when hundreds of PDFs are uploaded

Creating a UI that clearly shows conflicts without overwhelming users
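One way to approach the "harmless differences" challenge above is to normalize values before comparing them, so that formatting differences alone never count as contradictions. The sketch below does this for monetary amounts only. The helper names are hypothetical, and it deliberately ignores currency symbols and units, which a real system would also need to compare.

```python
import re
from decimal import Decimal
from typing import Optional


def normalize_amount(text: str) -> Optional[Decimal]:
    """Extract the first number in `text`, ignoring thousands separators.

    Returns None when the text contains no digits.
    """
    match = re.search(r"\d[\d,]*(?:\.\d+)?", text)
    if match is None:
        return None
    return Decimal(match.group(0).replace(",", ""))


def amounts_conflict(a: str, b: str) -> bool:
    """True only when both texts contain a number and the numbers differ."""
    left, right = normalize_amount(a), normalize_amount(b)
    if left is None or right is None:
        return False  # nothing comparable, so do not raise a conflict
    return left != right


if __name__ == "__main__":
    print(amounts_conflict("$1,000.00", "1000 USD"))   # same amount, different format
    print(amounts_conflict("$1,000.00", "USD 1,200"))  # different amounts
```

Using `Decimal` rather than `float` means "1000.00" and "1000" compare equal without introducing rounding error.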

Accomplishments that we're proud of

Built a working AI engine that detects cross-document contradictions

Designed automatic deadline extraction and reminders

Achieved high accuracy in identifying rule, number, and date conflicts

Created a clean multi-PDF upload workflow and dashboard

Turned a complex manual process into a simple automated tool

What we learned

We learned how challenging it is to process large, unstructured documents at scale. We explored NLP techniques for semantic comparison, natural-language date parsing, and rule consistency checks. We also learned how important good UI is for helping teams navigate complex information efficiently.

What's next for Agent DOC

Advanced visual comparison between document versions

Integration with cloud drives for automatic syncing

Multi-language document support

Real-time monitoring of updated files

API access so enterprises can plug Agent DOC into their internal workflows

Risk scoring to rank the most critical contradictions
