IterativeIR (RLHF)

IterativeIR is built as a direct extension of Protocol SIFT's existing agent architecture — the "Direct Agent Extension" track. No new MCP servers, no multi-agent frameworks. The insight was that the hardest problem wasn't capability, it was verification. Claude Code can already run Volatility and Plaso. The missing piece was a system that checked whether it had done so correctly.

What we learned

Accuracy in autonomous DFIR isn't a prompt engineering problem. A longer, more detailed system prompt does not reliably reduce hallucinations — it just makes the model more confident when it hallucinates. The only reliable fix is a feedback mechanism that makes wrong answers cost something. The hallucination penalty in the scorer is what does this: each unanchored claim deducts from the score the agent needs to stop looping, creating a direct incentive to run the tool rather than infer the answer.

Built With

protocolsift
pydantic
python
vectordb
vercel

Updates

Debabrata Pattnayak started this project — Jun 15, 2026 06:22 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.