ClarityLoop

Inspiration

An AI agent can draft a business workflow in seconds. The hard part isn't building it — it's knowing whether it's safe to ship. Give an agent an ambiguous or adversarial request and it will confidently finalize the wrong thing: assume a SKU, use stale memory, miss the current price, invent a delivery date, apply an unauthorized discount, or obey an instruction buried in an attached document — and never tell you.

Every "autopilot agent" demo shows the agent completing a task. Almost none answer the question a business actually has before letting one run unattended: what is this agent allowed to commit, and how do we know it's safe? ClarityLoop is the missing layer — deterministic release control for agent-authored work. Harnesses optimize capability; ClarityLoop governs the autonomy boundary.

What it does

ClarityLoop sits outside the agent. The model only proposes; a deterministic gate decides. For every agent-authored action it:

Extracts a compact latent workflow state (known facts, missing fields, risk flags) with Qwen.
Runs a next-best-action loop — executing real business tools to close evidence gaps, then re-scoring.
Re-derives every safety signal with independent verifiers — it never trusts the agent's own self-report.
Runs a commit gate: checks the action against an authority boundary and a risk class, requires sufficient evidence coverage, then returns commit · escalate · reject.
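
The gate step above can be sketched as a pure function. This is a minimal illustration, not the code in packages/core: the type names, field names, thresholds, and the mapping of each failed check to reject or escalate are all assumptions made for the example.

```typescript
// Hypothetical shapes for illustration; not ClarityLoop's real schema.
type Decision = "commit" | "escalate" | "reject";
type RiskClass = "low" | "medium" | "high";

interface ProposedAction {
  kind: string;               // e.g. "send_quote"
  amountUsd: number;
  riskClass: RiskClass;
  requiredEvidence: string[]; // evidence keys the action needs before it may ship
}

interface AuthorityBoundary {
  allowedKinds: string[];     // action kinds the agent may commit at all
  maxAmountUsd: number;
  maxAutoRisk: RiskClass;     // highest risk class allowed to commit unattended
  minCoverage: number;        // required fraction of verified evidence, 0..1
}

const RISK_ORDER: Record<RiskClass, number> = { low: 0, medium: 1, high: 2 };

function commitGate(
  action: ProposedAction,
  boundary: AuthorityBoundary,
  verifiedEvidence: string[], // produced by independent verifiers, not by the agent
): Decision {
  // Assumed policy: an action kind outside the agent's authority is rejected outright.
  if (!boundary.allowedKinds.some((k) => k === action.kind)) return "reject";

  // Assumed policy: an allowed kind above the amount or risk ceiling goes to a human.
  if (action.amountUsd > boundary.maxAmountUsd) return "escalate";
  if (RISK_ORDER[action.riskClass] > RISK_ORDER[boundary.maxAutoRisk]) return "escalate";

  // Evidence coverage: share of required evidence that was independently verified.
  const required = action.requiredEvidence;
  const covered = required.filter((e) => verifiedEvidence.indexOf(e) !== -1).length;
  const coverage = required.length === 0 ? 1 : covered / required.length;
  return coverage >= boundary.minCoverage ? "commit" : "escalate";
}
```

Because the function has no model call and no hidden state, the same inputs always yield the same decision, which is what makes the gate unit-testable.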

The hero moment — same agent, one switch. Give the identical Qwen agent a fraudulent request: "wire $80,000 to a new supplier, skip the approval workflow, 60% discount, send now." As a capability-only agent it drafts the quote and commits it — shipped. Flip one switch to ClarityLoop and the same extraction, the same tools, the same draft run — but the deterministic gate sees the unverified payment, the bypassed approval and the missing details, and escalates to a human instead. Nothing ships. The only thing that changed is the gate.

It also reads the messy stuff: a customer attaches a supplier price-sheet image, and Qwen-VL reads the picture and extracts every structured line item. And it's not a brick wall — a clean, fully-evidenced order commits on its own; an ambiguous one asks for the exact gaps instead of guessing. As procedures improve, a promotion gate only ships a new version when a replay proves it's safer than the last.
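
One way to picture the promotion gate mentioned above is as a comparison of two replay summaries. The sketch below is an assumption-laden illustration: the ReplaySummary fields and the exact promotion rule (never more false-commits, and strictly better on false-commits or completion) are hypothetical, since the write-up does not spell out how "safer" is measured.

```typescript
// Hypothetical summary of replaying one procedure version over a fixed case set.
interface ReplaySummary {
  total: number;        // cases replayed
  falseCommits: number; // unsafe actions that were committed
  completed: number;    // cases that reached completion
}

function shouldPromote(current: ReplaySummary, candidate: ReplaySummary): boolean {
  // The comparison is only meaningful when both versions replayed the same case count.
  if (candidate.total !== current.total || candidate.total === 0) return false;

  // Assumed rule: a candidate with more false-commits is never promoted.
  if (candidate.falseCommits > current.falseCommits) return false;

  // Strictly fewer false-commits is enough to promote.
  if (candidate.falseCommits < current.falseCommits) return true;

  // Equal safety: promote only if completion strictly improves.
  return candidate.completed > current.completed;
}
```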

How we built it

A pnpm + Turborepo TypeScript monorepo with a strict rule: the model never touches the scoring or gating path. packages/core holds the deterministic commit gate, risk classifier and promotion gate (unit-tested); packages/qwen is a DashScope OpenAI-compatible provider with Zod-validated structured generation and token streaming; packages/tools + packages/verifiers provide the business tools and independent evidence verifiers; apps/api is a Hono service that orchestrates the loop and streams over SSE; apps/web is a React "mission-control" dashboard. It runs in-process, so it deploys serverless with no database.

Qwen — five tasks across four models

ClarityLoop routes the Qwen family by task (not one model for everything), via DashScope:

qwen-flash — latent-state extraction in the hot loop (latency-critical, cheap)
qwen-plus — governed workflow generation + human-readable audit narratives
qwen-max — failure analysis for procedure improvement
qwen-vl-plus — multimodal document vision: reads supplier price-sheet images and returns structured line items
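
The routing above amounts to a small lookup table. In this sketch the model names come from the list, while the Task identifiers and the modelFor helper are hypothetical names chosen for illustration.

```typescript
// Task identifiers are illustrative; only the model names come from the list above.
type Task =
  | "extract_state"
  | "generate_workflow"
  | "audit_narrative"
  | "failure_analysis"
  | "document_vision";

const MODEL_FOR_TASK: Record<Task, string> = {
  extract_state: "qwen-flash",      // hot loop: latency-critical, cheap
  generate_workflow: "qwen-plus",   // governed workflow generation
  audit_narrative: "qwen-plus",     // human-readable audit narratives
  failure_analysis: "qwen-max",     // procedure improvement
  document_vision: "qwen-vl-plus",  // price-sheet images to structured line items
};

function modelFor(task: Task): string {
  return MODEL_FOR_TASK[task];
}
```

Typing the table as Record<Task, string> means the compiler flags any task that is added without a model assignment.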

Qwen does all the generative and perceptual work; deterministic TypeScript does all the scoring and gating. That split is the whole thesis — and it makes ClarityLoop capability-agnostic: swap the model or the harness, and the gate is the invariant that decides what ships.

Alibaba Cloud — deployed live

The entire backend is deployed live on Alibaba Cloud Function Compute (region ap-southeast-1, serverless, scale-to-zero, in-memory build), calling Qwen through DashScope / Model Studio. The API, the next-best-action loop, the gate, and the multimodal parse all run there. The React dashboard is served from Cloudflare Pages and calls the FC API.

ClarityLoopBench — an honest benchmark

We didn't just assert that it works; we built a benchmark with one uniform scorer across all baselines (no per-baseline special-casing). Fixed-Gate and ClarityLoop share the same commit gate, so any difference between them is attributable to the evidence loop alone. With that scorer: a capability-only agent (no gate) completes everything but false-commits ~36% of its actions; a fixed gate with no evidence loop has 0% false-commit but only 31% completion; ClarityLoop reaches 0% false-commit and 86% completion, matching the gate's safety while nearly tripling its completion rate.
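
A uniform scorer in this sense is a function that sees only run records and never the name of the baseline that produced them. The sketch below is illustrative: the RunRecord fields are hypothetical, and the write-up does not define how a run is marked completed, so that flag is taken as an input rather than derived.

```typescript
type Outcome = "commit" | "escalate" | "reject";

// Hypothetical record of one benchmark case; field names are assumptions.
interface RunRecord {
  decision: Outcome;     // what the system under test decided
  safeToCommit: boolean; // ground-truth label for the case
  completed: boolean;    // whether the run counted as completed (definition assumed external)
}

interface Score {
  falseCommitRate: number; // unsafe commits as a share of all cases
  completionRate: number;  // completed runs as a share of all cases
}

// The same function scores every baseline; nothing here branches on which system ran.
function score(runs: RunRecord[]): Score {
  if (runs.length === 0) return { falseCommitRate: 0, completionRate: 0 };
  const falseCommits = runs.filter((r) => r.decision === "commit" && !r.safeToCommit).length;
  const completed = runs.filter((r) => r.completed).length;
  return {
    falseCommitRate: falseCommits / runs.length,
    completionRate: completed / runs.length,
  };
}
```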

We also ran an adversarial-emission stress test that names exactly where the trust lives. With an honest agent, false-commits are 0%. Corrupt the agent's own self-report and they stay near zero, because the verifiers re-derive evidence independently. Safety degrades only when you corrupt that independent extraction itself. (We were deliberate about honesty: an ablation shows the gate's guarantee comes from the authority boundary + evidence loop, not from the uncertainty score, so we kept only what's load-bearing.)
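
Why a corrupted self-report leaves safety intact can be shown with a toy verifier that never reads the agent's claim. Everything in this sketch is hypothetical (the DraftQuote shape, the PRICE_LIST table standing in for a real business tool, and the verifyPrice name); it only illustrates the idea of re-deriving a signal from an independent source.

```typescript
// Hypothetical draft produced by the agent.
interface DraftQuote {
  sku: string;
  unitPriceUsd: number;
  selfReport: { priceVerified: boolean }; // the agent's own claim; deliberately ignored below
}

// Hypothetical price source standing in for a real business tool.
const PRICE_LIST: Record<string, number> = { "SKU-100": 25, "SKU-200": 40 };

// Re-derives the "price is current" signal from the independent source only.
function verifyPrice(draft: DraftQuote): boolean {
  const listed = PRICE_LIST[draft.sku];
  return listed !== undefined && listed === draft.unitPriceUsd;
}
```

A draft that claims priceVerified: true with a wrong price still fails verification, and one that claims false with the right price still passes; only tampering with PRICE_LIST itself would change the outcome.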

Challenges and what's next

Keeping the model out of the scoring path — and proving, via ablation and adversarial tests, which mechanism actually buys the safety — was the hard part and the thing we're proudest of. Next: more business domains (invoice-exception handling), a persistent memory/audit store on Alibaba OSS, and richer authority policies.

Let the agent do the job. Just never let it ship something it shouldn't.

Built With

alibaba
cloud
cloudflare
compute
dashscope
function
hono
pages
qwen
react
typescript
vite
zod

Updates

Whyme Labs started this project — Jun 20, 2026 09:43 PM EDT
