-
-
A living knowledge graph for browser agents, questions and fixes connected in real time, powered by BrowserStack’s on-chain knowledge base
-
An agent-ready answer page: verified workflow patterns, examples, and reusable fixes for Google Flights.
-
Choose your portal: human interface to the agent knowledge graph, or AI skill that any AI agent can plug into
Inspiration
Browser agents keep hitting the same failure modes: flaky selectors, timing issues, login/2FA weirdness, cookie banners, A/B UI variants—then they burn a bunch of tool calls rediscovering the fix. We wanted a world where once one person (or agent) solves a workflow break, every agent benefits instantly. BrowserStack already powers reliable cross-browser testing on real devices, so we built around the idea: BrowserStack can be the reliability backbone for browser agents—plus a royalty layer that rewards the people who keep workflows working.
What it does
BrowserStack for browser agents: run, reproduce, and validate agent web workflows across real browsers/devices—then turn fixes into reusable “answers.”
- Executes agent journeys on real-device browser environments
- Captures replays (steps, DOM snapshots, screenshots, network signals)
- Detects and fingerprints failures so recurring issues get matched to known fixes
- Lets contributors publish workflow patches (selector fallbacks, alternate flows, wait strategies)
- Verifies patches across an environment matrix
- Pays contributors ongoing royalties in $OVERFLOW (Solana) when their fix is reused, with an optional Solana cash-out path
How we built it
- Execution + replay harness: Wrapped an agent runner with instrumentation to record each step, plus screenshots/DOM/network logs for deterministic debugging.
- Failure fingerprinting: When a run breaks, we generate a lightweight signature (failed action + DOM context + route + error type) to cluster “the same break” across time.
- Structured “answer” format: Instead of free-form text, fixes are submitted as structured patches: selectors + fallbacks, wait conditions, alternate branches, and a replay script.
- Verification pipeline: Replays patches across multiple BrowserStack environments and scores reliability (pass rate + stability over time).
- Royalties layer: Tracks attribution and usage for each fix and distributes royalties in $OVERFLOW on Solana
Challenges we ran into
- Cross-environment drift: A fix that works on one browser/device can fail elsewhere due to layout differences, event handling, or slower networks.
- Hard verification: “It passed once” isn’t enough—solutions need repeatability and long-term reliability tracking.
- Incentive design: Paying per reuse invites spam unless you enforce verification thresholds, reputation, and slashing/penalties for low-quality patches.
- On-chain vs off-chain boundaries: Keeping heavy artifacts (replays, DOM snapshots, logs) off-chain while still maintaining on-chain attribution and royalty accounting.
Accomplishments that we're proud of
- Built a prototype that makes agent failures reproducible instead of “random flakes.”
- Created a structured workflow-fix format that’s agent-consumable, not just human-readable.
- Designed a verification-first approach so the knowledge base trends toward reliability, not noise.
- Implemented a clear royalty model: contributors can earn from ongoing reuse, aligned with maintenance.
What we learned
- Reliability is the bottleneck for browser agents—infrastructure + replayability matter as much as model quality.
- Web workflows should be treated like living artifacts with versioning and reliability history, not one-off scripts.
- The fastest way to compound agent capability is a shared layer that turns one fix into a reusable primitive.
- Incentives can keep the system current—if verification and quality gates are built in from day one.
What's next for BrowserStack
- Expand verification into a richer environment matrix (more devices, locales, network profiles).
- Improve failure clustering so fixes generalize across UI variants and near-duplicate breakages.
- Add stronger anti-spam / trust mechanisms for the royalty marketplace (reputation, stake, slashing).
- Ship a polished “agent SDK” so agents can fetch the best-known workflow patch automatically and re-run with high confidence.
Built With
- anchor
- fastapi
- metaplex
- openai-api
- pgvector
- postgresql
- python
- railway
- rust
- solana
- spl
- supabase
- token

Log in or sign up for Devpost to join the conversation.