ShieldFlow

Picture of the real time dashboard

Inspiration

As AI systems become more powerful, they are increasingly exposed to real-world misuse—prompt injection attacks, accidental leakage of sensitive data, and unsafe model behavior. While building and observing LLM-powered apps, it became clear that most AI failures happen in real time, yet most security controls are reactive.

ShieldFlow was inspired by a simple question: What if AI systems had a live security and trust layer, just like networks do?
We wanted something that could protect people, organizations, and data without changing how developers already build agents.

What it does

ShieldFlow is a real-time security and observability layer for LLM applications and agents.

Every prompt and response is streamed through a Kafka pipeline and analyzed in Apache Flink to detect:

Prompt injection attempts
Sensitive data exposure (PII, secrets)
Unsafe or suspicious behavior

Each session maintains a dynamic trust score:

[ \text{Trust Score}_{t+1} = \text{Trust Score}_t - \sum \text{Risk Signals} ]

Based on this score, ShieldFlow automatically decides whether to:

✅ Allow
🟡 Mask
❌ Block

ShieldFlow integrates directly with CrewAI and LangChain, allowing developers to secure their agents by adding ShieldFlow as a drop-in guardrail—no major rewrites required.
All detections and actions are surfaced live in Datadog dashboards.

How we built it

Kafka (Confluent) streams prompts and agent responses as real-time events
Apache Flink performs stateful inspection, detection, and trust scoring
CrewAI & LangChain integrations allow ShieldFlow to intercept agent prompts and tool outputs
Python services simulate LLM and agent workflows
Datadog ingests metrics, logs, and traces for dashboards, alerts, and incident context

The system treats AI interactions as data in motion, enabling immediate security decisions with full observability.

Challenges we ran into

Designing a trust score that adapts without over-blocking legitimate agent behavior
Securing multi-step agent workflows across tools and memory
Balancing detection accuracy with real-time performance
Integrating observability without adding developer friction

Many existing tools focus on offline evaluation, so building a streaming-first, agent-compatible approach required careful tradeoffs.

Accomplishments that we're proud of

A real-time AI security pipeline built on streaming data
Stateful trust scoring per session and per agent
Seamless integration with CrewAI and LangChain
Clear, actionable Datadog dashboards for AI security signals

Most importantly, ShieldFlow proves that AI agents can be secured without slowing them down.

What we learned

AI agents amplify risk because they act autonomously
Streaming architectures are ideal for AI safety and governance
Observability is essential for trust, debugging, and compliance
Simple, well-placed guardrails can prevent serious real-world harm

What's next for ShieldFlow

ML-based classifiers for more nuanced risk detection
Policy-as-code for agent and enterprise AI governance
Native Vertex AI / Gemini enforcement hooks
Automated incident creation and remediation workflows

ShieldFlow is just the beginning of real-time defense for autonomous AI systems.

Built With

crewai
datadog
flink
gemini
kafka
langchain
python

Updates

oluwaseyifunmi oshinfowokan started this project — Dec 31, 2025 12:38 PM EST

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.