Inspiration:
Securing AI agents against adversarial exploits like prompt injections and PII leaks.
What it does:
SecuRabbit Swarm performs parallel red-teaming of AI agents using a multi-agent "hacker swarm" in isolated environments.
How we built it:
We used Daytona for sandbox orchestration, Sentry for real-time security monitoring, and CodeRabbit to harden code based on detected vulnerabilities.
Challenges:
Synchronizing parallel sandbox states and parsing non-deterministic LLM evaluation verdicts.
Accomplishments:
An automated, end-to-end security pipeline that identifies and proposes fixes for vulnerabilities in minutes.
What we learned:
Isolated sandboxing is critical for safely executing adversarial tests without risking production data.
What's next:
Expanding our "Attack Zone" categories and building a deeper CI/CD loop with CodeRabbit to auto-remediate identified security flaws.
Built With
- coderabbit
- daytona
- google-adk
- python
- sentry
- streamlit

Log in or sign up for Devpost to join the conversation.