Inspiration

As AI systems become increasingly integrated into businesses, they are exposed to new security threats such as prompt injection, jailbreaks, system prompt leakage, and adversarial attacks. While organizations are rapidly adopting LLMs and AI agents, most lack a dedicated security layer to protect them. This inspired us to build RedLockX, a real-time AI firewall designed to secure AI systems before malicious inputs can reach the model.

What it does

RedLockX is an AI-powered prompt injection firewall that sits between users and AI models. It analyzes prompts in real time and detects:

  • Prompt Injection Attacks
  • Jailbreak Attempts
  • System Prompt Extraction
  • Indirect Prompt Injections
  • Obfuscation & Encoding Attacks
  • Role-Play Escapes

Using a multi-layer detection engine, RedLockX provides a risk score, attack classification, detailed explanation, and a final ALLOW or BLOCK verdict before the prompt reaches the LLM.
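To make the shape of that output concrete, here is a minimal sketch of a verdict record. The field names, the 0-to-1 score scale, and the 0.5 blocking threshold are assumptions made for illustration; they are not taken from the RedLockX codebase.

```python
from dataclasses import dataclass
from enum import Enum


class Action(str, Enum):
    ALLOW = "ALLOW"
    BLOCK = "BLOCK"


@dataclass(frozen=True)
class Verdict:
    risk_score: float   # assumed scale: 0.0 (benign) to 1.0 (malicious)
    attack_type: str    # illustrative label, e.g. "jailbreak"
    explanation: str    # human-readable reason for the decision
    action: Action      # final ALLOW or BLOCK decision


def make_verdict(risk_score: float, attack_type: str, explanation: str,
                 block_threshold: float = 0.5) -> Verdict:
    """Build a verdict, blocking when the score reaches the (assumed) threshold."""
    if not 0.0 <= risk_score <= 1.0:
        raise ValueError("risk_score must be in [0, 1]")
    action = Action.BLOCK if risk_score >= block_threshold else Action.ALLOW
    return Verdict(risk_score, attack_type, explanation, action)
```

Keeping the verdict immutable means the record that is logged is the same record that was acted on.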

How we built it

We developed RedLockX using a modern full-stack architecture:

  • Frontend: React, TypeScript, Tailwind CSS
  • Backend: Vercel Serverless Functions, Express
  • Database: Supabase PostgreSQL
  • ML Models: Fine-tuned DeBERTa-v3 Transformer
  • Hybrid Detection: XGBoost + all-MiniLM Embeddings
  • Deployment: GitHub + Vercel CI/CD

The system uses a dual-model architecture: the fine-tuned DeBERTa-v3 classifier and the XGBoost classifier over all-MiniLM embeddings analyze each prompt in parallel. Their outputs are combined through a weighted decision engine to generate the final security verdict.
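One way such a weighted decision step could look is sketched below. The detectors are passed in as plain callables that return a probability of attack, and the 0.6 weight and 0.5 threshold are placeholder values chosen for the example, not the weights RedLockX actually uses.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Tuple

# Hypothetical detector interface: prompt in, P(malicious) in [0, 1] out.
Detector = Callable[[str], float]


def combine_scores(score_a: float, score_b: float, weight_a: float = 0.6) -> float:
    """Weighted average of two detector scores; weight_a is an assumed value."""
    return weight_a * score_a + (1.0 - weight_a) * score_b


def analyze(prompt: str, detector_a: Detector, detector_b: Detector,
            weight_a: float = 0.6, threshold: float = 0.5) -> Tuple[float, str]:
    """Run both detectors concurrently and return (risk score, verdict)."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        future_a = pool.submit(detector_a, prompt)
        future_b = pool.submit(detector_b, prompt)
        score = combine_scores(future_a.result(), future_b.result(), weight_a)
    return score, ("BLOCK" if score >= threshold else "ALLOW")
```

With stub detectors returning 0.9 and 0.2, the combined score is 0.6 * 0.9 + 0.4 * 0.2 = 0.62, which is above the example threshold and therefore blocked.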

Challenges we ran into

Building an effective AI security system came with several challenges:

  • Balancing detection accuracy with low-latency responses.
  • Detecting heavily obfuscated and encoded attacks.
  • Reducing false positives while maintaining strong protection.
  • Integrating multiple machine learning models into a unified pipeline.
  • Ensuring system reliability when cloud-hosted ML services become unavailable.
  • Designing explainable security decisions that users can understand and trust.
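As an illustration of the obfuscation challenge, the sketch below shows one common pre-processing approach: normalize Unicode tricks and surface Base64-encoded payloads so a downstream classifier sees the plain text as well. This is a generic technique under assumed parameters (for example, the 16-character minimum token length), not a description of RedLockX's actual pipeline.

```python
import base64
import binascii
import re
import unicodedata
from typing import List

# Runs of Base64 alphabet characters; the minimum length of 16 is an assumption.
_B64_TOKEN = re.compile(r"[A-Za-z0-9+/]{16,}={0,2}")


def normalize(text: str) -> str:
    """Fold compatibility characters, drop zero-width characters, lowercase."""
    text = unicodedata.normalize("NFKC", text)  # e.g. full-width letters to ASCII
    text = "".join(ch for ch in text if unicodedata.category(ch) != "Cf")
    return text.lower()


def decoded_views(text: str) -> List[str]:
    """Return the normalized prompt plus any Base64 payloads that decode to text."""
    views = [normalize(text)]
    for token in _B64_TOKEN.findall(text):
        try:
            decoded = base64.b64decode(token, validate=True).decode("utf-8")
        except (binascii.Error, UnicodeDecodeError):
            continue  # not valid Base64 or not UTF-8 text; ignore this token
        if decoded.isprintable():
            views.append(normalize(decoded))
    return views
```

Each returned view can then be scored separately, with the highest risk score taken as the result for the prompt.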

Accomplishments that we're proud of

  • Built a working end-to-end AI security platform rather than a simple prototype.
  • Successfully combined machine learning and cybersecurity techniques into a single solution.
  • Developed real-time detection for multiple categories of prompt injection attacks.
  • Created a deployable middleware architecture that can integrate with existing AI applications.
  • Implemented logging, analytics, and explainable security verdicts.
  • Designed RedLockX as a scalable foundation for future AI security products.
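The middleware idea can be sketched as a wrapper around an existing model call. Everything here is hypothetical: `guard`, `PromptBlocked`, and the `fail_closed` option are names invented for this example, and the choice to block when the checker itself fails is an assumption rather than documented RedLockX behavior.

```python
from typing import Callable


class PromptBlocked(Exception):
    """Raised when the firewall refuses to forward a prompt."""


def guard(llm_call: Callable[[str], str],
          check: Callable[[str], bool],
          fail_closed: bool = True) -> Callable[[str], str]:
    """Wrap llm_call so that check(prompt) must return True before it runs."""
    def wrapped(prompt: str) -> str:
        try:
            allowed = check(prompt)
        except Exception:
            # The checker is unavailable: block if fail_closed, otherwise allow.
            allowed = not fail_closed
        if not allowed:
            raise PromptBlocked("prompt rejected by firewall")
        return llm_call(prompt)
    return wrapped
```

An application would wrap its existing client once, for example `safe_llm = guard(call_model, is_safe)`, and use `safe_llm` wherever it previously called the model directly.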

What we learned

Through RedLockX, we gained valuable experience in:

  • AI and LLM Security
  • Prompt Injection Detection
  • Adversarial Attack Analysis
  • Machine Learning Model Integration
  • Full-Stack Development
  • Cloud Deployment and Scalability
  • API Design and Security Engineering
  • Building practical cybersecurity solutions for emerging AI threats

Most importantly, we learned that securing AI systems requires multiple layers of defense rather than relying solely on model-level safeguards.

What's next for RedLockX

Our vision is to transform RedLockX into a comprehensive AI Security Platform.

Autonomous Agent Security

As AI agents become capable of performing real-world actions, RedLockX will act as a security gateway for:

  • Autonomous AI Agents
  • Multi-Agent Systems
  • MCP Tool Integrations
  • Agent-to-Agent Communication

Sandbox Attack Simulation Environment

We plan to introduce a secure sandbox environment where AI agents can safely interact with simulated systems and applications. This will allow RedLockX to:

  • Observe real-world attack behaviors
  • Generate adversarial training data
  • Test agent security policies
  • Simulate prompt injection and tool manipulation attacks
  • Continuously improve detection capabilities

Adaptive Learning

Future versions will learn from emerging attack patterns and improve detection over time through continuous model updates and feedback-driven training.

Enterprise AI Middleware

RedLockX will evolve into a plug-and-play middleware security layer that organizations can place between users, AI agents, RAG systems, and LLMs.

Multilingual Threat Detection

Current detection focuses primarily on English-language attacks. Future releases will support:

  • Urdu
  • Arabic
  • Chinese
  • Spanish
  • French
  • German
  • Other major languages

This will allow RedLockX to identify prompt injection attacks across multiple languages and regions.

Beyond Prompt Injection

Future protection layers will include:

  • RAG Data Poisoning Detection
  • Tool Abuse Prevention
  • Sensitive Data Leakage Protection
  • AI Agent Behavioral Monitoring
  • Autonomous Red Teaming
  • Enterprise Security Analytics

Our long-term goal is to make RedLockX the security layer that protects the next generation of AI systems, AI agents, and autonomous digital workforces.

Updates

posted an update

RedLockX Update: New Website & Interactive Research Hub

We've just launched a major update to RedLockX!

To make our project easier to explore for judges, researchers, and developers, we've built a dedicated project website that showcases the complete RedLockX ecosystem in one place.

What's New?

  • Interactive landing page with live cybersecurity-inspired visuals
  • Detailed breakdown of the Prompt Injection threat landscape
  • Real-world AI security incidents and CVEs
  • Visual architecture of our dual-model detection pipeline
  • Attack gallery covering jailbreaks, prompt leaks, obfuscation, and indirect injections
  • Live Hugging Face integration so visitors can test RedLockX directly
  • Performance metrics, research insights, and detection methodology

Our goal is not only to detect prompt injection attacks, but also to help raise awareness about one of the fastest-growing threats facing AI systems today.

We're continuing to refine the platform and expand RedLockX into a complete security layer for LLM-powered applications. Feedback is always welcome. Stay tuned for more updates!

#RedLockX #AISecurity #CyberSecurity #PromptInjection #LLM #Hackathon #OpenSource

posted an update

Hey everyone!

We’re excited to announce that RedLockX is now live. This is a working prototype of our AI-powered Prompt Injection Firewall that detects and blocks malicious prompts in real time before they reach the model.

Please take a minute to check it out and try some prompts. Your feedback would mean a lot! Thanks for the support!
