RedLockX:MainDashboard-SupaBase:DatabaseSchema
GitHub:Repo
WorkFlow
deBERTa-v3 Performance Matrix
Hybrid Layer Performance Matrix

Inspiration

As AI systems become increasingly integrated into businesses, they are exposed to new security threats such as prompt injection, jailbreaks, system prompt leakage, and adversarial attacks. While organizations are rapidly adopting LLMs and AI agents, most lack a dedicated security layer to protect them. This inspired us to build RedLockX, a real-time AI firewall designed to stop malicious inputs before they reach the model.

What it does

RedLockX is an AI-powered prompt injection firewall that sits between users and AI models. It analyzes prompts in real time and detects:

Prompt Injection Attacks
Jailbreak Attempts
System Prompt Extraction
Indirect Prompt Injections
Obfuscation & Encoding Attacks
Role-Play Escapes

Using a multi-layer detection engine, RedLockX provides a risk score, attack classification, detailed explanation, and a final ALLOW or BLOCK verdict before the prompt reaches the LLM.
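To make the shape of that output concrete, here is a minimal sketch of a verdict record carrying the four fields named above. The class names, the field names, and the 0.5 cut-off are assumptions for illustration; the write-up does not state the threshold RedLockX actually uses.

```python
from dataclasses import dataclass
from enum import Enum


class Decision(str, Enum):
    ALLOW = "ALLOW"
    BLOCK = "BLOCK"


@dataclass(frozen=True)
class Verdict:
    risk_score: float   # 0.0 (benign) to 1.0 (malicious)
    attack_type: str    # e.g. "jailbreak", or "none"
    explanation: str    # human-readable reason for the decision
    decision: Decision


# Hypothetical cut-off; not taken from the RedLockX source.
BLOCK_THRESHOLD = 0.5


def make_verdict(risk_score: float, attack_type: str, explanation: str) -> Verdict:
    """Clamp the score to [0, 1] and derive the ALLOW/BLOCK decision."""
    score = min(1.0, max(0.0, risk_score))
    decision = Decision.BLOCK if score >= BLOCK_THRESHOLD else Decision.ALLOW
    return Verdict(score, attack_type, explanation, decision)
```

Keeping the explanation in the same record as the decision means every BLOCK can be logged and shown to the user with its reason attached.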

How we built it

We developed RedLockX using a modern full-stack architecture:

Frontend: React, TypeScript, Tailwind CSS
Backend: Vercel Serverless Functions, Express
Database: Supabase PostgreSQL
ML Models: Fine-tuned DeBERTa-v3 Transformer
Hybrid Detection: XGBoost + all-MiniLM-L6-v2 Embeddings
Deployment: GitHub + Vercel CI/CD

The system uses a dual-model architecture in which both detectors, the fine-tuned DeBERTa-v3 classifier and the XGBoost hybrid layer, analyze each prompt in parallel. A weighted decision engine then combines their outputs into the final security verdict.
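A weighted decision engine of this kind can be sketched as follows. This is an illustration under stated assumptions, not the RedLockX implementation: the detector names, the weights, the 0.5 threshold, and the use of a thread pool for parallelism are all hypothetical, and each detector is assumed to return a malicious-probability between 0 and 1.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable

# Assumed detector interface: prompt in, malicious-probability in [0, 1] out.
Detector = Callable[[str], float]


def combine_scores(scores: dict[str, float], weights: dict[str, float]) -> float:
    """Weighted average of detector scores; weights need not sum to 1."""
    total = sum(weights[name] for name in scores)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(scores[name] * weights[name] for name in scores) / total


def analyze(prompt: str, detectors: dict[str, Detector],
            weights: dict[str, float], threshold: float = 0.5) -> tuple[float, str]:
    """Run all detectors concurrently, then fuse their scores into a verdict."""
    with ThreadPoolExecutor(max_workers=len(detectors)) as pool:
        futures = {name: pool.submit(fn, prompt) for name, fn in detectors.items()}
        scores = {name: fut.result() for name, fut in futures.items()}
    risk = combine_scores(scores, weights)
    return risk, ("BLOCK" if risk >= threshold else "ALLOW")
```

With stub detectors returning 0.9 and 0.4 and weights of 0.6 and 0.4, the fused risk is about 0.70, which is above the assumed threshold and yields BLOCK. Normalizing by the weight total keeps the result in the 0 to 1 range even if the weights are later retuned.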

Challenges we ran into

Building an effective AI security system came with several challenges:

Balancing detection accuracy with low-latency responses.
Detecting heavily obfuscated and encoded attacks.
Reducing false positives while maintaining strong protection.
Integrating multiple machine learning models into a unified pipeline.
Ensuring system reliability when cloud-hosted ML services become unavailable.
Designing explainable security decisions that users can understand and trust.
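One common way to handle the reliability challenge above, where a cloud-hosted ML service becomes unavailable, is to fall back to a cheap local check. The sketch below assumes a `remote` callable that raises an exception when the hosted model is unreachable; the two regex patterns and all function names are illustrative and are not taken from RedLockX.

```python
import re
from typing import Callable

# Illustrative patterns only; a real rule set would be far larger.
FALLBACK_PATTERNS = [
    re.compile(r"ignore (all )?(previous|prior) instructions", re.I),
    re.compile(r"reveal (your )?system prompt", re.I),
]


def rule_based_score(prompt: str) -> float:
    """Cheap local check: 1.0 if any known pattern matches, else 0.0."""
    return 1.0 if any(p.search(prompt) for p in FALLBACK_PATTERNS) else 0.0


def score_with_fallback(prompt: str, remote: Callable[[str], float]) -> tuple[float, str]:
    """Try the remote model; on any failure, use the local rules instead."""
    try:
        return remote(prompt), "remote"
    except Exception:
        return rule_based_score(prompt), "fallback"
```

Returning the source label alongside the score lets the logs show which verdicts were produced in degraded mode.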

Accomplishments that we're proud of

Built a working end-to-end AI security platform rather than a simple prototype.
Successfully combined machine learning and cybersecurity techniques into a single solution.
Developed real-time detection for multiple categories of prompt injection attacks.
Created a deployable middleware architecture that can integrate with existing AI applications.
Implemented logging, analytics, and explainable security verdicts.
Designed RedLockX as a scalable foundation for future AI security products.

What we learned

Through RedLockX, we gained valuable experience in:

AI and LLM Security
Prompt Injection Detection
Adversarial Attack Analysis
Machine Learning Model Integration
Full-Stack Development
Cloud Deployment and Scalability
API Design and Security Engineering
Building practical cybersecurity solutions for emerging AI threats

Most importantly, we learned that securing AI systems requires multiple layers of defense rather than relying solely on model-level safeguards.

What's next for RedLockX

Our vision is to transform RedLockX into a comprehensive AI Security Platform.

Autonomous Agent Security

As AI agents become capable of performing real-world actions, RedLockX will act as a security gateway for:

Autonomous AI Agents
Multi-Agent Systems
MCP Tool Integrations
Agent-to-Agent Communication

Sandbox Attack Simulation Environment

We plan to introduce a secure sandbox environment where AI agents can safely interact with simulated systems and applications. This will allow RedLockX to:

Observe real-world attack behaviors
Generate adversarial training data
Test agent security policies
Simulate prompt injection and tool manipulation attacks
Continuously improve detection capabilities

Adaptive Learning

Future versions will learn from emerging attack patterns and improve detection over time through continuous model updates and feedback-driven training.

Enterprise AI Middleware

RedLockX will evolve into a plug-and-play middleware security layer that organizations can place between users, AI agents, RAG systems, and LLMs.
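As a rough picture of what "plug-and-play" could mean here, the sketch below wraps any model-calling function so that a firewall check runs first. The `guard` helper, the type aliases, and the blocked-request message are hypothetical names chosen for illustration; the firewall is assumed to return the string "ALLOW" or "BLOCK".

```python
from typing import Callable

Firewall = Callable[[str], str]   # assumed to return "ALLOW" or "BLOCK"
Model = Callable[[str], str]      # any function that sends a prompt to an LLM

BLOCKED_MESSAGE = "Request blocked by security policy."


def guard(model: Model, firewall: Firewall) -> Model:
    """Wrap a model call so the firewall sees every prompt first."""
    def guarded(prompt: str) -> str:
        if firewall(prompt) == "BLOCK":
            return BLOCKED_MESSAGE   # the model is never invoked
        return model(prompt)
    return guarded
```

Because the wrapper has the same signature as the function it wraps, existing application code can swap in the guarded version without other changes.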

Multilingual Threat Detection

Current detection focuses primarily on English-language attacks. Future releases will support:

Urdu
Arabic
Chinese
Spanish
French
German
Other major languages

This will allow RedLockX to identify prompt injection attacks across multiple languages and regions.

Beyond Prompt Injection

Future protection layers will include:

RAG Data Poisoning Detection
Tool Abuse Prevention
Sensitive Data Leakage Protection
AI Agent Behavioral Monitoring
Autonomous Red Teaming
Enterprise Security Analytics

Our long-term goal is to make RedLockX the security layer that protects the next generation of AI systems, AI agents, and autonomous digital workforces.

Built With

all-mini-l6-v2
ci/cd
colab
deberta-v3
github
groq
hf-space-api
huggingface
javascript
lang
langgraph
python
replit
supabase
typescript
vercel
xgboost

Submitted to

AI for Civic Innovation Hackathon
Winner: Audience Favourite
Cyber_Coders

Created by

Worked on the workflow + ML Layer of RedLockX

Abd ullah
(Pen-Testing+Cyber-Space+ML-Algos+Agentic-Tools)=>Make.life.Better
Worked on the semantic layer and XGBoost.

Muhammad Uzair
I'm a Cyber Security student at UET Peshawar.
haseeb ihsan
Shaheer Ahmad

Updates

Abd ullah posted an update — Jun 15, 2026 11:01 AM EDT

RedLockX Update: New Website & Interactive Research Hub

We've just launched a major update to RedLockX!

To make our project easier to explore for judges, researchers, and developers, we've built a dedicated project website that showcases the complete RedLockX ecosystem in one place.

What's New?

• Interactive landing page with live cybersecurity-inspired visuals
• Detailed breakdown of the Prompt Injection threat landscape
• Real-world AI security incidents and CVEs
• Visual architecture of our dual-model detection pipeline
• Attack gallery covering jailbreaks, prompt leaks, obfuscation, and indirect injections
• Live Hugging Face integration so visitors can test RedLockX directly
• Performance metrics, research insights, and detection methodology

Our goal is not only to detect prompt injection attacks, but also to help raise awareness about one of the fastest-growing threats facing AI systems today.

We're continuing to refine the platform and expand RedLockX into a complete security layer for LLM-powered applications. Feedback is always welcome. Stay tuned for more updates!

#RedLockX #AISecurity #CyberSecurity #PromptInjection #LLM #Hackathon #OpenSource


Abd ullah posted an update — Jun 14, 2026 02:02 PM EDT

We are adding new features to RedLockX, so the site will be down for 20-30 minutes. Thanks!


Abd ullah posted an update — Jun 13, 2026 03:02 AM EDT

Hey everyone!

We’re excited to announce that RedLockX is now live. This is a working prototype of our AI-powered Prompt Injection Firewall that detects and blocks malicious prompts in real time before they reach the model.

Please take a minute to check it out and try some prompts. Your feedback would mean a lot! Thanks for the support.


Abd ullah started this project — Jun 13, 2026 02:41 AM EDT

Leave feedback in the comments!


Shaheer Ahmad posted an update — Jun 10, 2026 12:55 PM EDT
