โจ Promptly
Empowering the future of AI through precision prompt engineering
๐ Inspiration
In a world accelerating toward autonomous AI agents, prompt engineering is no longer optionalโit's foundational.
The numbers tell a compelling story:
- ๐ The global prompt-engineering market, valued at $222.1M in 2023, is projected to surge beyond $2.06B by 2030
- ๐ค 82% of companies already deploy AI agents in internal operations
- โก 85% have integrated agents into at least one critical workflow
As dependence on AI agents grows exponentially, so does the need for precise, secure, and high-performance prompts.
Promptly was created to meet that needโhelping developers craft, harden, and optimize prompts with clarity and confidence.
๐ฏ What it does
Promptly is a comprehensive platform for modern prompt engineering, built around three powerful pillars:
1. ๐ป Prompt Engineering IDE
A purpose-built development environment designed exclusively for prompts, powered by a dedicated orchestration agent.
Features:
- โก Rapid iteration and testing
- ๐ Structured refinement workflows
- ๐ Version-controlled prompt building
- ๐จ Intuitive interface (think Cursor, but for prompts)
2. ๐ก๏ธ Security Evaluation
Advanced threat detection powered by cutting-edge AI.
Components:
- ๐ค Custom Small Language Model (SLM) trained on 10,000+ prompts and system instructions
- ๐ Detects prompt-injection risks, hidden manipulations, and malicious intent
- ๐ง Gemini-powered security agent that:
- Explains vulnerabilities in plain language
- Highlights critical risks with context
- Suggests actionable remediation strategies
3. ๐ Quality Evaluation
Side-by-side comparison studio across major LLMs.
Benefits:
- ๐ฌ Compare model outputs under identical conditions
- โ๏ธ Evaluate reliability, creativity, and accuracy
- ๐ฏ Choose the optimal model for your specific use case
๐๏ธ How we built it
We architected a modular multi-agent workflow using modern AI orchestration tools:
Core Technologies:
- ๐ LangChain & LangGraph for intelligent orchestration
- ๐ Gemini (via Google AI Studio) for prompt evaluation
- ๐งช OpenAI models (via Azure AI Foundry) for output benchmarking
Security Layer:
- ๐ ๏ธ Built a custom SLM from scratch
- ๐ Fine-tuned on a curated dataset using Vertex AI
- โก Deployed as a lightweight inference service
Frontend:
- โ๏ธ React application with real-time editing
- ๐ Seamless state synchronization
- โ๏ธ Backed by Azure's scalable infrastructure
Every component was designed to be portable, extensible, and cloud-agnostic.
๐ง Challenges we ran into
Technical Integration
Orchestrating a multi-agent ecosystem across Azure and Vertex AI introduced:
- ๐ Complex API conflicts
- ๐ซ Token-handling inconsistencies
- ๐ Authentication edge cases
We relied heavily on community resourcesโStack Overflow, Reddit, and GitHubโto debug obscure failures and discover workarounds.
Dataset Curation
Building the SLM training dataset proved equally challenging:
- ๐ Resources on prompt-injection were scattered and inconsistent
- ๐ต๏ธ Manually curated examples from:
- Ethical hacking forums
- GitHub security repositories
- Academic research papers
- Red-team datasets
- โ๏ธ Ensured full GDPR compliance throughout
The process took significantly longer than expected, but the extra effort resulted in a far more robust and accurate model.
๐ Accomplishments that we're proud of
๐ฏ Custom Security SLM
Our biggest achievement: successfully engineering and deploying our own SLM for prompt-security validationโwithout relying on RAG or external filters.
Capabilities:
- โ Detects subtle manipulation attempts
- ๐ Identifies layered injections
- ๐ง Catches indirect adversarial patterns
- ๐ Delivers impressive accuracy across diverse test cases
โ๏ธ Cross-Cloud Orchestration
We mastered the notoriously difficult task of building a resilient multi-agent system that seamlessly bridges:
- Microsoft Azure
- Google Vertex AI
- LangChain orchestration
- LangGraph state management
The result: a robust, interconnected system ready for enterprise deployment.
๐ What we learned
๐ค Multi-Agent Architecture
Gained deep insights into orchestration frameworksโparticularly how LangGraph excels in structured state management when you maintain:
- ๐ Disciplined design patterns
- ๐ Clear state transitions
- โ ๏ธ Strict error-handling protocols
๐ก๏ธ Security ML Development
The SLM taught us that high-quality, diverse data is non-negotiable when dealing with adversarial patterns:
- ๐ Data augmentation strategies are critical
- ๐ค Ethical sourcing requires careful consideration
- ๐ฌ Continuous validation prevents model drift
๐ฐ Cloud Cost Management
Cross-cloud deployment exposed the importance of financial guardrails:
- ๐ธ LLM-heavy workflows can exhaust budgets rapidly
- ๐ Monitoring and throttling are essential
- ๐ฏ Strategic resource allocation maximizes ROI
๐ Iterative Refinement
Extensive testing confirmed that prompt engineering is inherently iterativeโeach refinement cycle strengthens reliability against real-world edge cases.
๐ What's next for Promptly
We're evolving Promptly into a production-grade platform with an ambitious roadmap:
๐ฎ Near-Term Enhancements
- ๐ง Advanced NLP tools and custom LLM fine-tuning
- ๐ Expanded context windows for complex prompts
- ๐ค More powerful AI editing assistants
- โป๏ธ Autonomous prompt revision and self-optimization
๐ฅ Collaboration Features
- ๐ค Team workspaces for multi-developer environments
- ๐ Shared prompt libraries and templates
- ๐ Real-time collaborative editing
- ๐ Team analytics and insights
๐ Ecosystem Growth
- ๐ค Strategic partnerships to expand our training corpus
- ๐ Open-source IDE components to foster community innovation
- ๐ข Enterprise security certifications (SOC 2, ISO 27001)
- ๐ Educational resources and certification programs
๐ฏ Vision
Position Promptly as the industry-standard platform for safe, effective, and collaborative prompt engineering.
๐ก Final Thoughts
Promptly began as a toolโbut it's rapidly becoming an ecosystem.
Join us in shaping the future of AI interaction, one prompt at a time.
Built With
- azure
- flask
- gemini
- google-ai-studio
- html
- javascript
- langchain
- langgraph
- matplotlib
- n8n
- openai
- pandas
- postgresql
- python
- scikit-learn
- seaborn
- supabase
- tailwind-css
- transformers



Log in or sign up for Devpost to join the conversation.