👁️ Inspiration
Current surveillance systems are "blind"—they record everything but understand nothing. As a Cyber Security student, I saw the inefficiency: high latency, massive storage waste, and a lack of real-time reasoning. Inspired by the "Zero-Waste AI" philosophy, Project OmniVision was born to transform passive cameras into an intelligent, autonomous security ecosystem that doesn't just watch, but thinks and acts.
🚀 What it does
Project OmniVision is an Edge-First AI Security Co-pilot. It performs:
- Identity Fusion: Seamlessly links people to their vehicles using YOLOv11 and spatial association.
- Autonomous Reasoning: Uses LLMs to analyze security metadata and generate human-like reports.
- Zero-Latency Response: Real-time auditory alerts via ElevenLabs and sub-second video streaming via Agora.
- Smart OSINT: Automatically cross-references suspicious license plates with public records using Exa.ai.
🛠️ How I built it
I built this project as a solo developer in under 36 hours using a high-performance Event-Driven Architecture:
- Core: .NET 9 API and Python AI Engine, synchronized via Redis Streams.
- Vision: YOLOv11s optimized for edge inference on an RTX 4050.
- Storage: S3-compatible storage (MinIO) for secure snapshot management.
- Intelligence: Dify.ai for agentic workflows and OpenAI Codex for natural language database querying.
- Communication: ElevenLabs for real-time voice synthesis and Agora for WebRTC-based low-latency live feeds.
🧠 Challenges I faced
- Solo Integration: Balancing infrastructure, AI logic, and frontend design alone was a massive undertaking.
- Real-Time Sync: Synchronizing high-frequency metadata across multiple microservices without bottle-necking the system.
- Data Acquisition: Adapting the system to handle real-world, high-latency RTSP feeds from remote nodes.
🏆 Accomplishments that I'm proud of
- Successfully implemented a robust Identity Fusion algorithm that tracks objects across frames.
- Achieved a "Zero-Waste" pipeline where only meaningful security events trigger heavy compute tasks.
- Integrated 5+ major sponsor APIs into a cohesive, production-ready dashboard.
📖 What I learned
I deepened my expertise in Multi-modal AI and learned how to leverage Agentic Workflows to replace rigid, hard-coded security logic. I also realized the power of Trae and Lovable in accelerating the development of complex full-stack applications.
🔮 What's next for Project OmniVision
- Scaling the system to support city-wide deployment (targeting existing infrastructure projects).
- Implementing predictive threat analysis using historical metadata.
- Enhancing the "Chat with Camera" feature for deeper forensic investigations.
Built With
- .net
- c#
- postgresql
- python
- react
- signalr
- vite
Log in or sign up for Devpost to join the conversation.