Project OmniVision

👁️ Inspiration

Current surveillance systems are "blind"—they record everything but understand nothing. As a Cyber Security student, I saw the inefficiency: high latency, massive storage waste, and a lack of real-time reasoning. Inspired by the "Zero-Waste AI" philosophy, Project OmniVision was born to transform passive cameras into an intelligent, autonomous security ecosystem that doesn't just watch, but thinks and acts.

🚀 What it does

Project OmniVision is an Edge-First AI Security Co-pilot. It performs:

Identity Fusion: Seamlessly links people to their vehicles using YOLOv11 and spatial association.
Autonomous Reasoning: Uses LLMs to analyze security metadata and generate human-like reports.
Zero-Latency Response: Real-time auditory alerts via ElevenLabs and sub-second video streaming via Agora.
Smart OSINT: Automatically cross-references suspicious license plates with public records using Exa.ai.

🛠️ How I built it

I built this project as a solo developer in under 36 hours using a high-performance Event-Driven Architecture:

Core: .NET 9 API and Python AI Engine, synchronized via Redis Streams.
Vision: YOLOv11s optimized for edge inference on an RTX 4050.
Storage: S3-compatible storage (MinIO) for secure snapshot management.
Intelligence: Dify.ai for agentic workflows and OpenAI Codex for natural language database querying.
Communication: ElevenLabs for real-time voice synthesis and Agora for WebRTC-based low-latency live feeds.

🧠 Challenges I faced

Solo Integration: Balancing infrastructure, AI logic, and frontend design alone was a massive undertaking.
Real-Time Sync: Synchronizing high-frequency metadata across multiple microservices without bottle-necking the system.
Data Acquisition: Adapting the system to handle real-world, high-latency RTSP feeds from remote nodes.

🏆 Accomplishments that I'm proud of

Successfully implemented a robust Identity Fusion algorithm that tracks objects across frames.
Achieved a "Zero-Waste" pipeline where only meaningful security events trigger heavy compute tasks.
Integrated 5+ major sponsor APIs into a cohesive, production-ready dashboard.

📖 What I learned

I deepened my expertise in Multi-modal AI and learned how to leverage Agentic Workflows to replace rigid, hard-coded security logic. I also realized the power of Trae and Lovable in accelerating the development of complex full-stack applications.

🔮 What's next for Project OmniVision

Scaling the system to support city-wide deployment (targeting existing infrastructure projects).
Implementing predictive threat analysis using historical metadata.
Enhancing the "Chat with Camera" feature for deeper forensic investigations.

Built With

Updates

Tú Trịnh Hoàng started this project — Mar 21, 2026 08:00 AM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.