👁️ Inspiration

Current surveillance systems are "blind"—they record everything but understand nothing. As a Cyber Security student, I saw the inefficiency: high latency, massive storage waste, and a lack of real-time reasoning. Inspired by the "Zero-Waste AI" philosophy, Project OmniVision was born to transform passive cameras into an intelligent, autonomous security ecosystem that doesn't just watch, but thinks and acts.

🚀 What it does

Project OmniVision is an Edge-First AI Security Co-pilot. It performs:

  • Identity Fusion: Seamlessly links people to their vehicles using YOLOv11 and spatial association.
  • Autonomous Reasoning: Uses LLMs to analyze security metadata and generate human-like reports.
  • Zero-Latency Response: Real-time auditory alerts via ElevenLabs and sub-second video streaming via Agora.
  • Smart OSINT: Automatically cross-references suspicious license plates with public records using Exa.ai.

🛠️ How I built it

I built this project as a solo developer in under 36 hours using a high-performance Event-Driven Architecture:

  • Core: .NET 9 API and Python AI Engine, synchronized via Redis Streams.
  • Vision: YOLOv11s optimized for edge inference on an RTX 4050.
  • Storage: S3-compatible storage (MinIO) for secure snapshot management.
  • Intelligence: Dify.ai for agentic workflows and OpenAI Codex for natural language database querying.
  • Communication: ElevenLabs for real-time voice synthesis and Agora for WebRTC-based low-latency live feeds.

🧠 Challenges I faced

  • Solo Integration: Balancing infrastructure, AI logic, and frontend design alone was a massive undertaking.
  • Real-Time Sync: Synchronizing high-frequency metadata across multiple microservices without bottle-necking the system.
  • Data Acquisition: Adapting the system to handle real-world, high-latency RTSP feeds from remote nodes.

🏆 Accomplishments that I'm proud of

  • Successfully implemented a robust Identity Fusion algorithm that tracks objects across frames.
  • Achieved a "Zero-Waste" pipeline where only meaningful security events trigger heavy compute tasks.
  • Integrated 5+ major sponsor APIs into a cohesive, production-ready dashboard.

📖 What I learned

I deepened my expertise in Multi-modal AI and learned how to leverage Agentic Workflows to replace rigid, hard-coded security logic. I also realized the power of Trae and Lovable in accelerating the development of complex full-stack applications.

🔮 What's next for Project OmniVision

  • Scaling the system to support city-wide deployment (targeting existing infrastructure projects).
  • Implementing predictive threat analysis using historical metadata.
  • Enhancing the "Chat with Camera" feature for deeper forensic investigations.

Built With

Share this project:

Updates