Inspiration

In a digital age where our personal lives and the digital space is increasingly intertwined, our Personally Identifiable Information (PII) becomes ever more valuable for malicious actors(e.g. scammers). The implications of a PII leak today has never been more grave than ever before. Therefore, we introduce Beep, an innovative real-time chat application built for the TikTok Hackathon featuring advanced voice messaging capabilities with intelligent PII (Personally Identifiable Information) detection and automatic redaction. This platform ensures user privacy while maintaining seamless communication through cutting-edge AI technologies. We hope to inspire a new generation of real-time chat applications that are centered on protecting their user's PII.

🌟 Key Highlights

  • 🎯 Privacy-First: Automatic detection and redaction of sensitive information in real-time
  • 🎤 Voice Intelligence: Advanced speech-to-text with multiple audio format support
  • ⚡ Real-Time: Instant messaging with WebSocket technology
  • 🛡️ Enterprise Security: Comprehensive PII protection
  • 🎨 Modern UI: Beautiful, responsive interface built with Next.js and Tailwind CSS

🎯 Core Features

🎤 Advanced Audio Messaging

  • 📱 Multi-Format Support: WAV, MP3, OGG, M4A, WebM, AAC, AMR, FLAC, OPUS
  • ⚡ Real-time Broadcasting: Instant delivery via optimized SocketIO
  • ⏱️ Smart Duration Tracking: Automatic audio length calculation with metadata
  • 🔍 File Validation: Intelligent size limits and format verification
  • 🎚️ Audio Processing: FFmpeg-powered conversion and optimization

🧠 Intelligent Speech-to-Text

  • 🤖 AI-Powered: Facebook S2T model with transformer architecture
  • 📍 Precise Timestamps: Word-level timing for enhanced user experience
  • 🛡️ Error Resilience: Graceful fallbacks with detailed error reporting
  • 🔄 Batch Processing: Efficient handling of multiple audio files
  • 📊 Confidence Scoring: Quality metrics for transcription accuracy

🔒 Enterprise-Grade PII Detection

  • 🚨 Real-time Scanning: Every message analyzed before delivery
  • 🎯 Smart Redaction: Phone numbers, emails, SSNs, addresses → [REDACTED]
  • 📈 Confidence Reporting: Detailed detection metrics and field types
  • 🔐 Privacy Preservation: Original content stored securely for authorized review

💬 Real-time Communication

  • 🔌 SocketIO Integration: Low-latency bidirectional communication
  • 🏠 Room-based Architecture: Isolated conversations with unique codes
  • 👥 User Management: Online/offline status with join/leave notifications
  • 💾 Message Persistence: Conversation history with searchable archives
  • ⌨️ Typing Indicators: Real-time typing status for enhanced UX

Built With

Share this project:

Updates