Inspiration
Every day, millions upload TikToks without realizing they might be exposing personal details — a license plate, a child’s face, an email on-screen, or even a phone number spoken out loud. Once it’s online, it’s too late. We wanted a privacy-first layer that empowers creators to stay safe without slowing down their creativity.
What it does
Redactly automatically scans your video before upload, looking at:
- Visuals: detects faces, vehicles, license plates etc.
- Speech & Captions: transcribes audio to catch phone numbers, addresses, names, etc using NER. It then surfaces all potential privacy leaks in an interactive UI where you decide: blur faces, pixelate plates, or bleep/mute sensitive words. With one click, it outputs a redacted video ready for TikTok.
How we built it
- Computer Vision: YOLOS for people/vehicles, plus regex/NER for PII.
- Speech: Whisper ASR pipeline to catch private info in dialogue.
- Fusion Layer: merges detections across modalities, removes duplicates, assigns severity.
- Frontend: Lynx UI that previews flagged regions and lets creators toggle actions.
- Backend: FastAPI pipeline that scans, fuses results, and re-encodes the redacted video using OpenCV + ffmpeg.
Challenges we ran into
Performance: running multiple models per frame was slow, so we introduced frame sampling.
Synchronization: aligning millisecond timestamps across audio and video tracks for smooth redaction. Audio/Visual artifacts that were removed using FFMPEG
Accomplishments that we're proud of
- Built a working end-to-end system: upload video → scan → interactive review → download safe version.
- Designed a clean UI that empowers creators to stay in control instead of enforcing automatic censorship.
What we learned
How messy real-world video analysis is — OCR, ASR, and detection each produce noisy results.
Importance of fusion: combining signals and scoring them is as valuable as the detectors themselves.
What's next for Redactly
Custom TikTok integration: direct “preview & redact before posting.”
Model fine-tuning: train on real plate datasets, region-specific privacy cues.
Privacy policies: expand detection to include ID cards, children, and other sensitive contexts.
Built With
- computer-vision
- cv2
- fastapi
- lynx
- natural-language-processing
- pydub
- python
- react
- spacy
- ultralytics
- yolov11
Log in or sign up for Devpost to join the conversation.