GIF
Sample video of uploading audio file and how it can show

🛡️ VConShield – PII Redaction

Protecting Conversations, Preserving Privacy

🔍 Inspiration

In a world driven by digital communication, protecting user privacy is more important than ever. With the rise of virtual conversations and voice data sharing, personally identifiable information (PII) is often unknowingly exposed. We were inspired to build VConShield to address this critical concern by automatically identifying and redacting PII from conversational data (vCons), ensuring compliance and user trust.

💡 What it does

VConShield takes audio conversations (e.g., .wav, .mp3) and returns redacted .vcon files by:

Identifying PII such as names, addresses, phone numbers, emails, etc., using advanced Named Entity Recognition (NER) models.
Redacting the PII in transcripts by replacing it with [REDACTED].
Optionally processing the audio to either beep, silence, or cut the identified PII segments.
Providing a secure conserver link to access both detected and redacted outputs.

🛠️ How we built it

Transcription: Converted audio input to text using Whisper from OpenAI for accurate speech-to-text conversion.
PII Detection: Used spaCy’s NER and fine-tuned transformer-based models (like BERT and RoBERTa) to detect PII entities.
Redaction: Replaced PII in the transcript with [REDACTED] tags.
Audio Redaction: Used audio processing libraries (pydub, ffmpeg) to modify the original audio by beeping or silencing the PII segments.
vCon Integration: Packaged output into .vcon format using open-source vCon schema.
Frontend: Created a simple upload interface using Flask and integrated conserver links for download.

🚧 Challenges we ran into

Aligning time-stamped audio segments to specific text tokens for precise redaction.
Detecting less-obvious PII like gender indicators, religious beliefs, or voice metadata.
Balancing redaction accuracy without over-scrubbing useful context.
Processing long conversations without exceeding model or memory limits.

🏆 Accomplishments that we're proud of

Successfully built an end-to-end pipeline that works for both transcript and audio redaction.
Achieved high precision in PII detection using fine-tuned NER models.
Integrated with vCon format and delivered results via conserver link – clean, simple, and secure.
Created an intuitive demo that non-technical users can interact with.

📚 What we learned

How to build robust NER pipelines tailored for spoken conversations.
How to work with audio segmentation, timestamps, and waveform redaction.
The importance of addressing edge-case PII like voice, city, or inferred metadata.
Practical implementation of the vCon standard and working with conserver links.

🚀 What's next for VConShield - PII Redaction

Integrating multilingual PII detection and redaction.
Supporting real-time redaction for live call centers.
Adding customizable redaction rules per compliance frameworks (e.g., HIPAA, GDPR).
Improving the UI/UX and deploying as a cloud-native SaaS tool.
Collaborating with telehealth, customer support, and legal teams to adopt privacy-first audio processing.

Built With

Submitted to

AlgoArena

Created by

I contributed by testing various Hugging Face models for Named Entity Recognition and selecting the one with the highest accuracy for conversational data. I also built the sentiment analysis component to detect emotional tone in the conversations. Additionally, I implemented backend code for data redaction, using NLP techniques to improve the formatting and accuracy of the redacted data. On the frontend, I enhanced the user interface to allow in-page editing of redacted outputs for better user control and clarity.

Akshata Prakash Salunkhe
I was instrumental in setting up the input file processing pipeline, ensuring seamless handling of uploaded audio files (.wav, .mp3) and preparing them for transcription. I architected and developed the backend logic to integrate transcription, PII detection, and redaction workflows, coordinating the use of the best-performing NER model for accurate identification and replacement of sensitive information. On the frontend, I built a user-friendly interface that allows users to upload conversations and receive redacted outputs displayed in a clear, standardized format, along with conserver links for easy access and download.

Divya Sakhare

Updates

Akshata Prakash Salunkhe started this project — Jun 12, 2025 02:35 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.