Problem Statement 7:

As AI technologies rapidly integrate into our daily lives, concerns about privacy and security have become more urgent than ever. With the rise of powerful generative AI models, large-scale data collection, and cloud-based deployment, users face increasing risks: sensitive data leakage, identity theft, etc. This hackathon invites participants to explore solutions in the following areas:

  • Enhancing the privacy of AI systems themselves (Privacy of AI); and/or
  • Using AI to defend user privacy and security (AI for Privacy).

Inspiration:

In this day and age, AI content on social media has evolved to seamlessly replicate authentic video footages. Yet nowadays, AI detection sites only deal with deepfakes. But what about non-deepfake videos that look real. There are groups of users, particularly the elderly, who are not able to discern AI videos from real ones given how hyper-realistic some videos can be. We need to protect these groups of users so as to curb the spread of misinformation and inform them on true reality.

What it does:

This project is a machine learning-based system designed to detect whether an input video has been AI-generated or is real. By analysing sampled frames from videos, the model classifies each video with probability scores indicating the likelihood of it being AI-generated or authentic.

How we built it:

  • Extracts a fixed number of representative frames from any input video.
  • Utilizes a pretrained video transformer model fine-tuned specifically for distinguishing AI-generated videos.
  • Supports batch training and inference on local video datasets.
  • Automatically handles corrupted or missing videos during data processing.
  • Outputs interpretable probabilities for real vs. AI-generated content.
  • Saves trained model checkpoints for reuse or further fine-tuning.

Challenges we ran into:

  • Due to time constraints and limited hardware, we were only able to test about 1000 videos.
  • It was also difficult to find good video datasets as our solution is very broad and it aims to tackle AI videos in general. ## Accomplishments that we're proud of:
  • Given these constraints, our model managed to fine-tune to a level that is sufficient for a demo version

What we learned:

We learn what goes on in the process of training a model, including data pre-processing, which is probably the most important aspect of machine learning. We also get a glimpse of how fine-tuning the model works and we definitely gain insights on how we can improve for future AI hackathons

What's next for AI Video Detection WebApp:

We will definitely need to fine-tune our model to different video categories instead of only generalising them as AI vs Real.

Additional References:

Model and Sample Videos for Testing: link

Built With

Share this project:

Updates