Detecting AI-Generated Images Using Multi-Scale Attention

Project Summary

In this project, we will reproduce and extend the CIFAKE real-versus-synthetic image classification pipeline originally introduced by Bird & Lotfi (2023). We will first re-implement the baseline CNN classifier and its explainability workflow in PyTorch, replacing the original Grad-CAM visualization with Integrated Gradients for more robust interpretability.

We then introduce a novel artifact-aware architecture that expands the original model’s capabilities by integrating:

Frequency-domain representation (FFT/DCT) to capture generative artifacts
Multi-scale convolutional processing
Attention mechanisms to highlight subtle texture abnormalities typical of synthetic images.

Additionally, we will evaluate three model families: the original CIFAKE CNN baseline, standard pre-trained architectures (ResNet, VGG, ViT), our proposed frequency+attention multi-stream model.

Finally, we will test all models on synthetic images generated by methods not used in the CIFAKE dataset, enabling a cross-generator generalization study.

Methodology

Reproduction (Pytorch):

Recreate the CIFAKE dataset loading, preprocessing, and CNN training pipeline.
Match or approximate the reported accuracy using equivalent hyperparameters.
Replace Grad-CAM with Integrated Gradients.
Analyze any discrepancies in performance caused by framework differences (optimizer defaults, initialization, normalization).

Proposed Architecture

We design a dual-stream artifact-aware architecture: Stream A: Spatial Branch

CNN backbone (ResNet-18 or custom small CNN).
Multi-scale feature extraction using dilated convolutions or pyramid features.

Stream B: Frequency Branch

Convert input images to the frequency domain using FFT or DCT.
Apply a lightweight CNN to extract frequency patterns (checkerboard textures, periodic noise, high-frequency artifacts).

Fusion Mechanism

Concatenate spatial+frequency features.
Apply channel + spatial attention (e.g., lightweight self-attention block).

Classifier Head

Fully-connected layers leading to a binary real/fake prediction.

Interpretability Plan (Beyond Grad-CAM)

We employ a suite of interpretability techniques to uncover which features drive synthetic-image detection:

Integrated Gradients (primary method) – pixel-level attributions.
Occlusion Sensitivity – region importance by masking patches.
Feature Ablation (frequency masking) – identify which frequency bands matter.

Evaluation Plan

We employ a suite of interpretability techniques to uncover which features drive synthetic-image detection:
Integrated Gradients (primary method) – pixel-level attributions.
Occlusion Sensitivity – region importance by masking patches.
Feature Ablation (frequency masking) – identify which frequency bands matter.
Cross-Generator Generalization (New Synthetic Images) We will evaluate all models on synthetic images generated by newer diffusion or GAN models, such as: Stable Diffusion v2 DALL·E mini / Craiyon, any publicly available GAN fakes or lightly generated samples

Data

CIFAKE Dataset (Primary): The dataset contains 30k real CIFAR-10 images + 30k diffusion-generated synthetic counterparts and is a good source of balanced binary classification.
New Synthetic Dataset (Secondary): We will find small curated set of non-CIFAKE synthetic images for OOD testing, which ensures evaluation beyond the original paper.

Built With

Updates

Shihang Gui started this project — Nov 17, 2025 09:35 PM EST

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.