Playful Code Review System

We’ve built a first version of a system that makes learning programming playful and engaging by turning code reviews into something closer to meme culture.

Given a Python code snippet, our system detects common errors, anti-patterns, and coding malpractices, and transforms them into contextual feedback paired with a meme and a generated message. The result is a “roast-meets-education” experience designed to make learning from mistakes more memorable and less intimidating.

The tool is currently focused on Python, but the architecture is language-extensible.


Idea

Instead of delivering just a grade, the system reimagines feedback as something social, funny, and shareable:

  • Professors can use it to give students structured yet entertaining feedback for smaller code submissions or exercises
  • Students can use it peer-to-peer to highlight issues in code in a lighthearted way
  • Feedback can be customized by tone (funny, educational, sarcastic, etc.)

How it works (high level)

1. Dataset creation (offline layer)

We built a curated dataset of coding malpractices by combining:

  • Ruff linter rules
  • Python anti-pattern repositories from GitHub
  • Bad-practice datasets and code examples

This process also involved web scraping and HTML parsing due to inconsistent formats across sources.
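
As a sketch of that parsing step, the standard library's `html.parser` can pull rule IDs and descriptions out of an HTML rules table. The table shape and the rule code below are illustrative, not the actual markup of any one source:

```python
from html.parser import HTMLParser

class RuleTableParser(HTMLParser):
    """Collects (rule_id, description) pairs from the <td> cells of a rules table."""
    def __init__(self):
        super().__init__()
        self._in_td = False
        self._cells = []
        self.rules = []

    def handle_starttag(self, tag, attrs):
        if tag == "td":
            self._in_td = True

    def handle_endtag(self, tag):
        if tag == "td":
            self._in_td = False
        elif tag == "tr" and len(self._cells) >= 2:
            # A finished row with at least an ID and a description becomes one rule.
            self.rules.append((self._cells[0], self._cells[1]))
            self._cells = []
        elif tag == "tr":
            self._cells = []

    def handle_data(self, data):
        if self._in_td and data.strip():
            self._cells.append(data.strip())

# Illustrative input; real sources had inconsistent formats around this core shape.
page = "<table><tr><td>E501</td><td>line too long</td></tr></table>"
parser = RuleTableParser()
parser.feed(page)
print(parser.rules)  # [('E501', 'line too long')]
```

In practice each source needed its own small parser like this, since the surrounding markup varied.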

On top of that, we added custom “fun” rules to make the system more relatable (e.g., discouraging meaningless print statements like "here", overuse of emojis in formal code, or confusing variable reuse).
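
One such custom rule can be sketched as a small AST check. The word list and function name here are illustrative, not the production rule set:

```python
import ast

# Hypothetical list of throwaway debug words; the real rule set differs.
DEBUG_WORDS = {"here", "test", "asdf", "hi"}

def find_meaningless_prints(source: str) -> list[int]:
    """Return line numbers of print calls whose only argument is a throwaway debug word."""
    hits = []
    for node in ast.walk(ast.parse(source)):
        if (isinstance(node, ast.Call)
                and isinstance(node.func, ast.Name)
                and node.func.id == "print"
                and len(node.args) == 1
                and isinstance(node.args[0], ast.Constant)
                and isinstance(node.args[0].value, str)
                and node.args[0].value.strip().lower() in DEBUG_WORDS):
            hits.append(node.lineno)
    return hits

code = 'x = 1\nprint("here")\nprint("result:", x)\n'
print(find_meaningless_prints(code))  # [2]
```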


2. Two-layer detection system

We experimented with multiple approaches:

  • Static + rule-based analysis
    Using tools like Ruff and structured analyzers to detect standard issues.

  • LLM-based reasoning model
    An LLM prompt-tuned on our curated malpractice dataset. This approach proved more robust and flexible.

Overall, the LLM-based detector achieved stronger coverage and better generalization.
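
The two layers can be combined roughly as below. Both layer functions are placeholders standing in for Ruff and the prompt-tuned LLM; the rule codes and merge logic are illustrative:

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Issue:
    rule: str
    line: int
    message: str

def rule_based_layer(source: str) -> list[Issue]:
    # Placeholder: the real system wraps Ruff and structured analyzers here.
    issues = []
    for i, line in enumerate(source.splitlines(), start=1):
        if "== True" in line:
            issues.append(Issue("E712", i, "avoid comparison to True"))
    return issues

def llm_layer(source: str) -> list[Issue]:
    # Placeholder: the real system queries the prompt-tuned LLM here.
    return [Issue("FUN001", 2, "meaningless print statement")]

def detect(source: str) -> list[Issue]:
    """Merge both layers, deduplicating by (rule, line)."""
    seen, merged = set(), []
    for issue in rule_based_layer(source) + llm_layer(source):
        key = (issue.rule, issue.line)
        if key not in seen:
            seen.add(key)
            merged.append(issue)
    return merged

sample = 'if flag == True:\n    print("here")\n'
for issue in detect(sample):
    print(issue.rule, issue.line, issue.message)
```

Deduplicating on (rule, line) lets the LLM layer add findings the static layer missed without double-reporting overlaps.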


3. Meme retrieval system (semantic matching)

Once issues are detected, we map them to memes using:

  • FAISS vector search index
  • Sentence Transformers (Hugging Face embeddings, specifically all-MiniLM-L6-v2)

Each meme is encoded as a vector, and the detected issue acts as a query. A KNN-style retrieval is performed to select the most appropriate meme for the situation.

Mathematically, retrieval can be expressed as:

$$ \text{meme}^* = \arg\min_{m \in M} \; d\big(f(\text{issue}), f(m)\big) $$

where:

  • f is the embedding function
  • d is a distance metric (cosine distance)
  • M is the meme dataset
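
In the running system this argmin is computed with FAISS over all-MiniLM-L6-v2 embeddings; the toy stand-in below shows the same nearest-neighbor selection with plain cosine distance and hand-made 2-D vectors:

```python
import math

def cosine_distance(u, v):
    """d(u, v) = 1 - cos(u, v), the distance used in the retrieval formula."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)

def retrieve_meme(issue_vec, meme_vecs):
    """meme* = argmin over m of d(f(issue), f(m)); returns the index of the nearest meme."""
    return min(range(len(meme_vecs)), key=lambda i: cosine_distance(issue_vec, meme_vecs[i]))

# Toy 2-D embeddings standing in for real all-MiniLM-L6-v2 vectors.
memes = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
print(retrieve_meme([0.9, 0.1], memes))  # 0
```

FAISS performs the same search, but over an index of thousands of high-dimensional vectors rather than a linear scan.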

4. Meme + humor generation layer

Finally, we feed:

  • the detected issue
  • the selected meme
  • user-selected tone (funny / educational / sarcastic / etc.)

into an LLM that generates a contextual joke or explanation.

A small randomness factor ensures variety and avoids repetitive outputs.
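
A minimal sketch of the prompt-assembly step, assuming hypothetical tone presets and a seeded random opener for variety; the real prompts and the LLM call itself are not shown:

```python
import random

# Hypothetical tone presets; the production prompts differ.
TONE_STYLES = {
    "funny": "Write a short, good-natured joke about the mistake.",
    "educational": "Explain the mistake clearly, with a light touch.",
    "sarcastic": "Deliver a dry, sarcastic one-liner about the mistake.",
}

def build_prompt(issue: str, meme_caption: str, tone: str, seed=None) -> str:
    """Assemble the LLM prompt; a randomly chosen opener keeps outputs from repeating."""
    rng = random.Random(seed)
    opener = rng.choice(["Oh no.", "Well, well.", "Classic."])  # small randomness factor
    return (
        f"{opener} {TONE_STYLES[tone]}\n"
        f"Detected issue: {issue}\n"
        f"Meme context: {meme_caption}"
    )

print(build_prompt("mutable default argument", "This is fine.", "sarcastic", seed=0))
```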


Why it matters

This project turns code feedback into something:

  • more engaging for learners
  • more expressive for teachers
  • more social and shareable for peers

Instead of just saying “this is wrong,” we say it in a way that sticks.
