Categorizing Fine Art
Who: Anna Dai (adai10), Rena Jiang (rjiang6), Ezra Muratoglu (emuratog)
Introduction: We want to implement a DL model to classify fine art images by genre, as proposed in the paper linked here: https://ieeexplore.ieee.org/abstract/document/8675906 We think that this will be a meaningful challenge because rather than classifying something with a hard ground truth, such as object recognition, our model seeks to categorize images semantically by artistic style.
Related Work: https://ieeexplore.ieee.org/document/5967323, https://hcis-journal.springeropen.com/articles/10.1186/s13673-016-0063-4, These two papers explore a similar task of painting genre classification, though the first one is limited in the number of genre/classes, and the second uses self-organizing maps instead of supervised neural nets.
Data: We will be using the Paintings Dataset for Recognizing the Art Movement (Pandora 18K) collection (http://imag.pub.ro/pandora/pandora_download.html), which comprises of 18038 images and 18 classes. The images are relatively evenly distributed across styles (each style ranging from 4-7% of the total dataset)
Methods: Option 1: One stage classification, or classifying image as a whole. Option 2: Two-stage classification, or splitting each painting into five patches, the four corners (L-shaped) and square centerpiece, training and classifying each patch individually, and then making a decision based on the predictions of those 5 patches. We can use GoogLeNet architecture, or something similar with a reasonable number of params (<15m)
Metrics: Base goal = 40% accuracy, Target goal = 50% accuracy, Stretch goal = 60% accuracy (Paper reaches baseline 64% accuracy with GoogLeNet architecture)
Ethics: What is your dataset? Are there any concerns about how it was collected, or labeled? Is it representative? What kind of underlying historical or societal biases might it contain? The labels of the dataset may potentially be disputable since paintings may fall into multiple genres or present more ambiguously, which would be reflected in the model's prediction. The dataset is also taken from the West, which has historically dominated fine art and marginalised Eastern and Polynesian art, though this is not a significant concern in this context since the movements/genres that we seek to label/classify primarily pertain to the West.
What broader societal issues are relevant to your chosen problem space? This kind of model could serve as a digital education tool, making the consumption of art more accessible, especially as classification of artistic style is traditionally a skill that requires both time and access to fine art and other educational resources.
Division of Labor: Anna–preprocessing, Rena–one-stage classification, Ezra–two-stage classification, All–troubleshoot/debug
Built With
- python
- tensorflow
Log in or sign up for Devpost to join the conversation.