Inspiration

These techniques in deep learning are inspired by how humans leverage past knowledge to learn new things. Fine-tuning the entire model adapts a pre-trained model to new tasks by updating all its layers, potentially achieving high performance with large and diverse datasets. Feature extraction and transfer learning use pre-trained layers as fixed feature extractors, reducing computational cost and overfitting risk, making them ideal for smaller datasets or limited resources. Incremental fine-tuning and distillation techniques further refine these approaches for specific scenarios, allowing for gradual adaptation to new classes or knowledge distillation for resource-constrained environments. Overall, these techniques aim to efficiently transfer knowledge from pre-trained models, mimicking how humans build upon existing knowledge to learn new skills.

What it does

The project explores image classification with a pretrained Swin Tiny transformer. It also combines the pretrained Swin backbone with supervised contrastive learning to cluster image embeddings, and then trains a classifier on those embeddings to obtain a well-separated partition of the classes.
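As a rough illustration of this pipeline, the sketch below loads a pretrained Swin Tiny backbone from `timm`, extracts pooled image embeddings, and puts a linear classification head on top of them. The model name comes from timm's catalogue, but the batch shapes and `num_classes` value are placeholders, and the supervised contrastive step is only indicated in a comment; this is a sketch of the general idea, not the exact code we ran.

```python
import torch
import torch.nn as nn
import timm

# Pretrained Swin Tiny backbone; num_classes=0 makes timm return pooled
# embeddings instead of classification logits.
backbone = timm.create_model("swin_tiny_patch4_window7_224", pretrained=True, num_classes=0)
backbone.eval()

num_classes = 10                    # placeholder: number of target classes
embed_dim = backbone.num_features   # 768 for Swin Tiny

# In the contrastive variant, the backbone would first be fine-tuned with a
# supervised contrastive loss (e.g. SupCon) so that same-class embeddings
# cluster together; the linear head is then trained on those embeddings.
head = nn.Linear(embed_dim, num_classes)

images = torch.randn(4, 3, 224, 224)    # dummy batch of 224x224 RGB images
with torch.no_grad():
    embeddings = backbone(images)        # (4, embed_dim) pooled features
logits = head(embeddings)                # (4, num_classes) class scores
```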

How we built it

Fine-tune the entire model: One can fine-tune the entire pretrained model on the combined dataset, including the new class. This involves updating the weights of all layers in the model, including those of the pretrained layers, to adapt to the new class.
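A minimal sketch of this option, assuming a Swin Tiny backbone from timm, a cross-entropy objective, and placeholder hyperparameters and class count; the commented lines show how the head-only variant in option 1 below differs.

```python
import torch
import torch.nn as nn
import timm

num_classes = 11  # placeholder: e.g. the original 10 classes plus the new one
model = timm.create_model("swin_tiny_patch4_window7_224", pretrained=True, num_classes=num_classes)

# Full fine-tuning: every parameter, including the pretrained backbone, is updated.
# (For the head-only variant in option 1 below, freeze the backbone first:
#     for name, p in model.named_parameters():
#         if "head" not in name:
#             p.requires_grad = False
# and pass only the head's parameters to the optimizer.)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5, weight_decay=0.05)
criterion = nn.CrossEntropyLoss()

def train_step(images, labels):
    """One optimisation step on a batch from the combined dataset."""
    model.train()
    optimizer.zero_grad()
    loss = criterion(model(images), labels)
    loss.backward()
    optimizer.step()
    return loss.item()
```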

  1. Retrain the final classification layer: Alternatively, one can keep the weights of the pretrained layers fixed and train only the final classification layer to classify the new class. This approach is computationally efficient and less prone to overfitting, especially when the original dataset was large and the new class has limited data.
  2. Incremental fine-tuning: If the new class is related to the existing classes in your dataset, you can perform incremental fine-tuning. First, fine-tune the model on the new class while keeping the weights of the pretrained layers fixed. Then, fine-tune the entire model on the combined dataset to further refine the model's performance on all classes.
  3. Distillation: If computational resources are limited and the new class has limited data, you can use knowledge distillation. Train a new model (the teacher) on the combined dataset, including the new class, and then train a smaller model (the student) to mimic the teacher's predictions. This approach can help the student generalize the knowledge learned from the combined dataset to the new class more effectively; a sketch of the distillation loss appears at the end of this section.

The choice of approach depends on factors such as the size of the dataset, the available computational resources, and how similar the new class is to the existing classes. Experimentation and evaluation on a validation set are essential to determine the most effective approach for a specific scenario.

Challenges we ran into

- Challenge: Getting diverse, high-quality data for fine-tuning can be tough and expensive. Solution: Augment existing data or use transfer learning from similar datasets.
- Challenge: Fine-tuning the entire model requires a lot of computing power and time. Solution: Use cloud resources or compress models to reduce the burden.
- Challenge: Models may memorize training data instead of generalizing, especially with small datasets. Solution: Apply regularization techniques and use validation datasets for model evaluation.
- Challenge: Deep neural networks are often seen as black boxes, making it hard to understand their decisions. Solution: Use interpretation techniques like feature visualization or model-agnostic methods.
- Challenge: Pre-trained models may struggle when applied to new domains with different data distributions. Solution: Use domain adaptation techniques or fine-tune with domain-specific data.
- Challenge: Models might not effectively learn from minority classes if those classes are much smaller. Solution: Address class imbalance with techniques like class weighting or data resampling.

Accomplishments that we're proud of

We followed up these different approaches with many experiments to assess the contribution of each model and training procedure, compared accuracy across the different settings, and identified the best-performing model.
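As referenced in option 3 above, here is a minimal sketch of the knowledge-distillation idea, assuming a Hinton-style soft-target loss, a frozen teacher, a smaller student model from timm, and placeholder hyperparameters; it illustrates the technique rather than our exact training setup.

```python
import torch
import torch.nn.functional as F
import timm

num_classes = 11
temperature, alpha = 4.0, 0.7   # placeholder distillation hyperparameters

# Teacher: larger model already fine-tuned on the combined dataset (kept frozen here).
teacher = timm.create_model("swin_tiny_patch4_window7_224", pretrained=True, num_classes=num_classes)
teacher.eval()
# Student: smaller model that learns to mimic the teacher's predictions.
student = timm.create_model("resnet18", pretrained=True, num_classes=num_classes)

def distillation_loss(images, labels):
    """Blend soft teacher targets with the usual hard-label cross-entropy."""
    with torch.no_grad():
        teacher_logits = teacher(images)
    student_logits = student(images)
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=1),
        F.softmax(teacher_logits / temperature, dim=1),
        reduction="batchmean",
    ) * (temperature ** 2)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```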

What we learned

Embarking on this project meant diving into the exciting world of cutting-edge machine learning. We focused on few-shot learning techniques tailored for image classification, mastering the art of training neural networks. Along the way, we refined our skills in data preprocessing, model selection, and optimization strategies. This journey isn't just about technical prowess; it's about real-world impact. We explored domains like medical imaging and satellite analysis, where data scarcity is a challenge. Collaborating with experts, we fine-tuned our approach and shared our discoveries with the wider research community. Ultimately, this project isn't just about learning; it's about making a tangible contribution to the field of machine learning.

What's next for Image classification using few-shot learning

Advanced Meta-learning Techniques: Dive deeper into meta-learning methods such as meta-reinforcement learning or meta-learning with memory-augmented networks to improve few-shot learning performance.

Cross-modal Few-shot Learning: Extend few-shot learning techniques to handle multiple modalities, such as combining images with text or audio inputs for more robust classification.

Interactive Learning Approaches: Investigate interactive learning methods where the model actively queries for additional information or guidance from users to improve classification accuracy with few examples.

Continual Few-shot Learning: Develop techniques for continual few-shot learning where the model continuously adapts to new tasks or classes over time without forgetting previously learned knowledge.

Zero-shot Learning Enhancement: Enhance few-shot learning models to handle zero-shot learning scenarios more effectively, where the model must classify classes it has never seen before based on semantic descriptions or attributes.
