Inspiration
The primary aim of the project is to learn creating model experiments, hyper parameter search and distributed training on the determined ai platform.
The project includes 2 models with several experiments to successfully deploy on the determined ai platform.
One project is a fine tuned efficientnet image classifier and re-training a question/answer transformer.
What it does & How its built
Image classification
The EfficientNetB0 model is the base model and it is fine tuned for Stanford dogs database using transfer learning where only the top layers are retrained.
The model is written in keras in the determined-ai platform template and deployed for distributed training.
After training experiment, an adaptive hyper parameter search experiment was created from that checkpoint to identify correct learning rate and drop out.
dataset: https://www.tensorflow.org/datasets/catalog/stanford_dogs
Question & Answer Transformer
A pytorch experiment using an existing question and answer transformer example was developed training on 2 slots for distributed training.
The transformer was fine tuned for Adverserial QA database from huggingface.
dataset: https://huggingface.co/datasets/adversarial_qa
Challenges we ran into
The main challenge was adapting model code to templates for the determined platform to be able to run experiments. Other challenges include ensuring the model works with the pytorch and tensorflow/keras versions of determined-ai
What's next for Determined driven model hyper parameter search
The next approach is to try image to text and text to image transformers. I had tried keras-nlp but due to conflicts between libraries was unable to execute the experiments
Built With
- determined
- keras
- python
- pytoch
- tensorflow

Log in or sign up for Devpost to join the conversation.