Zephyr-7B Preference Tuning

Inspiration

I've seen a lot of companies trying to build smaller models that perform as well as Llama.

This project trains Zephyr-7B to act as a customer service chatbot.

I used a GPTQ-quantized version of Zephyr-7B and fine-tuned it with LoRA.
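A rough sketch of why LoRA keeps this cheap: instead of updating a full weight matrix, LoRA trains two small low-rank factors next to the frozen weights. The hidden size and rank below are assumptions picked for illustration, not the settings used in this project, and the helper names are hypothetical.

```python
# Illustrative arithmetic only: hidden size and rank are assumed values,
# not the configuration used in this project.

def full_params(d_out: int, d_in: int) -> int:
    """Number of weights in one dense d_out x d_in projection matrix."""
    return d_out * d_in


def lora_params(d_out: int, d_in: int, rank: int) -> int:
    """Number of trainable weights in the LoRA factors B (d_out x rank) and A (rank x d_in)."""
    return rank * (d_out + d_in)


hidden = 4096  # assumed hidden size of one attention projection
rank = 16      # assumed LoRA rank

full = full_params(hidden, hidden)
lora = lora_params(hidden, hidden, rank)
ratio = lora / full

print(f"full matrix:  {full:,} weights")
print(f"LoRA factors: {lora:,} trainable weights ({ratio:.2%} of the full matrix)")
```

With these assumed numbers, the adapter trains under 1% of the weights of each matrix it is attached to, which is what makes fine-tuning on top of a frozen, quantized base model practical on modest hardware.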

The main challenge was getting it to run with the 3B version.

I got the 7B version to run.

Hugging Face projects don't always have the best documentation.

Next steps: cleaning up the code and adapting it for Zephyr-3B.
