Beginner's Guide to Fine-Tuning Vision Transformers #7130
Unanswered
ellie-sleightholm
asked this question in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Fine-Tuning Vision Transformer Models - A Beginner's Guide
Introduction
I recently created an article on how you can fine-tune your own Vision Transformer models after searching the web for resources and struggling to find any. In this discussion, I've decided to summarise my article so that hopefully, beginners or others looking to fine-tune Vision Transformer models can do so with ease!
For the full code and a guided walk-through visit this article.
1. Load a Dataset
To perform fine-tuning, we will use a small image classification dataset. We’ll use the
microsoft/cats_vs_dogs
which is a collection of cat and dog images.2. Preparing the Images - ViT Image Processor
3. Processing the Dataset
4. Training and Fine-Tuning
Beta Was this translation helpful? Give feedback.
All reactions