Exercise: Transformer Representation

An exercise for utilizing features extracted from a vision transformer for downstream tasks.

This exercise has two parts. In the first part, we'll learn how to extract features for a batch of images from the DINOv2 vision transformer model, and apply dimentionality reduction and clustering on those features.
In the second part, we will train a model on top of those extracted features for the segmentation task.

Setup

All the neccessary files are included in this repo. You just need to setup the python environment by running this script:

source setup.sh

After this, make sure you are in the base environment and then run jupyter lab:

mamba activate base
jupyter lab

TA Info

To convert solutions python files into notebooks and generate the exercises, first, please install jupytext and nbconvert. Afterward, run python ./generate_exercise <input_file.py> .

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
metrics		metrics
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
generate_exercise.py		generate_exercise.py
sam_finetuning_HT.ipynb		sam_finetuning_HT.ipynb
setup.sh		setup.sh
transformer_representation_part_1_exercise.ipynb		transformer_representation_part_1_exercise.ipynb
transformer_representation_part_1_solution.ipynb		transformer_representation_part_1_solution.ipynb
transformer_representation_part_1_solution.py		transformer_representation_part_1_solution.py
transformer_representation_part_2_exercise.ipynb		transformer_representation_part_2_exercise.ipynb
transformer_representation_part_2_solution.ipynb		transformer_representation_part_2_solution.ipynb
transformer_representation_part_2_solution.py		transformer_representation_part_2_solution.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Exercise: Transformer Representation

An exercise for utilizing features extracted from a vision transformer for downstream tasks.

Setup

TA Info

About

Releases

Packages

Contributors 3

Languages

License

dl4mia/03_learned_representations

Folders and files

Latest commit

History

Repository files navigation

Exercise: Transformer Representation

An exercise for utilizing features extracted from a vision transformer for downstream tasks.

Setup

TA Info

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages