nanoViT

A minimal Vision Transformer (ViT) implementation written from scratch in PyTorch. Inspired by nanoGPT and designed to be simple and hackable.

Model code: nanovit/vit.py.

  • Verified by loading CLIP ViT-B/32 model weights.
  • ViT from Scratch (exercise): implement each module yourself in Colab.
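
As a rough illustration of what a ViT-B/32-style model consumes, the patch arithmetic can be sketched in plain Python. This is not code from nanovit/vit.py: the helper name `vit_sequence_shape` is hypothetical, and a 224×224 RGB input with a single class token is assumed (the usual setup for CLIP ViT-B/32).

```python
def vit_sequence_shape(image_size: int, patch_size: int, channels: int = 3):
    """Return (num_patches, patch_dim, num_tokens) for a square image.

    Assumes non-overlapping square patches and one prepended class token.
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    grid = image_size // patch_size                # patches per side
    num_patches = grid * grid                      # total patches
    patch_dim = patch_size * patch_size * channels # values in one flattened patch
    num_tokens = num_patches + 1                   # patches plus the class token
    return num_patches, patch_dim, num_tokens


if __name__ == "__main__":
    # Assumed ViT-B/32 input: 224x224 RGB, 32x32 patches.
    print(vit_sequence_shape(224, 32))  # (49, 3072, 50)
```

Under these assumptions the transformer sees a sequence of only 50 tokens, which is part of why the /32 variant is cheap to run compared with smaller patch sizes.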

Training script for CIFAR-100: train_cifar100.py.

  • A 2.7M-parameter ViT trains for 200 epochs in ~15 minutes on an RTX 3090 and reaches 68.5% accuracy (ViTs aren't great on small-scale datasets).
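
To get a feel for where a parameter budget of this size goes, here is a minimal counting sketch in plain Python. It assumes a standard pre-norm ViT layout (linear patch projection with bias, class token, learned position embeddings, fused QKV projection, two-layer MLP, final LayerNorm, linear head). The function `vit_param_count` and the configuration in the example are hypothetical; the settings actually used in train_cifar100.py may differ. Only the 32×32 RGB input size and the 100 classes come from CIFAR-100 itself.

```python
def vit_param_count(image_size: int, patch_size: int, channels: int,
                    dim: int, depth: int, mlp_dim: int, num_classes: int) -> int:
    """Count trainable parameters of a standard ViT (assumed layout, see text)."""
    num_patches = (image_size // patch_size) ** 2

    patch_embed = patch_size * patch_size * channels * dim + dim  # weight + bias
    cls_token = dim
    pos_embed = (num_patches + 1) * dim       # one position per token

    layer_norm = 2 * dim                      # scale + shift
    attn_qkv = 3 * dim * dim + 3 * dim        # fused Q, K, V projection
    attn_out = dim * dim + dim                # output projection
    mlp = (dim * mlp_dim + mlp_dim) + (mlp_dim * dim + dim)
    block = layer_norm + attn_qkv + attn_out + layer_norm + mlp

    head = dim * num_classes + num_classes
    return patch_embed + cls_token + pos_embed + depth * block + layer_norm + head


if __name__ == "__main__":
    # Hypothetical small config for 32x32 CIFAR-100 images.
    total = vit_param_count(image_size=32, patch_size=4, channels=3,
                            dim=192, depth=6, mlp_dim=768, num_classes=100)
    print(f"{total / 1e6:.2f}M parameters")
```

With these assumed settings the total comes out near 2.7M, and almost all of it sits in the transformer blocks; the patch embedding and classification head are comparatively tiny.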

A ConvMixer implementation (nanovit/convmixer.py) is also included as a baseline.