Repo where I'll be creating my own LLM from scratch. The goal is to deepen my understanding of model-architecture design, training methodologies, and core deep-learning principles.
Run first
export PYTHONPATH="${PYTHONPATH}:$(pwd)/src"
- Make sure to use PyTorch 2.0 or later!
More to come...