v0.1.16
Summary
- Various fixes
- Excellent work on refactoring by @polarathene
- First vision model: Phi 3 vision
What's Changed
- Implement the Phi 3 vision model by @EricLBuehler in #351
- Bump version again to 0.1.15 by @EricLBuehler in #390
- Add docs for installing huggingface-cli by @EricLBuehler in #391
- Fix metal loading issue by loading sequentially by @EricLBuehler in #394
- Fix logging in gguf and ggml by @EricLBuehler in #399
- Add fused bias linear layer with cublaslt by @EricLBuehler in #400
- docs: Resolve CI lints on docs by @polarathene in #401
- Refactor: GGUF metadata tokenizer by @polarathene in #389
- Add Nonzero layer by @EricLBuehler in #402
- Bump version to 0.1.16 by @EricLBuehler in #404
Full Changelog: v0.1.15...v0.1.16