v0.1.12
What's Changed
- Add an example by @EricLBuehler in #357
- Fix no auth token for local loading by @EricLBuehler in #360
- fix: Ensure committed files are normalized to LF by @polarathene in #361
- Fix unauth check by @EricLBuehler in #362
- Allow default unigram unk token for GGUF by @EricLBuehler in #363
- Disable cublaslt if using f16 kernels by @EricLBuehler in #359
- refactor: GGUF + GGML Loaders with ModelKind by @polarathene in #356
- Clamp n device layers to n model layers by @EricLBuehler in #367
- Bump version to 0.1.12 by @EricLBuehler in #368
Full Changelog: v0.1.11...v0.1.12