v0.1.21
What's Changed
- Expose idefics2 loader by @EricLBuehler in #450
- Try auto dtypes based on compute cap by @EricLBuehler in #453
- Fix dtype error for logit bias by @EricLBuehler in #454
- Fix sequence prompt len for Phi3-V by @EricLBuehler in #455
- Tune threshold for matmul via f16 by @EricLBuehler in #457
- Improve short/long scaling precision for LongRope by @EricLBuehler in #458
- Fix LongRope models position ids calculation by @EricLBuehler in #459
- Update deps by @EricLBuehler in #460
- Improve handling of errors in auto dtype selection by @EricLBuehler in #461
- Add support for cross-gpu device mapping by @EricLBuehler in #462
- Bump version to 0.1.21 by @EricLBuehler in #463
Full Changelog: v0.1.20...v0.1.21