v0.0.0beta19
yunfeng-scale
released this
13 Oct 04:50
·
283 commits
to main
since this release
What's Changed
- Increase graceful timeout and hardcode AWS_PROFILE by @squeakymouse in #306
- bump pypi version by @ian-scale in #303
- Ianmacleod/add mistral by @ian-scale in #307
- Ianmacleod/add falcon 180b by @ian-scale in #309
- update 180b inference framework by @ian-scale in #310
- Adding code llama to TGI by @mfagundo-scale in #311
- Add AWQ enum by @yunfeng-scale in #317
- Fix documentation to reference Files API by @squeakymouse in #312
- Return TGI errors by @yunfeng-scale in #313
- Fix streaming endpoint failure handling by @yunfeng-scale in #314
- Validate quantization by @yunfeng-scale in #315
- Properly return PENDING status for docker image batch jobs/fine tune jobs by @seanshi-scale in #318
- add user_id and team_id as log facets by @song-william in #321
- publish 0.0.0b19 by @yunfeng-scale in #322
New Contributors
- @mfagundo-scale made their first contribution in #311
Full Changelog: v0.0.0beta18...v0.0.0beta19