Fix `InferenceEndpointsLLM` not using cached token #690

gabrielmbmb · 2024-06-03T11:01:36Z

Description

`InferenceEndpointsLLM` was not automatically using the cached Hugging Face token (the one stored locally by `huggingface-cli login`).
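For illustration, a minimal stdlib-only sketch of the kind of fallback this implies: use an explicit key if given, else an environment variable, else the token file written by `huggingface-cli login`. This is not the code in this PR. The function name `resolve_hf_token`, the precedence order, and the default location `~/.cache/huggingface/token` (or `$HF_HOME/token`) are assumptions.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def resolve_hf_token(
    api_key: Optional[str] = None,
    env: Optional[Mapping[str, str]] = None,
    token_path: Optional[Path] = None,
) -> Optional[str]:
    """Hypothetical helper: return the first available Hugging Face token, or None."""
    env = os.environ if env is None else env

    # 1. An explicitly provided key always wins.
    if api_key:
        return api_key

    # 2. Then the HF_TOKEN environment variable.
    if env.get("HF_TOKEN"):
        return env["HF_TOKEN"]

    # 3. Finally the cached token file (assumed default location).
    if token_path is None:
        hf_home = env.get("HF_HOME") or str(Path.home() / ".cache" / "huggingface")
        token_path = Path(hf_home) / "token"
    if token_path.is_file():
        token = token_path.read_text().strip()
        return token or None

    return None
```

Calling `resolve_hf_token()` with no arguments would then pick up the cached token when neither an explicit key nor `HF_TOKEN` is set.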

In addition, this PR updates `AsyncLLM.__del__` to close the async event loop only if it was newly created by the `AsyncLLM`. Otherwise, calling `close` could raise a `RuntimeError` when the `AsyncLLM` didn't create the loop, because the loop is managed by another library that might still be using it (such as `pytest-asyncio`).
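A minimal sketch of that ownership rule, assuming a hypothetical class `LoopOwner` with a `_created_loop` flag (neither name is taken from distilabel): the object records whether it created the loop and only closes it in that case. Calling `close()` on a loop that is currently running raises `RuntimeError`, which is why a borrowed loop is left alone.

```python
import asyncio
from typing import Optional


class LoopOwner:
    """Hypothetical stand-in for an AsyncLLM-like class (not distilabel's code)."""

    def __init__(self) -> None:
        self._event_loop: Optional[asyncio.AbstractEventLoop] = None
        self._created_loop = False

    @property
    def event_loop(self) -> asyncio.AbstractEventLoop:
        if self._event_loop is None:
            try:
                # Reuse a loop that someone else is already running.
                self._event_loop = asyncio.get_running_loop()
            except RuntimeError:
                # No running loop: create one and remember that we own it.
                self._event_loop = asyncio.new_event_loop()
                self._created_loop = True
        return self._event_loop

    def close(self) -> None:
        # Only close a loop this object created; a borrowed loop may still be in use.
        if (
            self._created_loop
            and self._event_loop is not None
            and not self._event_loop.is_closed()
        ):
            self._event_loop.close()

    def __del__(self) -> None:
        self.close()
```

When the object is created inside a coroutine, for example under a test runner that manages its own loop, `event_loop` returns the running loop and `close()` does nothing.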

codspeed-hq · 2024-06-03T11:06:49Z

CodSpeed Performance Report

Merging #690 will not alter performance

Comparing `fix-inference-endpoints-llm-cached-token` (27241da) with `develop` (918c19f)

Summary

✅ 1 untouched benchmarks

gabrielmbmb added 2 commits June 3, 2024 11:21

Fix RuntimeError closing event loop if not created by AsyncLLM

95145de

Update InferenceEndpointsLLM so it uses cached token

a89b9b8

gabrielmbmb added the fix label Jun 3, 2024

gabrielmbmb added this to the 1.2.0 milestone Jun 3, 2024

gabrielmbmb requested a review from alvarobartt June 3, 2024 11:01

gabrielmbmb self-assigned this Jun 3, 2024

alvarobartt approved these changes Jun 3, 2024

Fix test

d55d69f

Merge branch 'develop' into fix-inference-endpoints-llm-cached-token

27241da

gabrielmbmb merged commit e61b598 into develop Jun 3, 2024
7 checks passed

gabrielmbmb deleted the fix-inference-endpoints-llm-cached-token branch June 3, 2024 11:17
