[FEATURE] Update `InferenceClient` to use `chat.client_completion` instead of `apply_chat_template` #772

plaguss · 2024-07-04T13:34:33Z

Is your feature request related to a problem? Please describe.
TGI and huggingface_hub added support for chat completion, removing the need of using the tokenizer to format the prompt correctly.

Describe the solution you'd like
We can use client.chat_completion instead of apply_chat_template as documented here

Describe alternatives you've considered
Maintain as it is.

Additional context
Thanks @osanseviero for the hint.

The text was updated successfully, but these errors were encountered:

plaguss · 2024-07-18T07:00:04Z

Closed with #778

plaguss added improvement refactor good first issue Good for newcomers labels Jul 4, 2024

gabrielmbmb self-assigned this Jul 10, 2024

gabrielmbmb added this to the 1.3.0 milestone Jul 10, 2024

plaguss closed this as completed Jul 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE] Update `InferenceClient` to use `chat.client_completion` instead of `apply_chat_template` #772

[FEATURE] Update `InferenceClient` to use `chat.client_completion` instead of `apply_chat_template` #772

plaguss commented Jul 4, 2024

plaguss commented Jul 18, 2024

[FEATURE] Update InferenceClient to use chat.client_completion instead of apply_chat_template #772

[FEATURE] Update InferenceClient to use chat.client_completion instead of apply_chat_template #772

Comments

plaguss commented Jul 4, 2024

plaguss commented Jul 18, 2024

[FEATURE] Update `InferenceClient` to use `chat.client_completion` instead of `apply_chat_template` #772

[FEATURE] Update `InferenceClient` to use `chat.client_completion` instead of `apply_chat_template` #772