Llama Text Templater #715
Merged: 3 commits, May 10, 2024
Conversation

martindevans (Member)

  • Added LLamaTemplate which efficiently formats a series of messages according to the model template.
  • Fixed llama_chat_apply_template method (wrong entrypoint, couldn't handle null model)

This depends on #712; review and merge that one first!
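
For readers unfamiliar with the feature, here is a minimal usage sketch of the kind of API this adds. The `Add`/`Apply` method names and the overall shape are assumptions for illustration, not taken from the PR diff:

```csharp
// Hypothetical sketch of using a message templater like LLamaTemplate.
// Method names (Add, Apply) and return types are assumed for illustration.
var template = new LLamaTemplate(model);

// Append messages in order; the model's built-in chat template
// (e.g. ChatML, Llama-2 [INST] blocks) decides the final formatting.
template.Add("system", "You are a helpful assistant.");
template.Add("user", "What is the capital of France?");

// Apply the template to get the formatted prompt, ready to tokenize.
var promptBytes = template.Apply();
var promptText = System.Text.Encoding.UTF8.GetString(promptBytes);
```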

@AsakusaRinne (Collaborator) left a comment


This is exactly what's been lacking in LLamaSharp! Do you have any further plans for the development of the template? I actually posted function calling as one of the OSPP projects for LLamaSharp (OSPP is the program I once invited you to on Discord). Since templating is one of the basic components of function calling, you could open some good-first-issues to let the student work on it if you'd like. :)

LLama/LLamaTemplate.cs: 3 review threads (outdated, resolved)
@martindevans (Member, Author)

Do you have any further plans for the development of the template?

I think I'll probably look at making some enhancements to llama.cpp, and then come back to support them in LLamaSharp.

At the moment the template converts all messages into text, and then that text is tokenized in one go. However, this doesn't seem good enough: the templated text must be tokenized with special=true (so that all the special tokens in the template are handled properly), but user messages really shouldn't be tokenized with special=true (to ensure a user can't inject e.g. [INST]new system prompt[/INST] in the middle of an ordinary message).

I'm going to see if I can PR a change into llama.cpp to run the tokenization differently for different bits.
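
As a rough illustration of the concern above, assuming a `Tokenize` overload with `addBos`/`special` parameters (exact names and signatures may differ from the real LLamaSharp API):

```csharp
// Sketch only: tokenize template scaffolding with special tokens enabled,
// but tokenize raw user content with special tokens disabled, so text like
// "[INST]new system prompt[/INST]" inside a user message stays plain text
// instead of being turned into control tokens.
var prefixTokens = context.Tokenize("[INST] ", addBos: true, special: true);
var userTokens   = context.Tokenize(userMessage, addBos: false, special: false);
var suffixTokens = context.Tokenize(" [/INST]", addBos: false, special: true);
```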

…s according to the model template.

 - Fixed `llama_chat_apply_template` method (wrong entrypoint, couldn't handle null model)
 - Returning template for chaining method calls
 - Returning a `TextMessage` object instead of a tuple
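
The "chaining" change mentioned in the commit notes presumably enables a fluent style along these lines (a sketch under assumed names, not the exact API):

```csharp
// Because Add returns the template itself, calls can be chained.
// TextMessage is the object mentioned above that replaces the tuple.
var prompt = new LLamaTemplate(model)
    .Add("system", "You are a helpful assistant.")
    .Add("user", "Summarise this pull request in one sentence.")
    .Apply();
```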
@martindevans (Member, Author)

I've rebased this one onto master, so it can be merged independently of #712 since it seems like that other PR is going to be delayed.

@martindevans martindevans merged commit 44bd5b3 into SciSharp:master May 10, 2024
6 checks passed
@martindevans martindevans deleted the llama-templater branch May 10, 2024 14:10