
[IMPLEMENTATION] magpie #740

Closed · gabrielmbmb opened this issue Jun 18, 2024 · 2 comments · Fixed by #778
Labels: enhancement (New feature or request)
Milestone: 1.3.0

Comments

@gabrielmbmb (Member)

No description provided.

@gabrielmbmb gabrielmbmb added the enhancement New feature or request label Jun 21, 2024
@gabrielmbmb gabrielmbmb added this to the 1.3.0 milestone Jun 21, 2024
@fpreiss (Contributor) commented Jun 24, 2024

I have tried to implement the prompting strategy of the Magpie paper using distilabel's Ollama integration and noticed that the current implementation does not allow me to override the chat template. I believe the /api/generate endpoint would need to be wrapped instead of the /api/chat endpoint. I had some success with the following:

from typing import Any, Literal

from ollama import Options

# Import paths correspond to distilabel 1.x; adjust if your version differs.
from distilabel.llms import OllamaLLM
from distilabel.llms.typing import GenerateOutput
from distilabel.steps.tasks.typing import StandardInput

# Ollama model tag (assumed; use whatever tag you have pulled locally).
LLAMA3_8B = "llama3:8b"

TEMPLATE_OVERRIDES: dict[str, str] = {
    # https://llama.meta.com/docs/model-cards-and-prompt-formats/meta-llama-3/#special-tokens-used-with-meta-llama-3
    LLAMA3_8B: "<|eot_id|><|start_header_id|>user<|end_header_id|>\n\n"
}


class OllamaMagpieLLM(OllamaLLM):
    """Magpie compatibility layer for Ollama."""

    async def agenerate(
        self,
        input: StandardInput,
        format: Literal["", "json"] = "",
        # TODO: include relevant options from `Options` in `agenerate` method.
        options: Options | None = None,
        keep_alive: bool | None = None,
    ) -> GenerateOutput:
        """Override of the `OllamaLLM.agenerate` method make Ollama fill the user message.

        The original implementation uses Ollama's chat endpoint instead of the generate endpoint.
        This simplifies implementing multi-turn conversations, but we can't manipulate the prompt template.
        """
        try:
            # TODO: needs some work for multi-turn support.
            prompt = input[0]["content"]
            completion: dict[str, Any] = await self._aclient.generate(
                prompt=prompt,
                model=self.model,
                template=TEMPLATE_OVERRIDES[self.model],
                stream=False,
                format=format,
                options=options,
                keep_alive=keep_alive,
            )
            return [completion["response"]]
        except Exception as e:
            self._logger.warning(
                f"⚠️ Received no response using Ollama client (model: '{self.model_name}')."
                f" Finish reason was: {e}"
            )
            # Return a null generation so callers still get a list back.
            return [None]

Note that, as of writing this, the prompt in the generate call has to be a non-empty string in order to generate the user instructions as outlined in the paper; this seems to be an issue on Ollama's/llama.cpp's side.
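For reference, a minimal usage sketch of the compatibility layer above (the model tag and the single-space placeholder prompt are assumptions of my local setup, with Ollama running on the default host):

import asyncio

llm = OllamaMagpieLLM(model=LLAMA3_8B)
llm.load()

async def sample_instruction() -> str:
    # Pass a single space as the content, since an empty string currently returns nothing.
    generation = await llm.agenerate(input=[{"role": "user", "content": " "}])
    return generation[0]

print(asyncio.run(sample_instruction()))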

@gabrielmbmb gabrielmbmb linked a pull request Jul 11, 2024 that will close this issue
@gabrielmbmb (Member, Author)

Hi @fpreiss, for now we have implemented Magpie for TransformersLLM, InferenceEndpointsLLM and vLLM. We will work on adding compatibility for the rest of the LLMs in the next release.
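For anyone landing here, a rough sketch of the Magpie flow with one of the supported LLMs; class and parameter names such as MagpieGenerator and magpie_pre_query_template follow the API introduced in the linked PR and may differ slightly between releases:

from distilabel.llms import InferenceEndpointsLLM
from distilabel.pipeline import Pipeline
from distilabel.steps.tasks import MagpieGenerator

with Pipeline(name="magpie-demo") as pipeline:
    generator = MagpieGenerator(
        llm=InferenceEndpointsLLM(
            model_id="meta-llama/Meta-Llama-3-8B-Instruct",
            tokenizer_id="meta-llama/Meta-Llama-3-8B-Instruct",
            # Pre-query template that makes the model generate the user turn itself.
            magpie_pre_query_template="llama3",
        ),
        n_turns=1,    # single instruction/response pairs
        num_rows=10,  # number of conversations to generate
    )

if __name__ == "__main__":
    distiset = pipeline.run()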
