feat: Add chat completion mutation #5255

anticorrelator · 2024-11-01T14:58:03Z

resolves #4774

Adds a playgroundChatCompletion mutation that returns a FinishedChatCompletion and accepts the same payload as the chatCompletion subscription

For simplicity, for now it simply reuses the streaming interfaces we've used, collates the results and returns them all at the end

Parker-Stafford · 2024-11-01T15:26:12Z

src/phoenix/server/api/mutations/playground_chat_mutations.py

+        provider_key = input.model.provider_key
+        llm_client_class = PLAYGROUND_CLIENT_REGISTRY.get_client(provider_key, input.model.name)
+        if llm_client_class is None:
+            raise BadRequest(f"No LLM client registered for provider '{provider_key}'")


can we get this returned as an error so it's easier to display in the front end, the FinishedChatCompletion type has an errors field right?

BadRequest is a custom graphql error, so it should be displayable in the frontend

Relay has an onError hook https://relay.dev/docs/api-reference/use-mutation/#return-value

yeah it just requires a whole lot of parsing to get the real error from the relay error

i see. probably a broader discussion if we want to add error types on mutations / query. we've avoided that pattern for the sake of simplicity so far.

don't we have that on the streaming version of this though? an error member of the union, i guess that is technically not mutation or query (subscription) but was thinking it would set the precedent here

but yes don't need to use it here just saying it can be helpful for handling known errors more gracefully for us, I know it's been shared before but this is the inspiration https://sachee.medium.com/200-ok-error-handling-in-graphql-7ec869aec9bc
and
https://productionreadygraphql.com/2020-08-01-guide-to-graphql-errors

don't we have that on the streaming version of this though? an error member of the union, i guess that is technically not mutation or query (subscription) but was thinking it would set the precedent here

the error types in the subscription were unavoidable because we need to continue yielding payloads (FinishedChatCompletion in particular) even after an error has occurred. We don't currently have the same pattern on mutations and queries afaik

ahh okay, i didn't know that was why those were added, we can punt it for now, we need to parse errors from relay anyways for various other pages so will make a ticket to get that working

Parker-Stafford · 2024-11-01T15:27:06Z

src/phoenix/server/api/mutations/playground_chat_mutations.py

+@strawberry.type
+class PlaygroundChatCompletionMutationMixin:
+    @strawberry.mutation
+    async def playground_chat_completion(


nit: I know lots of other things are named as playground in here, any reason they would not be usable for generating a chat completion outside of the playground? would probably prefer to name it something like generate_chat_completion unless there is a reason we think this won't work outside of hte playground

there are a ton of playground-specific type definitions and I think our codebase has a lot of generic names that aren't actually reusable

Yeah that makes a ton of sense was just curious but feel free to keep it!

I think this should be chat_completion since it is part of the GraphQL API. A consumer of the GraphQL API could use this resolver to do a chat completion regardless of whether they are in playground.

Parker-Stafford · 2024-11-01T15:28:21Z

src/phoenix/server/api/mutations/playground_chat_mutations.py

+
+        info.context.event_queue.put(SpanInsertEvent(ids=(span.project_id,)))
+
+        return span.finished_chat_completion


oh interesting is this just called span because it's named that on line 73, feels a bit weird to have the chat completion attached to a span

this is the way the streaming_llm_span context manager works atm. it's changed in my open pr

axiomofjoy · 2024-11-01T17:55:33Z

src/phoenix/server/api/mutations/playground_chat_mutations.py

+            chunks = []
+            async for chunk in llm_client.chat_completion_create(
+                messages=messages, tools=input.tools or [], **invocation_parameters
+            ):
+                span.add_response_chunk(chunk)
+                chunks.append(chunk)


Suggested change

chunks = []

async for chunk in llm_client.chat_completion_create(

messages=messages, tools=input.tools or [], **invocation_parameters

):

span.add_response_chunk(chunk)

chunks.append(chunk)

async for chunk in llm_client.chat_completion_create(

messages=messages, tools=input.tools or [], **invocation_parameters

):

span.add_response_chunk(chunk)

Parker-Stafford · 2024-11-01T18:01:26Z

src/phoenix/server/api/mutations/playground_chat_mutations.py

+    @strawberry.mutation
+    async def playground_chat_completion(
+        self, info: Info[Context, None], input: ChatCompletionInput
+    ) -> FinishedChatCompletion:


I think this return type needs to be something like

@strawberry.type class ChatCompletionText: content: str span: Span @strawberry.type class ChatCompletionError: message: str span: Span # maybe don't need this or definitely optional @strawberry.type class ChatCompletionToolCall: id: str function: FunctionCall # this is just json i think span: Span ChatCompletionMutationPayload: TypeAlias = Annotated[ Union[ChatCompletionText, ChatCompletionToolCall, ChatCompletionError], strawberry.union("ChatCompletionMutationPayload"), ]

not just the span, if i'm understanding correctly

Parker-Stafford · 2024-11-01T20:05:54Z

src/phoenix/server/api/mutations/playground_chat_mutations.py

+class ChatCompletionResult:
+    content: Optional[str]
+    tool_calls: List[ChatCompletionToolCall]
+    span: Span
+    error_message: Optional[str]


Any reason to not mke this a union, also do tool_calls need to be Optional?

Parker-Stafford · 2024-11-01T20:06:20Z

src/phoenix/server/api/mutations/playground_chat_mutations.py

+class ChatCompletionToolCall:
+    id: str
+    function: strawberry.scalars.JSON
+    span: Optional[Span]


if this is not apart of a union but an optional portion of the result type, it doesn't need the sapn since the span will be on the top level result

anticorrelator added 2 commits November 1, 2024 10:56

Add chat completion mutation

e6c6607

Clarify mutation name

78090fc

dosubot bot added the size:L This PR changes 100-499 lines, ignoring generated files. label Nov 1, 2024

Parker-Stafford reviewed Nov 1, 2024

View reviewed changes

Parker-Stafford approved these changes Nov 1, 2024

View reviewed changes

axiomofjoy reviewed Nov 1, 2024

View reviewed changes

Parker-Stafford reviewed Nov 1, 2024

View reviewed changes

Refactor output types

d25744f

Parker-Stafford reviewed Nov 1, 2024

View reviewed changes

Improve payload output type

af08ca1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add chat completion mutation #5255

feat: Add chat completion mutation #5255

anticorrelator commented Nov 1, 2024 •

edited

Loading

Parker-Stafford Nov 1, 2024

axiomofjoy Nov 1, 2024

axiomofjoy Nov 1, 2024

Parker-Stafford Nov 1, 2024

axiomofjoy Nov 1, 2024

Parker-Stafford Nov 1, 2024 •

edited

Loading

Parker-Stafford Nov 1, 2024 •

edited

Loading

axiomofjoy Nov 1, 2024

Parker-Stafford Nov 1, 2024

Parker-Stafford Nov 1, 2024

anticorrelator Nov 1, 2024

Parker-Stafford Nov 1, 2024

axiomofjoy Nov 1, 2024

Parker-Stafford Nov 1, 2024

axiomofjoy Nov 1, 2024

axiomofjoy Nov 1, 2024

Parker-Stafford Nov 1, 2024

axiomofjoy Nov 1, 2024

Parker-Stafford Nov 1, 2024

Parker-Stafford Nov 1, 2024


		info.context.event_queue.put(SpanInsertEvent(ids=(span.project_id,)))

		return span.finished_chat_completion

feat: Add chat completion mutation #5255

Are you sure you want to change the base?

feat: Add chat completion mutation #5255

Conversation

anticorrelator commented Nov 1, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Parker-Stafford Nov 1, 2024 • edited Loading

Choose a reason for hiding this comment

Parker-Stafford Nov 1, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

anticorrelator commented Nov 1, 2024 •

edited

Loading

Parker-Stafford Nov 1, 2024 •

edited

Loading

Parker-Stafford Nov 1, 2024 •

edited

Loading