Add req_perform_open, which makes resp$body the underlying stream #521

Merged (27 commits) on Aug 28, 2024

Conversation

@jcheng5 (Member) commented Aug 27, 2024

Alternative, lower-level approach to #520. SSE support could be built on top; I'll take a stab at that too.

Example usage:

library(httr2)

# Define the API endpoint and your API key
api_endpoint <- "https://api.openai.com/v1/chat/completions"
api_key <- Sys.getenv("OPENAI_API_KEY")

# Build the request
response <- request(api_endpoint) %>%
  req_headers(
    "Content-Type" = "application/json",
    "Authorization" = paste("Bearer", api_key)
  ) %>%
  req_body_json(list(
    model = "gpt-4o",
    temperature = 0.7,
    stream = TRUE,
    messages = list(
      list(
        role = "system",
        content = "You're an incredibly verbose assistant."
      ),
      list(
        role = "user",
        content = "When did the modern Olympics start?"
      )
    )
  )) %>%
  req_perform_open()

while (isIncomplete(response$body)) {
  line <- readLines(response$body, n = 1)
  if (length(line) > 0) {
    message(line)
  } else {
    # If no data was available, wait a bit
    message("SLEEPING...\n")
    Sys.sleep(0.25)
  }
}

close(response$body)

blocking = TRUE greatly simplifies cases where you don't have better
things to do while you're waiting (i.e. most R use cases that are
not Shiny or plumber?)
(Review comment on R/req-perform-stream.R; outdated, resolved.)
@jcheng5 (Member, Author) commented Aug 28, 2024

SSE equivalent usage:

library(httr2)

# Define the API endpoint and your API key
api_endpoint <- "https://api.openai.com/v1/chat/completions"
api_key <- Sys.getenv("OPENAI_API_KEY")

# Build the request
response <- request(api_endpoint) %>%
  req_headers(
    "Content-Type" = "application/json",
    "Authorization" = paste("Bearer", api_key)
  ) %>%
  req_body_json(list(
    model = "gpt-4o",
    temperature = 0.7,
    stream = TRUE,
    messages = list(
      list(
        role = "system",
        content = "You're an incredibly verbose assistant."
      ),
      list(
        role = "user",
        content = "When did the modern Olympics start?"
      )
    )
  )) %>%
  req_perform_open(blocking = FALSE)

while (isIncomplete(response$body)) {
  msg <- read_sse(response$body)
  if (!is.null(msg)) {
    cat(msg$data)
    cat("\n")
  } else {
    message("SLEEPING...\n")
    Sys.sleep(0.25)
  }
}

close(response$body)

@jcheng5 (Member, Author) commented Aug 28, 2024

BTW, req_perform_connection(blocking=TRUE) and blocking=FALSE both make sense for both the readLines and read_sse scenarios. Use the former when you don't have anything better to do but wait, and you'll get the answer back with the least possible delay and maximum efficiency (no "busy wait"). Use the latter when you want to do other things while you wait.

Ideally we'd eventually have a true callback-based async version that is both efficient and nonblocking.
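For contrast, the blocking variant of the earlier consumption loop could be sketched like this (assuming `req` is the request built above; with `blocking = TRUE` the `Sys.sleep()` busy-wait branch is unnecessary, since `readLines()` simply waits until data arrives):

```r
# Sketch: blocking consumption. readLines() waits until a line (or EOF)
# is available, so there is no need to poll and sleep.
response <- req_perform_open(req, blocking = TRUE)

while (isIncomplete(response$body)) {
  line <- readLines(response$body, n = 1)
  if (length(line) > 0) {
    message(line)
  }
}

close(response$body)
```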

#' @export
#' @rdname req_perform_connection
close.httr2_response <- function(con, ...) {
check_streaming_response(con)
@jcheng5 (Member, Author) commented on the diff above:
Does this mean if you call close(resp) on a non-streaming response, you'll get an error? If so, I think a no-op would be better; as a general rule I try not to throw in cleanup code since it's so often invoked in places where errors are not expected or hard to deal with (like in a finally or a gc finalizer).

(Member) replied:

Yeah makes sense.
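A no-op along those lines might be sketched as follows (illustrative only, not the merged implementation; the `inherits()`/`isOpen()` check is an assumption about how a streaming body is represented):

```r
#' @export
close.httr2_response <- function(con, ...) {
  # Sketch: close the body only if it is an open connection; otherwise do
  # nothing, so close() is safe in cleanup paths (on.exit(), finalizers).
  if (inherits(con$body, "connection") && isOpen(con$body)) {
    close(con$body)
  }
  invisible(NULL)
}
```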

@jcheng5 (Member, Author) commented Aug 28, 2024

One issue we must address before merging is that resp_stream_sse only works if response$body was opened in text mode. This is because it uses pushBack, which only works on text connections. We could avoid this restriction by implementing our own pushBack. What I did locally instead was to add a mode argument to req_perform_connection (and, per the specification, resp_stream_sse is always UTF-8) and make sure to use "rt" when you know you're going to call resp_stream_sse.
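To illustrate the "implement our own pushBack" alternative, a minimal byte-level push-back buffer might look like this (a hypothetical sketch; `make_pushback()` and its method names are invented here, and the PR instead went with a `mode` argument):

```r
# Hypothetical byte-level push-back buffer, which would work on binary
# ("rb") connections where base R's pushBack() does not.
make_pushback <- function(con) {
  buf <- raw()
  list(
    read = function(n) {
      if (length(buf) > 0) {
        take <- buf[seq_len(min(n, length(buf)))]
        buf <<- buf[-seq_along(take)]
        take
      } else {
        readBin(con, what = "raw", n = n)
      }
    },
    unread = function(bytes) {
      buf <<- c(as.raw(bytes), buf)
    }
  )
}
```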

@hadley (Member) commented Aug 28, 2024

And at a more meta level, we need to add some tests for resp_stream_sse() too. I think I should have time this afternoon.

@jcheng5 (Member, Author) commented Aug 28, 2024

I'm not saying we need to implement this or, even if we want to, that it needs to be part of this PR, but I just wanted to point out two things about the SSE spec:

  1. There is a standard JavaScript API for consuming SSE, a transliteration into R would look something like:
es <- EventSource$new("http://localhost:5000")
es$addEventListener("message", \(e) { message("Message received: ", e$data) })
...
es$close()

I wonder if a high-level SSE wrapper like this (or a more idiomatic version) is something httr2 users would expect to see and/or find useful.

  2. The SSE spec has a mechanism for (client-initiated) reconnecting of dropped connections, including tracking the last event that was successfully received and reporting that during reconnection. With this PR as-is, you could do it yourself, but it'd be a bit of an exercise.
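For concreteness, a client-initiated reconnection loop of that sort might be sketched as follows (hypothetical: it assumes `read_sse()` as used above, that parsed events carry an `id` field, and a naive always-retry policy):

```r
library(httr2)

last_event_id <- NULL
done <- FALSE

while (!done) {
  req <- request("http://localhost:5000")
  if (!is.null(last_event_id)) {
    # Per the SSE spec, report the last event seen so the server can resume.
    req <- req_headers(req, `Last-Event-ID` = last_event_id)
  }
  resp <- req_perform_open(req, blocking = FALSE)

  done <- tryCatch({
    while (isIncomplete(resp$body)) {
      msg <- read_sse(resp$body)
      if (is.null(msg)) {
        Sys.sleep(0.25)
        next
      }
      if (!is.null(msg$id)) last_event_id <- msg$id
      cat(msg$data, "\n")
    }
    TRUE  # stream ended cleanly
  }, error = function(e) {
    message("Connection dropped; reconnecting...")
    FALSE
  })

  try(close(resp$body), silent = TRUE)
}
```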

jcheng5 added a commit to jcheng5/r-sidebot that referenced this pull request on Aug 28, 2024:

* Requires r-lib/httr2#521
* Explain plot feature is currently broken

hadley marked this pull request as ready for review on August 28, 2024 at 20:46.
@hadley (Member) commented Aug 28, 2024

No one else has asked for SSE support, so I don't think we need to do prospective work, but I'm certainly happy to continue to build on this API as we discover more about what we need.

I'm planning on merging this PR today.

hadley merged commit 1c17dda into main on Aug 28, 2024; 13 checks passed. hadley deleted the expose-curl-stream branch on August 28, 2024 at 21:29.
@jcheng5 (Member, Author) commented Aug 28, 2024

Looks great, thanks!
