
Proper SSE parsing #338

Merged: 2 commits into alexrudall:main on Oct 30, 2023

Conversation

@atesgoral (Contributor) commented Oct 16, 2023

Follow-up to #332

  1. Switch to using event_stream_parser, a spec-compliant event stream parser (that I recently published).
  2. The stream proc now receives an optional second argument carrying any errors encountered during parsing. Consumers should know about these errors instead of this library silently ignoring them (see the sketch after this list).
  3. Explicitly check for "[DONE]" chunks.
  4. Remove bits about errors OpenAI does not send.
  5. Tweak the bit about token usage in streams (it's not a public feature yet).
  6. Check the HTTP response code and only try to parse the body as an error JSON if it's not OK.
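As a minimal sketch of item 2 (usage is hypothetical; the argument names are illustrative), the second argument would only be set when the parser reports a problem:

    client = OpenAI::Client.new

    client.chat(
      parameters: {
        model: "gpt-4",
        messages: [{ role: "user", content: "Hello!" }],
        stream: proc do |chunk, parse_error|
          if parse_error
            warn "Stream parsing error: #{parse_error}" # e.g. a JSON::ParserError
          else
            print chunk.dig("choices", 0, "delta", "content")
          end
        end
      }
    )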

All Submissions:

  • Have you followed the guidelines in our Contributing document?
  • Have you checked to ensure there aren't other open Pull Requests for the same update/change?
  • Have you added an explanation of what your changes do and why you'd like us to include them?

@ScotterC

Thanks for this. Can you link the EventStreamParser gem? I wasn't able to see it in your GitHub, but I read through it locally. I've merged this into my fork and am trying it out in production.

@atesgoral (Contributor, Author)

@ScotterC It's publicly published: https://rubygems.org/gems/event_stream_parser

@andresgutgon

This code is not public yet, no? I see a 404 when I click here.

@atesgoral (Contributor, Author)

@andresgutgon Ah, my apologies. I forgot about making the repo public. It is now :)

@smojtabai commented Oct 17, 2023

Hey @atesgoral, I was testing this, but I think it still swallows errors; not sure if this is a spec issue with OpenAI or what:

    OpenAI::Client.new.chat(
      parameters: {
        model: "gpt-4",
        messages: [{ role: "garbage", content: "hello" }],
        temperature: 0,
        stream: proc do |chunk, error|
          puts "inside stream"
          puts chunk
          puts error
        end
      }
    )

No error is returned and nothing happens in the block.
It looks like OpenAI returns an entire error JSON object and https://github.com/Shopify/event_stream_parser/blob/main/lib/event_stream_parser.rb skips it?

@atesgoral (Contributor, Author)

@smojtabai event_stream_parser is a pure parser of event streams and is agnostic of OpenAI. It's not swallowing any events.

It seems ruby-openai isn't checking the HTTP response code before processing the response as an event stream. In case of errors, OpenAI returns a plain JSON response, not an event stream. I just realized this, and I'll see if I can forward-fix it in this PR. Thanks for raising it.
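For reference, those non-streaming error bodies are shaped roughly like this (field names per OpenAI's documented error format; the values here are placeholders):

    {
      "error": {
        "message": "Invalid value for 'role'",
        "type": "invalid_request_error",
        "param": "messages.0.role",
        "code": null
      }
    }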

@smojtabai

@atesgoral ah yes, you are correct; I misunderstood what was going on there.

#328 looks like it tried to contribute a fix but was never merged or looked at. Unsure if @alexrudall has any thoughts or whether it will be merged?

@atesgoral (Contributor, Author) commented Oct 17, 2023

With a one-line addition here:

    Faraday.new do |f|
      f.options[:timeout] = @request_timeout
      f.request(:multipart) if multipart
      f.response :raise_error # the one-line addition
    end

it is possible to mount the RaiseError middleware so that 4xx and 5xx responses percolate all the way up, but it's better to catch 4xx errors locally to be able to grab the JSON OpenAI returns in the body.
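For illustration, with that middleware mounted, a caller could rescue the error classes Faraday raises (a sketch, not part of this PR; `messages` is assumed to be defined elsewhere):

    begin
      client.chat(parameters: { model: "gpt-4", messages: messages })
    rescue Faraday::BadRequestError => e
      # RaiseError exposes the response as a hash with :status, :headers and :body.
      warn "OpenAI returned #{e.response[:status]}: #{e.response[:body]}"
    end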

I'll take a look at #328 to see if I can find inspiration there.

@andresgutgon

This is looking fantastic. Looking forward to seeing this finished. Great work @atesgoral 👏

@atesgoral (Contributor, Author)

Complication: when the RaiseError middleware is enabled, Faraday bails out early (as it should) and never calls the on_data block. So, in streaming mode, there's no way to access the response body in the presence of that stock middleware.

Ignoring the middleware, I couldn't get access to Faraday's env argument in the on_data block for some reason (maybe it's not in the Faraday version this gem relies on?). If I could, I could check the HTTP response code within the on_data block while also collecting response chunks.

Contemplating creating a streaming-friendly version of the RaiseError middleware (and/or upgrading Faraday if needed).
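Purely as a sketch of what that contemplated middleware could look like (hypothetical class, not in this PR): skip raising whenever an on_data handler is attached, leaving status checks to the streaming code.

    class StreamFriendlyRaiseError < Faraday::Response::RaiseError
      def on_complete(env)
        # Don't raise for streamed responses, so on_data still sees the body;
        # the streaming code checks env.status itself.
        super unless env.request.on_data
      end
    end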

@smojtabai commented Oct 18, 2023

Hmmm, I think upgrading Faraday would be the best-looking solution. As a hack, I tried monkey patching the HTTP json_post method with this:

    def json_post(path:, parameters:)
      stream_body = +""
      is_streaming = parameters[:stream].respond_to?(:call)
      response = conn.post(uri(path: path)) do |req|
        if is_streaming
          to_json_stream_proc = to_json_stream(user_proc: parameters[:stream])
          req.options.on_data = proc do |chunk, chunk_size|
            stream_body << chunk # accumulate the raw body in case it's an error payload
            to_json_stream_proc.call(chunk, chunk_size)
          end

          parameters[:stream] = true # Necessary to tell OpenAI to stream.
        elsif parameters[:stream]
          raise ArgumentError, "The stream parameter must be a Proc or have a #call method"
        end

        req.headers = headers
        req.body = parameters.to_json
      end
      # On an error status, the "stream" was really a JSON error body.
      if is_streaming && response.status >= 400
        to_json(stream_body)
      else
        to_json(response&.body)
      end
    end

For now it sort of keeps the same return signature on errors; I guess we could even call the user function instead this way if needed. Unsure if this can cause any race conditions or if it's a good idea, but throwing it out there.

If we don't want to check the return code, we could instead augment the to_json_stream function as follows, which is hackier but may have fewer side effects:

    def to_json_stream(user_proc:)
      parser = EventStreamParser::Parser.new

      proc do |chunk, chunk_size|
        processed_data = false
        parser.feed(chunk) do |type, data, id, reconnection_time|
          processed_data = true
          parsed_data = JSON.parse(data)
          user_proc.call(parsed_data) unless data == "[DONE]"
        end
        unless processed_data
          # Nothing was emitted: the chunk may be a whole JSON error body
          # (though a partial event would also land here and fail to parse).
          Rails.logger.warn("Failed to parse chunk: #{chunk}")
          parsed_data = JSON.parse(chunk)
          raise StandardError, "Error with OpenAI: #{parsed_data}" if parsed_data["error"].present?
        end
      end
    end

This basically checks whether the stream processor failed to emit anything; if so, it tries to parse the chunk as plain JSON and, for now, raises on error (but we could do whatever we want there).

@smojtabai

We also may have wandered a bit, and I apologize. I do think your first fix is important and can be merged separately from this, but streaming, in my opinion, is not very usable without the ability to understand error responses from OpenAI. Do you have any connection with @alexrudall? I notice he hasn't commented on the other PRs trying to fix the error issue.

@atesgoral (Contributor, Author) commented Oct 19, 2023

The Faraday version is new enough. I must have been making a mistake earlier; I now see the env argument being passed to the on_data block.

Here's how I managed to grab the error response and raise the appropriate Faraday exception with it:

13ef7e3

    def to_json_stream(user_proc:)
      parser = EventStreamParser::Parser.new

      proc do |chunk, _bytes, env|
        # On a non-200 response, the body is a plain JSON error rather than
        # an event stream: let RaiseError raise the matching exception.
        if env && env.status != 200
          raise_error = Faraday::Response::RaiseError.new
          raise_error.on_complete(env.merge(body: JSON.parse(chunk)))
        end

        parser.feed(chunk) do |_type, data|
          user_proc.call(JSON.parse(data)) unless data == "[DONE]"
        end
      end
    end

This also feels a bit like a Faraday deficiency: it doesn't make this super easy.

The story so far, in a new draft PR that does all this and goes further by raising exceptions on HTTP errors (as well as on JSON parse errors): #342

I got blocked on a cryptic failure in a finetune test. It could be the test setup that's wrong.

@atesgoral (Contributor, Author)

@smojtabai No, Alex and I are not connected. I'm just trying to fix things for our project (on my fork of this library) while trying to contribute back to the source.

@atesgoral (Contributor, Author)

So, I might actually tweak this PR to preserve the existing behaviour of treating HTTP errors as non-errors and returning the JSON error as a result. #342 could be the bigger leap that changes behaviour, as a follow-up.

@smojtabai

I agree with keeping existing functionality for now; it's more likely to get merged. Thank you!

@@ -108,42 +108,39 @@

      context "when called with a string containing a single JSON object" do
        it "calls the user proc with the data parsed as JSON" do
          expect(user_proc).to receive(:call).with(JSON.parse('{"foo": "bar"}'))
    -     stream.call('data: { "foo": "bar" }')
    +     stream.call(<<~CHUNK)

@atesgoral (Contributor, Author) commented:
Event streams require double newlines (or CRs, or CRLFs) to emit data events.
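For illustration, a minimal sketch of that behaviour using the gem's API as it appears elsewhere in this PR: no event is emitted until the blank-line terminator arrives.

    require "event_stream_parser"

    parser = EventStreamParser::Parser.new

    parser.feed(%(data: {"foo": "bar"})) do |_type, _data|
      # Never reached: the event isn't terminated yet.
    end

    parser.feed("\n\n") do |_type, data|
      puts data # => {"foo": "bar"}
    end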

Reply:

Already learning new things, keep going 💪

        end

    -   context "when called with a string containing that looks like a JSON object but is invalid" do
    +   context "when called with string containing invalid JSON" do

@atesgoral (Contributor, Author) commented:

It was weird to test for "semblance" of JSON. It's either valid JSON or not.

        end
      end

    +   context "when called with JSON split across chunks" do

@atesgoral (Contributor, Author) commented Oct 19, 2023:
This is the real motivation of this PR. ruby-openai now becomes resilient to packet fragmentation at awkward locations.

@athyuttamre (OpenAI) commented:
Hi @atesgoral, I'm an engineer at OpenAI investigating this response. Do you have any logs (ideally from curl showing the full response stream) to demonstrate this issue? Please feel free to email me at [email protected]. Thanks!

@atesgoral (Contributor, Author) commented Oct 30, 2023:

@athyuttamre I haven't recently seen this happen when directly talking to the OpenAI API, but I recall seeing it a while back, when network conditions were bad. But it can still happen when a proxy or some other network element buffers and chops up the text/event-stream chunks at non-event-stream boundaries.

@atesgoral (Contributor, Author):

I don't think this is an OpenAI problem at all. Clients just need to be resilient and do buffered/proper parsing of event streams.

@atesgoral (Contributor, Author):

For posterity, here are two chunks (stream fragments) from an earlier reproduction of the parser issue that gets unearthed by buffering + rechunking:

Chunk 1:

    data: {"id":"chatcmpl-83QVo11UROyI8HUeixAo25Vjx0SA5","object":"chat.completion.chunk","created":1695827588,"model":"gpt-4-0613","choices":[{"index":0,"delta":{"role":"assistant","content":""},"finish_reason":null}]}
    data: {"id":"chatcmpl-83QVo11UROyI8HUeixAo25Vjx0SA5","object":"chat.completion.chunk","created":1695827588,"model":"gpt-4-0613","choices":[{"index":0,"delta"

Chunk 2:

    :{"content":"Hello"},"finish_reason":null}]}
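A sketch of how the buffering parser recovers the split event (chunk_1 and chunk_2 are hypothetical variables holding the raw fragments above, including their event-stream newlines):

    parser = EventStreamParser::Parser.new

    # Chunk 1 emits the first, complete event; the trailing partial
    # event stays buffered inside the parser.
    parser.feed(chunk_1) { |_type, data| puts data }

    # Chunk 2 completes the buffered event, which is then emitted.
    parser.feed(chunk_2) { |_type, data| puts data }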

@atesgoral (Contributor, Author) commented Oct 19, 2023

Alright, I updated the PR to be laser-focused on solving the parsing issue, without touching any other behaviour.

@alexrudall if you have some spare cycles, it would be great to get this in.

@atesgoral (Contributor, Author)

Follow-up PR to raise errors: #344

@smojtabai

@atesgoral if this branch were merged, would I be able to tell if an error occurred? Did you feel like the callback having an error argument was too big a change (without raising an exception)?

@atesgoral (Contributor, Author) commented Oct 20, 2023

@smojtabai With this change, there's no change to how errors are (subtly) handled: you'd still have to check whether the parsed JSON chunk your proc receives looks like an error (i.e. has an error property).

Ah, I guess the shape of the error is changing now, since the error value is no longer being plucked. 🤔

But assuming the error-raising PR below can also be reviewed and accepted in quick succession, there could be a major version bump to the gem for the new error-handling flavour. We'll see soon 🤞
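For illustration, that consumer-side check might look like this (a sketch; assumes the whole error object is passed through to the proc unplucked):

    stream = proc do |chunk|
      if chunk["error"]
        warn "OpenAI error: #{chunk.dig("error", "message")}"
      else
        print chunk.dig("choices", 0, "delta", "content")
      end
    end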

@alexrudall merged commit 76aab2d into alexrudall:main on Oct 30, 2023. 6 checks passed.
@alexrudall (Owner)

Released in 5.2.0. Huge thanks to you @atesgoral for your work on this, and to others for their input.
