Added the isjsonvalid() and the wordstream() functions #52

atantos · 2023-11-18T07:58:20Z

Hi there!

You might want to consider adding a default wordstream() function. The function parses the stream and it seems to work as is. If you think it is ok, I could also contribute to the docu of the function.

Regards.

Alex

roryl23 · 2023-11-18T17:16:57Z

Hey Alex, thanks for the PR! I'm getting caught up after a hectic couple weeks. Will take a look by the end of today

svilupp · 2023-11-19T11:58:19Z

Out of curiousity, what is the use case here? What would be the advantages of implementing it?

It seems that it basically parses JSON twice anyway, once inside a try-catch (in isvalidjson) and then when actually parsing. What is the advantage over just having the try-catch around the parsing itself?

Side note: perhaps you could disable your formatter to only change the proposed functionality? There are a few places picked up by the diff where only spaces have changed.

atantos · 2023-11-19T12:14:07Z

Thank you for the insight, @svilupp! I'm thinking of a "default" streamcallback function for word-by-word streaming that would be very useful for a user for a number of reasons. Crafting such a function is not straightforward, as the data stream isn't just a series of JSON-parsable objects prefixed with "Data:". The stream may split, with a packet starting mid-way through an object from the previous packet. You're correct about the double JSON parsing; there might indeed be a more efficient method to verify if the input is JSON before continuing with the wordstream() function. Also, I appreciate your feedback on the formatter—I'll definitely keep that in mind!

svilupp · 2023-11-27T20:41:03Z

Thank you for the insight, @svilupp! I'm thinking of a "default" streamcallback function for word-by-word streaming that would be very useful for a user for a number of reasons. Crafting such a function is not straightforward, as the data stream isn't just a series of JSON-parsable objects prefixed with "Data:". The stream may split, with a packet starting mid-way through an object from the previous packet. You're correct about the double JSON parsing; there might indeed be a more efficient method to verify if the input is JSON before continuing with the wordstream() function. Also, I appreciate your feedback on the formatter—I'll definitely keep that in mind!

Apologies for the slow response!

Would you mind outlining some of the use cases for building JSONs on the fly as they are getting streamed? I lack imagination and I tend to associate streaming with nicer UX design for chat interfaces. Are you thinking of some other use case?

I'm not very familiar with the functionality, so I'd appreciate if you could capture a few key test cases and codify them in the test set. It will help us prevent any future regressions and it will help with onboarding new devs, because they'll see a practical example.
I can imagine that parsing text vs JSON mode response vs function_call will require a slightly different approach, so I'm keen to see that covered in the test suite.

cpfiffer

I like the PR and think we should merge it in. I left two comments I hope we can incorporate to make it a little more obvious how this stuff works.

cpfiffer · 2023-12-01T23:31:56Z

src/OpenAI.jl

+isvalidjson(str) =
+    try
+        JSON3.read(str)
+        true
+    catch
+        false
+    end


Minor note but I typically prefer function ... end syntax for multiline functions.

Suggested change

isvalidjson(str) =

try

JSON3.read(str)

true

catch

false

end

function isvalidjson(str)

try

JSON3.read(str)

true

catch

false

end

end

cpfiffer · 2023-12-01T23:32:32Z

src/OpenAI.jl

+"""
+Default streamcallback function for create_<action> functions.
+"""


Could you add a little usage section here to show how people should use this?

svilupp · 2024-02-01T13:30:13Z

Perhaps we should learn from what others are doing?

Langchain:

astream_events

It’s worth calling out that we’re using our new astream_eventsmethod to easily stream back all events (new tokens, as well as function calls and function results) and surface them to the user. We do some filtering of this stream to get relevant messages or message chunks, and then render them nicely in the UI. If you aren’t familiar with astream_events, it is definitely worth checking it out in more detail here.

Code: https://github.com/langchain-ai/langchain/blob/a0ec04549575de5547faa5381fa69fef61405ac8/libs/core/langchain_core/runnables/base.py#L697

Added the isjsonvalid() and the wordstream() functions

3f8ba8f

roryl23 mentioned this pull request Nov 18, 2023

Update API endpoints #50

Open

cpfiffer suggested changes Dec 1, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added the isjsonvalid() and the wordstream() functions #52

Added the isjsonvalid() and the wordstream() functions #52

atantos commented Nov 18, 2023

roryl23 commented Nov 18, 2023

svilupp commented Nov 19, 2023

atantos commented Nov 19, 2023 •

edited

Loading

svilupp commented Nov 27, 2023

cpfiffer left a comment

cpfiffer Dec 1, 2023

cpfiffer Dec 1, 2023

svilupp commented Feb 1, 2024

Added the isjsonvalid() and the wordstream() functions #52

Are you sure you want to change the base?

Added the isjsonvalid() and the wordstream() functions #52

Conversation

atantos commented Nov 18, 2023

roryl23 commented Nov 18, 2023

svilupp commented Nov 19, 2023

atantos commented Nov 19, 2023 • edited Loading

svilupp commented Nov 27, 2023

cpfiffer left a comment

Choose a reason for hiding this comment

cpfiffer Dec 1, 2023

Choose a reason for hiding this comment

cpfiffer Dec 1, 2023

Choose a reason for hiding this comment

svilupp commented Feb 1, 2024

atantos commented Nov 19, 2023 •

edited

Loading