Ollama chat endpoint support #16
Conversation
Thanks for this change! Before I take a look, do you have FSF copyright assignment? This is needed to contribute substantial changes to this project, since it is part of GNU ELPA.
In process. Emailed FSF just as I sent this in. Hopefully, it won't take long. I'm a college professor, but our faculty handbook makes it clear they don't claim copyright over works by faculty. Hoping FSF will accept that without requiring additional signatures. Sorry for the delay. Will let you know when it's settled.
No problem, thank you for getting that process started! And without looking at your code, I'd suggest that it's OK to go to the chat endpoint by default. That is typically what we do for other providers.
My FSF copyright assignment finally came through. Just wanted to let you know.
Thank you for these changes and for getting the copyright assignment!
llm-ollama.el
Outdated
`request-alist' depending on whether a `chat' or `generate'
endpoint is specified in the provider."
  (let* ((request-alist
          (if (string= (llm-ollama-endpoint provider) "chat")
You can just use "equal" here; "string=" is more about comparing without text properties.
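That is, the test would become something like:

(equal (llm-ollama-endpoint provider) "chat")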
README.org
Outdated
@@ -56,6 +56,7 @@ In addition to the provider, which you may want multiple of (for example, to cha
 - ~:port~: The port that ollama is run on. This is optional and will default to the default ollama port.
 - ~:chat-model~: The model name to use for chat. This is not optional for chat use, since there is no default.
 - ~:embedding-model~: The model name to use for embeddings. This is not optional for embedding use, since there is no default.
+- ~:endpoint~: The ollama endpoint to use, either "generate" or "chat". This is optional and will default to "generate".
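For reference, a provider using the proposed option might be configured like this (only a sketch: make-llm-ollama is the structure's default constructor, and the model name is just an example):

(make-llm-ollama :chat-model "llama2" :endpoint "chat")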
Do you think it makes a difference to use "generate", when we could just use "chat" always? If there's no appreciable difference, I'd prefer not having an option.
llm-ollama.el
Outdated
@@ -178,7 +220,7 @@ STREAMING if non-nil, turn on response streaming."
   ;; we really just need it for the local variables.
   (with-temp-buffer
     (let ((output (llm-request-sync-raw-output
-                   (llm-ollama--url provider "generate")
+                   (llm-ollama--url provider (slot-value provider 'endpoint))
This is fine, but stylistically the rest of the code prefers to use (llm-ollama-endpoint provider), so can you change this and the other instance of it (assuming you keep the endpoint slot)?
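That is, the call would become:

(llm-ollama--url provider (llm-ollama-endpoint provider))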
Thanks! Based on your suggestions, I've removed support for the generate endpoint.
Thank you for your change!
Hello, Andrew,
First off, thanks for your project! I wrote some code over a year ago for interacting with the OpenAI API via Emacs. OpenAI broke that several times with their changes. Recently, I started working with LLMs locally and decided to rewrite everything from scratch to support multiple APIs ... and discovered that, thankfully, someone had beaten me to that! :-)
I noticed that the existing package provided support only for the Ollama /generate endpoint, but not the /chat endpoint. I find the /generate endpoint simpler for one-off requests, but for chat conversations with multiple interactions I prefer the /chat endpoint: with it, I can build a set of interactions from scratch, regenerate responses, or switch between different LLMs within the same conversation. So I've introduced some changes to support the other endpoint. I believe the change is backward compatible.
The main changes include:

- A new :endpoint slot in the llm-ollama structure, which specifies the API endpoint to use (sketched below). This slot defaults to generate, but can be set to chat to utilize the /chat endpoint.
- Adjustments to the JSON encodings for making requests to the API and processing responses from the server. These changes accommodate the differences between the /generate and /chat endpoints. Specifically, separate helper functions have been created to handle the request data for each endpoint.
- Modifications to the llm-ollama--chat-request function to call the appropriate helper function based on the specified endpoint.
- Updates to the llm-ollama--url calls in the llm-chat and llm-chat-streaming methods to use the specified endpoint.
- Updated documentation to reflect the new :endpoint slot in the llm-ollama structure.

Hopefully, support for both endpoints aligns with the package's goal of abstracting functionality to a higher level and concealing API variations.
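To give a concrete picture, here is a minimal sketch of the new slot and of how the request payload could differ per endpoint. It is illustrative only: the slot layout and the helper name llm-ollama--request-data are assumptions made for this sketch, not necessarily the exact code in the patch.

(require 'cl-lib)
(require 'json)

;; Sketch only: the slot layout is an illustrative assumption, not the
;; package's exact struct definition.
(cl-defstruct llm-ollama
  host port chat-model embedding-model
  (endpoint "generate"))  ; new slot; "generate" preserves the old behavior

(defun llm-ollama--request-data (provider text)
  "Build the JSON payload for TEXT according to PROVIDER's endpoint.
Illustrative helper: /api/chat takes a vector of role-tagged messages,
while /api/generate takes a single prompt string."
  (json-encode
   (if (equal (llm-ollama-endpoint provider) "chat")
       `(("model" . ,(llm-ollama-chat-model provider))
         ("messages" . [(("role" . "user") ("content" . ,text))]))
     `(("model" . ,(llm-ollama-chat-model provider))
       ("prompt" . ,text)))))

For example, a provider whose endpoint is "chat" yields a messages-style payload, while the default yields the familiar prompt-style payload.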
I look forward to your feedback and the opportunity to contribute to the ongoing development of this package.
Best,
Thomas