Configuration validation and annotations on errors #80

gentlementlegen · 2024-07-27T01:05:12Z

In the bot v1, when the configuration is changed, annotations are added to the configuration file if any error is encountered within the configuration. It would be really nice to have for v2 as it is a common scenario to have an invalid configuration and have no feedback about it because the error would only be visible within the worker logs.

The main challenge is that only plugins are aware of their configuration shape, so probably the kernel should call every plugin to check their validity, which would require an endpoint or some access to the configuration validators. At the same time, the manifest.json validity could be checked, relating to #78.

Since this might imply quite a few network requests, we can also consider having this functionality as a separate Worker using service binding.

The text was updated successfully, but these errors were encountered:

0x4007 · 2024-07-29T08:22:03Z

Since this might imply quite a few network requests

I think its fine given that we can make 5000 requests per hour, per organization, before we get rate limited as an app. Lets keep it simple and not do the separate Worker. Config changes do not happen regularly.

gentlementlegen · 2024-07-30T01:47:41Z

@0x4007 What you say is valid for the GitHub API. But Workers are limited to 50 requests per run / instance.

0x4007 · 2024-07-30T02:54:13Z

We can do this in a plugin instead?

gentlementlegen · 2024-07-30T06:02:36Z

I thought about it, but it means that the plugin should have access to the private repo containing the configuration and should be able to read / write which seems dangerous. Or maybe we can run that plugin within the configuration repo itself. But if we do so we cannot handle per repo based configurations I think

0x4007 · 2024-07-30T06:12:07Z

Or maybe we can run that plugin within the configuration repo itself.

Cool idea

But if we do so we cannot handle per repo based configurations I think

Research will reveal the answer!

gentlementlegen · 2024-08-29T06:39:37Z

/start

ubiquity-os · 2024-08-29T06:39:44Z

Warning! This task was created over 33 days ago. Please confirm that this issue specification is accurate before starting. Deadline Thu, Sep 5, 6:39 AM UTC Registered Wallet 0x0fC1b909ba9265A846b82CF4CE352fc3e7EeB2ED

Tips:

Use /wallet 0x0000...0000 if you want to update your registered payment wallet address.
Be sure to open a draft pull request as soon as possible to communicate updates on your progress.
Be sure to provide timely updates to us when requested, or you will be automatically unassigned from the task.

gentlementlegen · 2024-08-29T06:40:52Z

What should be done beyond the kernel side changes:
Each plugin should have an entry point for validation on Workers, and an Action script for Actions.

0x4007 · 2024-08-29T06:42:38Z

Each plugin should have an entry point for validation on Workers, and an Action script for Actions.

Can you elaborate this isn't clear to others

gentlementlegen · 2024-08-29T07:13:39Z

The kernel workflow should be the following:

The kernel receives a push event
The kernel figures out it is the configuration file
The kernel will, for each plugin inside the configuration, poke and endpoint for Workers and an Action for Actions that will parse the configuration and return validation errors
I think the kernel should also put a warning on endpoints it cannot reach and ignore the plugin without breaking the whole configuration
The kernel will eventually post annotation if any issue is found, or a message saying the configuration is valid

The v1 had a similar workflow. I think this is important to have this functionality specially for newcomers that could be confused about the configuration, or avoid breaking it during development and updates.

gentlementlegen · 2024-09-10T04:55:38Z

@0x4007 I've seen you mentioning an issue about the base multiplier having changed for v2, and indeed it was reset to 1. Put it back to 3 as it was in v1.

gentlementlegen · 2024-09-20T02:34:53Z

@0x4007 Would something similar to the following format be satisfactory? It gives the error and link the corresponding line with a preview:

0x4007 · 2024-09-20T03:03:27Z

Also use the

[!CAUTION]

my message

syntax

The red one whatever the syntax is

gentlementlegen · 2024-09-21T01:47:31Z

Progress update:
I finally got the Action plugins to validate themselves, was trickier because everything is asychronous. So I had a design question: since I have to listen for Actions to finish validating, currently what happens is that each plugin will add its own message, like so:

Maybe that would be too noisy since a user would get tag each time a plugin is done validating. So maybe what we can consider:

no message on success to reduce noise
combine all the outputs by editing the message each time a plugin is done validating

Downsize of reducing noise by editing the message is that every plugin will be async and maybe errors keep coming after the user thought the configuration was valid. What do you think? Another run example: https://github.com/Meniole/ubiquibot-config/commit/3e9152e3eadd98ef20b10bfba7529533ed392c46

0x4007 · 2024-09-22T03:04:01Z

Yes do both of your suggestions

0x4007 · 2024-09-23T14:46:05Z

Oh I thought the validator is one and done inside the kernel. This approach across every plugin seems wrong.

This can be dynamic by reading the plugin ajv validator files

gentlementlegen · 2024-09-23T15:44:55Z

@0x4007 I think it makes more sense to have it within the plugin itself, we could even have it within our SDK for simplicity. I do not see how the kernel can understand plugin configurations.

0x4007 · 2024-09-23T15:46:09Z

My idea was to import and run the ajv validation code. Could be risky but maybe there's a way to quarantine it in a way that's not too complex.

gentlementlegen · 2024-09-23T15:49:49Z

We do not use ajv anywhere, because ajv cannot run within workers. Beyond being very risky and prone to code injection, some plugins have very complex configurations spread across multiple files, like conversation-rewards how could we handle this? Some other plugins have runtime encode / decode, or types generated through import of third party libraries (for example GitHub roles that are generated from an enum). That seemed way too complex to just read the configuration file, it is much simpler to reuse the code already implemented within the plugin itself, since every plugin already validates its own payload.

Also, in the case of workers, the kernel does not know which repository they refer too, so we should serve the file as plain text on the endpoint which doesn't seem elegant.

Another advantage of using directly the plugin is that currently it also detects invalid GitHub / Worker environment variables which is very helpful.

0x4007 · 2024-09-23T16:43:50Z

I think we need to figure this out eventually because of the marketplace/plugin installer feature

gentlementlegen · 2024-09-23T17:09:21Z

I think it is quite straightforward for Workers, too slow for Actions. Maybe eventually we will need to have an endpoint for all the plugins. I don't see how we can have the kernel itself handle this because:

the kernel should parse the included files and retrieve all of them
in case some external library is linked, the kernel should install it
that open the possibility to inject code, and extremely easy to break as well
we cannot safely read the environment of plugins either, because we would leak secrets

If we can solve all of these then the kernel should be able to just import the configuration type files and read them.

0x4007 · 2024-09-24T05:24:09Z

My vision is not too different from a Docker-like approach. We can spin up a virtual shell as a child process, which then runs its own node.js instance.

I think it is quite straightforward for Workers, too slow for Actions. Maybe eventually we will need to have an endpoint for all the plugins. I don't see how we can have the kernel itself handle this because:

the kernel should parse the included files and retrieve all of them

I think we just need to send everything, although I'm not fully understanding the context of this statement.

in case some external library is linked, the kernel should install it

As in, npm? Knip might be able to help compile these, or there may be some better tools.

that open the possibility to inject code, and extremely easy to break as well

Code injection might be acceptable within a virtual shell.

we cannot safely read the environment of plugins either, because we would leak secrets

The virtual shell should not have access to the environment secrets.

If we can solve all of these then the kernel should be able to just import the configuration type files and read them.

gentlementlegen · 2024-09-24T05:55:29Z

Workers do not run node.js but a custom cloudflare-like minimal implementation. Workers do not allow to run external code for security reasons, nor allow child processes, or shells, afaik.

I do not understand the benefit of heavily complexifying this on the kernel, when we already have running endpoints that support all the logic (and Actions are literally dockers themselves). Also I do not understand what Knip is needed for?

Practical example:
I need to check if the configuration for conversation-rewards is valid. Here is the configuration file:
https://github.com/ubiquibot/conversation-rewards/blob/development/src/configuration/incentives.ts

How can I check that the configuration provided is valid against this?

0x4007 · 2024-09-24T10:35:53Z

One expensive idea is to consume the type with o1-mini and then have it post a comment if it thinks that it won't work. I suppose it would unfortunately have to consume the entire plugin codebase before determining this though.

It would be a bit cheaper if we standardize the location of the payload type checker file.

This seems like a bad approach to scale. We could consider making this a standalone plugin which is expensive to run but we can allow partners to opt-out if its too expensive?

In conclusion I don't see a great cheap solution. I think we should handle it inside of the plugins as you are doing, and then in the future we automate plugin configuration using a GUI which can prevent misconfiguration? Perhaps we enforce a standard for plugin developers to follow for it to populate on the GUI and to be configurable?

gentlementlegen · 2024-09-25T04:09:35Z

I think running in their own plugin is the cheapest way we can use for now. For the GUI, it would be no problem with Worker ones as the response would be instantaneous on a bad configuration, but Actions would take like a minute to validate which would be a bummer for a UI, so that's the part we should figure out. One thing that @whilefoo pointed out is serve the configuration schema through the manifest, which could have been a solution if we find a way to compile the schema in such a way it contains every needed info that could be interpreted by the kernel correctly.

whilefoo · 2024-09-25T09:47:35Z

Yeah my first idea was that worker plugins could serve configuration schema through the manifest but that's too cumbersome to write in the manifest so instead it could just convert typebox schema to json schema and send it over a dedicated endpoint, but the problem is with action plugins because we can't get a fast and sync response

gentlementlegen · 2024-09-25T10:28:58Z

@whilefoo Based on what you said, maybe then there would be an approach that would allows both Actions and Workers to have a fast response.
Typebox has a codegen tool which allows to transform from the Model to JSON. There is also a tool that allows to transform JSON to TypeBox. Or, codegen supports Model -> TS and TS -> Model but I feel less comfortable serving TypeScript files. However, the JSON tool is a CLI, not a package so maybe it cannot be run within Workers.

For workers, we could simply serve the JSON through an endpoint. For Actions, we could have a script automatically generating the JSON file on push events, so the kernel can simply download the file (which would always be at the same locations for plugins) which would be significantly faster than running an Action.

That can be a path I can explore as well.

0x4007 · 2024-09-25T21:47:41Z

because we can't get a fast and sync response

Admin can make a config change, the Action runs and posts a comment on the commit a few minutes later BUT it tags the author, so they are notified.

I think this is acceptable as a first version if the better solutions can't be figured out.

gentlementlegen · 2024-09-26T04:41:52Z

@0x4007 @whilefoo It usually takes ~30s and it does tag the user.

I did some research regarding the usage of a json to describe the configuration. Here is what I found:

typebox-codegen cannot be used because it generates code files, and Cloudflare forbids the use of eval, exec, Function and so on, so cannot run arbitrary code
typebox can serialize the JSON, but cannot read it back
I thought we could, however, use other packages like AJV to validate the JSON based on its content. But AJV also relies on Function to read the schemas so it cannot run within workers (Allow it to work on edge (e.g. cloudflare workers) ajv-validator/ajv#2318). There seems to be an alternative that I didn't try out yet: https://github.com/cfworker/cfworker/tree/main/packages/json-schema

So it is possible, but comes with drawbacks.

Functionality	JSON	Endpoint + Action
Instantaneous response	✅	❌
Environment validation	❌	✅
Decode validation	❌	✅
Detailed errors	✅	✅
Need of an extra build step	✅	❌
Need for an extra Action script	❌	✅

TL;DR JSON is faster, Endpoint + Action give much more accurate errors, so don't know what route we prefer.

0x4007 · 2024-09-26T05:16:05Z

@whilefoo you can make the decision

whilefoo · 2024-09-26T15:11:29Z

What do you mean by environment validation?

I think decode validation is not that important and also I think it's more secure if the kernel does validation and not the plugin which can access the configuration.

For Actions, we could have a script automatically generating the JSON file on push events, so the kernel can simply download the file (which would always be at the same locations for plugins) which would be significantly faster than running an Action.

How would the action know where the configuration schema is located in the codebase? Unless the developer sets a path to the file and name of the variable

gentlementlegen · 2024-09-26T15:33:08Z

Environment validation meaning validating env process.env values, it is possible when it happens on the plugin side.

The decode can be handy, practical example in the plugin name where "test" would be a valid string, but does not properly represent a plugin URL nor an Action path. This is validated during decode, so only can be picked up plugin side.

For the path, we should just rely on a standard location the same way we do for the manifest. Or even have it appended within the manifest itself.

0x4007 · 2024-09-26T17:15:45Z

Standard location (root, sibling of manifest) seems simplest

0x4007 added the Priority: 1 (Normal) label Jul 29, 2024

gentlementlegen added the Time: <1 Week label Aug 29, 2024

ubiquity-os bot added the Price: 200 USD label Aug 29, 2024

ubiquity-os bot assigned gentlementlegen Aug 29, 2024

gentlementlegen added Time: <1 Day and removed Time: <1 Week labels Sep 10, 2024

ubiquity-os bot added Price: 300 USD and removed Price: 200 USD labels Sep 10, 2024

gentlementlegen added Time: <1 Week Price: 200 USD and removed Price: 300 USD labels Sep 10, 2024

ubiquity-os bot added Price: 300 USD and removed Price: 200 USD labels Sep 10, 2024

gentlementlegen removed the Time: <1 Day label Sep 10, 2024

ubiquity-os bot added Price: 600 USD and removed Price: 300 USD labels Sep 10, 2024

gentlementlegen linked a pull request Sep 19, 2024 that will close this issue

feat: configuration annotations #112

Open

3 tasks

gentlementlegen linked a pull request Sep 21, 2024 that will close this issue

feat: schema validation ubiquity-os-marketplace/automated-merging#21

Draft

gentlementlegen mentioned this issue Sep 24, 2024

Skip plugin run on missing manifest.json file #78

Closed

This was referenced Sep 25, 2024

feat: schema validation ubiquity-os-marketplace/conversation-rewards#127

Draft

feat: schema validation ubiquity-os-marketplace/disqualifier#25

Draft

feat: schema validation ubiquity-os/plugin-template#23

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Configuration validation and annotations on errors #80

Configuration validation and annotations on errors #80

gentlementlegen commented Jul 27, 2024 •

edited

Loading

0x4007 commented Jul 29, 2024 •

edited

Loading

gentlementlegen commented Jul 30, 2024

0x4007 commented Jul 30, 2024 •

edited

Loading

gentlementlegen commented Jul 30, 2024 •

edited

Loading

0x4007 commented Jul 30, 2024

gentlementlegen commented Aug 29, 2024

ubiquity-os bot commented Aug 29, 2024

gentlementlegen commented Aug 29, 2024

0x4007 commented Aug 29, 2024

gentlementlegen commented Aug 29, 2024 •

edited

Loading

gentlementlegen commented Sep 10, 2024

gentlementlegen commented Sep 20, 2024

0x4007 commented Sep 20, 2024 •

edited

Loading

gentlementlegen commented Sep 21, 2024 •

edited

Loading

0x4007 commented Sep 22, 2024

0x4007 commented Sep 23, 2024 •

edited

Loading

gentlementlegen commented Sep 23, 2024

0x4007 commented Sep 23, 2024

gentlementlegen commented Sep 23, 2024 •

edited

Loading

0x4007 commented Sep 23, 2024 •

edited

Loading

gentlementlegen commented Sep 23, 2024 •

edited

Loading

0x4007 commented Sep 24, 2024

gentlementlegen commented Sep 24, 2024

0x4007 commented Sep 24, 2024 •

edited

Loading

gentlementlegen commented Sep 25, 2024

whilefoo commented Sep 25, 2024

gentlementlegen commented Sep 25, 2024 •

edited

Loading

0x4007 commented Sep 25, 2024 •

edited

Loading

gentlementlegen commented Sep 26, 2024

0x4007 commented Sep 26, 2024

whilefoo commented Sep 26, 2024

gentlementlegen commented Sep 26, 2024 •

edited

Loading

0x4007 commented Sep 26, 2024

Configuration validation and annotations on errors #80

Configuration validation and annotations on errors #80

Comments

gentlementlegen commented Jul 27, 2024 • edited Loading

0x4007 commented Jul 29, 2024 • edited Loading

gentlementlegen commented Jul 30, 2024

0x4007 commented Jul 30, 2024 • edited Loading

gentlementlegen commented Jul 30, 2024 • edited Loading

0x4007 commented Jul 30, 2024

gentlementlegen commented Aug 29, 2024

ubiquity-os bot commented Aug 29, 2024

Tips:

gentlementlegen commented Aug 29, 2024

0x4007 commented Aug 29, 2024

gentlementlegen commented Aug 29, 2024 • edited Loading

gentlementlegen commented Sep 10, 2024

gentlementlegen commented Sep 20, 2024

0x4007 commented Sep 20, 2024 • edited Loading

gentlementlegen commented Sep 21, 2024 • edited Loading

0x4007 commented Sep 22, 2024

0x4007 commented Sep 23, 2024 • edited Loading

gentlementlegen commented Sep 23, 2024

0x4007 commented Sep 23, 2024

gentlementlegen commented Sep 23, 2024 • edited Loading

0x4007 commented Sep 23, 2024 • edited Loading

gentlementlegen commented Sep 23, 2024 • edited Loading

0x4007 commented Sep 24, 2024

gentlementlegen commented Sep 24, 2024

0x4007 commented Sep 24, 2024 • edited Loading

gentlementlegen commented Sep 25, 2024

whilefoo commented Sep 25, 2024

gentlementlegen commented Sep 25, 2024 • edited Loading

0x4007 commented Sep 25, 2024 • edited Loading

gentlementlegen commented Sep 26, 2024

0x4007 commented Sep 26, 2024

whilefoo commented Sep 26, 2024

gentlementlegen commented Sep 26, 2024 • edited Loading

0x4007 commented Sep 26, 2024

gentlementlegen commented Jul 27, 2024 •

edited

Loading

0x4007 commented Jul 29, 2024 •

edited

Loading

0x4007 commented Jul 30, 2024 •

edited

Loading

gentlementlegen commented Jul 30, 2024 •

edited

Loading

gentlementlegen commented Aug 29, 2024 •

edited

Loading

0x4007 commented Sep 20, 2024 •

edited

Loading

gentlementlegen commented Sep 21, 2024 •

edited

Loading

0x4007 commented Sep 23, 2024 •

edited

Loading

gentlementlegen commented Sep 23, 2024 •

edited

Loading

0x4007 commented Sep 23, 2024 •

edited

Loading

gentlementlegen commented Sep 23, 2024 •

edited

Loading

0x4007 commented Sep 24, 2024 •

edited

Loading

gentlementlegen commented Sep 25, 2024 •

edited

Loading

0x4007 commented Sep 25, 2024 •

edited

Loading

gentlementlegen commented Sep 26, 2024 •

edited

Loading