Make config-schema extensible for handling of unknown fields #156214

mikecote · 2023-04-28T18:30:13Z

Related issue #155764.

In this PR, I'm adding an extendsDeep function to the schema object. It creates a copy of an existing schema definition and recursively modifies its options without mutating the original. With extendsDeep, you can specify whether unknown attributes on objects should be allowed, forbidden, or ignored.

This new function is particularly useful for alerting scenarios where we need to drop unknown fields when reading from Elasticsearch without modifying the schema object. Since we don't control the schema definition in some areas, extendsDeep provides a convenient way to apply the unknowns option to all objects recursively. By doing so, we can validate and drop unknown properties using the same defined schema, just extended with unknowns: ignore (or reject them with unknowns: forbid).

Usage:

import { schema } from '@kbn/config-schema';

// Single, shared type definition
const type = schema.object({ foo: schema.string() });

// Drop unknown fields (bar in this case)
const savedObject = { foo: 'test', bar: 'test' };
const ignoreSchema = type.extendsDeep({ unknowns: 'ignore' });
ignoreSchema.validate(savedObject); // bar is stripped from the returned value

// Prevent unknown fields (bar in this case)
const soToUpdate = { foo: 'test', bar: 'test' };
const forbidSchema = type.extendsDeep({ unknowns: 'forbid' });
forbidSchema.validate(soToUpdate); // throws because bar is not defined in the schema
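
To make the three modes concrete, here is a minimal, self-contained sketch of the recursive copy described above. It does not use @kbn/config-schema: MiniString, MiniObject, and the sample input are hypothetical stand-ins written for illustration, and the default of 'forbid' is an assumption of the sketch.

```typescript
type Unknowns = 'allow' | 'ignore' | 'forbid';
type Dict = Record<string, unknown>;

// Hypothetical stand-in types; not the @kbn/config-schema implementation.
interface MiniType {
  validate(value: unknown): unknown;
  extendsDeep(options: { unknowns: Unknowns }): MiniType;
}

class MiniString implements MiniType {
  validate(value: unknown): string {
    if (typeof value !== 'string') throw new Error('expected a string');
    return value;
  }
  // Leaf types carry no unknowns option, so there is nothing to copy.
  extendsDeep(): MiniType {
    return this;
  }
}

class MiniObject implements MiniType {
  constructor(
    private readonly props: Record<string, MiniType>,
    // Assumption: unknown keys are rejected unless stated otherwise.
    private readonly unknowns: Unknowns = 'forbid'
  ) {}

  validate(value: unknown): Dict {
    if (typeof value !== 'object' || value === null) throw new Error('expected an object');
    const input = value as Dict;
    const output: Dict = {};
    for (const key of Object.keys(this.props)) {
      output[key] = this.props[key].validate(input[key]);
    }
    for (const key of Object.keys(input)) {
      if (Object.prototype.hasOwnProperty.call(this.props, key)) continue;
      if (this.unknowns === 'forbid') throw new Error(`unknown key [${key}]`);
      if (this.unknowns === 'allow') output[key] = input[key];
      // 'ignore': the key is dropped from the output
    }
    return output;
  }

  // Returns a new schema; neither `this` nor its children are mutated.
  extendsDeep(options: { unknowns: Unknowns }): MiniObject {
    const extendedProps: Record<string, MiniType> = {};
    for (const key of Object.keys(this.props)) {
      extendedProps[key] = this.props[key].extendsDeep(options);
    }
    return new MiniObject(extendedProps, options.unknowns);
  }
}

const shared = new MiniObject({
  foo: new MiniString(),
  nested: new MiniObject({ a: new MiniString() }),
});
const input = { foo: 'test', bar: 'test', nested: { a: 'x', b: 'y' } };

// Unknown keys are dropped at every level.
const ignored = shared.extendsDeep({ unknowns: 'ignore' }).validate(input);
// Unknown keys are kept at every level.
const allowed = shared.extendsDeep({ unknowns: 'allow' }).validate(input);
console.log(ignored, allowed);
```

The original `shared` schema keeps its own policy after both calls, which is the point of copying rather than mutating.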

mikecote · 2023-05-01T16:39:26Z

@elasticmachine merge upstream


mikecote · 2023-05-02T17:14:22Z

@elastic/kibana-core I would be curious to hear your thoughts about this approach. Would you benefit from this elsewhere?

pgayvallet

I think the approach looks fine to me.

My questions and concerns are in the comments.

packages/kbn-config-schema/src/types/array_type.test.ts

pgayvallet · 2023-05-03T08:54:53Z

packages/kbn-config-schema/src/types/object_type.ts

+  public extendsDeep(options: ExtendsDeepOptions) {
+    const extendedProps = Object.entries(this.props).reduce((memo, [key, value]) => {
+      if (value !== null && value !== undefined) {
+        return {


(thinking out loud here) I'm overall fine with the recursive approach; I can understand that it would be complicated / tedious to manually redefine everything.

Now, while this extendsDeep approach looks fine to me as an internal API to perform the mutation, I wonder if that's what we want as the public API.

I was thinking more of something like:

const type = schema.object({ foo: schema.string() });
const savedObject = { foo: 'test', bar: 'test' };
// ... later
type.validate(savedObject, { unknowns: 'ignore' });

Now, technically, internally we don't want to create a new schema every time validate is called with this option. But if we know that we will only be exposing this single 'schema mutation' option to validate, we could store the mutated schema internally.

However, the validate signature is already crowded with 'useless' stuff as second and third parameters:

kibana/packages/kbn-config-schema/src/types/type.ts

Line 86 in 27ff7d3

public validate(value: any, context: Record<string, any> = {}, namespace?: string): V {

So I don't think it would be that easy to adapt it to have a good DX, as it would look like

type.validate(savedObject, {}, undefined, { unknowns: 'ignore' });

So in summary, your approach is probably the most pragmatic one.
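
For reference, the "store the mutated schema internally" idea could look roughly like the sketch below. Everything here is hypothetical: MiniSchema is a stand-in that only tracks known keys, the two-argument validate signature is not the real one, and copiesBuilt exists only to show that the extended copy is created once.

```typescript
type Unknowns = 'allow' | 'ignore' | 'forbid';
type Dict = Record<string, unknown>;

// Hypothetical stand-in: known keys plus an unknowns policy, nothing else.
class MiniSchema {
  // Counts how many extended copies were created (for demonstration only).
  public copiesBuilt = 0;
  // One extended copy per unknowns value, created lazily.
  private readonly extended: Partial<Record<Unknowns, MiniSchema>> = {};

  constructor(
    private readonly knownKeys: string[],
    private readonly unknowns: Unknowns = 'forbid'
  ) {}

  extendsDeep(options: { unknowns: Unknowns }): MiniSchema {
    return new MiniSchema(this.knownKeys, options.unknowns);
  }

  // Hypothetical signature: the option is passed straight to validate.
  validate(value: Dict, options?: { unknowns: Unknowns }): Dict {
    if (options && options.unknowns !== this.unknowns) {
      let copy = this.extended[options.unknowns];
      if (!copy) {
        copy = this.extendsDeep(options);
        this.extended[options.unknowns] = copy;
        this.copiesBuilt++;
      }
      return copy.validate(value);
    }
    const output: Dict = {};
    for (const key of Object.keys(value)) {
      if (this.knownKeys.indexOf(key) !== -1 || this.unknowns === 'allow') {
        output[key] = value[key];
      } else if (this.unknowns === 'forbid') {
        throw new Error(`unknown key [${key}]`);
      }
      // 'ignore': skip the key
    }
    return output;
  }
}

const type = new MiniSchema(['foo']);
const savedObject = { foo: 'test', bar: 'test' };

// Both calls reuse the same cached 'ignore' copy.
const first = type.validate(savedObject, { unknowns: 'ignore' });
const second = type.validate(savedObject, { unknowns: 'ignore' });
console.log(first, second, type.copiesBuilt);
```

The cache keeps repeated calls cheap, but it does nothing about the crowded parameter list discussed above.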

pgayvallet · 2023-05-03T08:56:35Z

packages/kbn-config-schema/src/types/record_type.ts

+  private readonly keyType: Type<K>;
+  private readonly valueType: Type<V>;
+  private readonly options: RecordOfOptions<K, V>;


We're now storing all the constructor options on all of our type classes to be able to clone them during extendsDeep.

I'm thinking about GC / memory consumption here. I think it's fine, given the various props were referenced by lower schemas anyway, so we're probably not introducing any significant impact to the memory consumption of schemas here, but ideally someone else would also confirm that.
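
As a rough illustration of why the constructor arguments need to be retained, here is a simplified, self-contained sketch. MiniRecord, MiniEmptyObject, and the minSize option are hypothetical stand-ins (the key type is left out for brevity), and the shape of ExtendsDeepOptions is assumed.

```typescript
type Unknowns = 'allow' | 'ignore' | 'forbid';
type Dict = Record<string, unknown>;

// Assumed shape; only the unknowns option is modelled here.
interface ExtendsDeepOptions {
  unknowns?: Unknowns;
}

interface MiniType<V> {
  validate(value: unknown): V;
  extendsDeep(options: ExtendsDeepOptions): MiniType<V>;
}

// Hypothetical object type with no declared props: every key is unknown.
class MiniEmptyObject implements MiniType<Dict> {
  constructor(public readonly unknowns: Unknowns = 'forbid') {}

  validate(value: unknown): Dict {
    const input = value as Dict;
    const keys = Object.keys(input);
    if (this.unknowns === 'forbid' && keys.length > 0) {
      throw new Error(`unknown key [${keys[0]}]`);
    }
    return this.unknowns === 'allow' ? { ...input } : {};
  }

  extendsDeep(options: ExtendsDeepOptions): MiniEmptyObject {
    return new MiniEmptyObject(options.unknowns ?? this.unknowns);
  }
}

interface MiniRecordOptions {
  minSize?: number;
}

class MiniRecord<V> implements MiniType<Record<string, V>> {
  // Retained only so that extendsDeep can rebuild an equivalent instance.
  private readonly valueType: MiniType<V>;
  private readonly options: MiniRecordOptions;

  constructor(valueType: MiniType<V>, options: MiniRecordOptions = {}) {
    this.valueType = valueType;
    this.options = options;
  }

  validate(value: unknown): Record<string, V> {
    const input = value as Dict;
    const keys = Object.keys(input);
    if (keys.length < (this.options.minSize ?? 0)) throw new Error('record is too small');
    const output: Record<string, V> = {};
    for (const key of keys) output[key] = this.valueType.validate(input[key]);
    return output;
  }

  // The clone reuses the stored options object and extends the value type.
  extendsDeep(options: ExtendsDeepOptions): MiniRecord<V> {
    return new MiniRecord(this.valueType.extendsDeep(options), this.options);
  }
}

const strict = new MiniRecord(new MiniEmptyObject(), { minSize: 1 });
const lenient = strict.extendsDeep({ unknowns: 'ignore' });
const cleaned = lenient.validate({ a: { stale: true } });
console.log(cleaned);
```

In this sketch the clone shares the same options object by reference, so the extra fields add references rather than copies of the data.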

packages/kbn-config-schema/src/types/type.ts

pgayvallet · 2023-05-03T09:01:19Z

packages/kbn-config-schema/src/types/conditional_type.test.ts

+      const result = type
+        .extendsDeep({ unknowns: 'allow' })


(unrelated to this line) I wonder if extendsDeep will be sufficient to cover our 'eviction schema' needs. Aren't there going to be scenarios where teams need a more fine-grained configuration, e.g. ignore for some props, allow for some others, and so on?

I guess in that case, they can still just manually redefine the schema (if such a scenario even makes sense).

mikecote · 2023-05-03

I can only explain how I plan to use this, but feel free to propose changes. In the example of Task Manager, we plan to evict unknown properties when reading from the task.state object while preventing tasks from running if task.params contains unknown properties (e.g. properties created by a newer Kibana version).

We would build two different task schema objects to satisfy these needs:

- The schema for reads would ignore unknowns for task.state while forbidding unknowns for task.params.
- The schema for writes would forbid unknowns for both task.state and task.params.

Since we have control of the params schema by task type and the state schema by task type, we can intercept those and only apply .extendsDeep(...) as needed before building the task read/write schema for a given task type.
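
A minimal sketch of that read/write split, using a hypothetical stand-in (MiniObject) rather than @kbn/config-schema. The field names runs and target are made up for the example, and the stand-in only checks strings and unknown keys, not missing required keys.

```typescript
type Unknowns = 'allow' | 'ignore' | 'forbid';
type Dict = Record<string, unknown>;
// A prop is either a string leaf or a nested object schema.
type Props = { [key: string]: 'string' | MiniObject };

// Hypothetical stand-in; not the @kbn/config-schema implementation.
class MiniObject {
  constructor(
    private readonly props: Props,
    private readonly unknowns: Unknowns = 'forbid'
  ) {}

  extendsDeep(options: { unknowns: Unknowns }): MiniObject {
    const props: Props = {};
    for (const key of Object.keys(this.props)) {
      const prop = this.props[key];
      props[key] = prop === 'string' ? prop : prop.extendsDeep(options);
    }
    return new MiniObject(props, options.unknowns);
  }

  validate(value: unknown): Dict {
    const input = value as Dict;
    const output: Dict = {};
    for (const key of Object.keys(input)) {
      if (Object.prototype.hasOwnProperty.call(this.props, key)) {
        const prop = this.props[key];
        if (prop !== 'string') {
          output[key] = prop.validate(input[key]);
        } else if (typeof input[key] === 'string') {
          output[key] = input[key];
        } else {
          throw new Error(`[${key}]: expected a string`);
        }
      } else if (this.unknowns === 'forbid') {
        throw new Error(`unknown key [${key}]`);
      } else if (this.unknowns === 'allow') {
        output[key] = input[key];
      }
      // 'ignore': the key is dropped
    }
    return output;
  }
}

// Hypothetical per-task-type schemas that the task type author controls.
const stateSchema = new MiniObject({ runs: 'string' });
const paramsSchema = new MiniObject({ target: 'string' });

// Read: evict unknown state, refuse unknown params.
const readSchema = new MiniObject({
  state: stateSchema.extendsDeep({ unknowns: 'ignore' }),
  params: paramsSchema.extendsDeep({ unknowns: 'forbid' }),
});
// Write: refuse unknowns in both.
const writeSchema = new MiniObject({
  state: stateSchema.extendsDeep({ unknowns: 'forbid' }),
  params: paramsSchema.extendsDeep({ unknowns: 'forbid' }),
});

const task = { state: { runs: '3', legacy: 'x' }, params: { target: 'a' } };
const read = readSchema.validate(task);
console.log(read);
```

Here extendsDeep is applied to the state and params schemas separately before they are composed, so each part keeps its own policy inside the combined task schema.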


mikecote · 2023-05-04T13:03:37Z

@elasticmachine merge upstream

kibana-ci · 2023-05-04T14:07:58Z

💚 Build Succeeded

Buildkite Build
Commit: dedf230

Metrics [docs]

Public APIs missing comments

Total count of every public API that lacks a comment. Target amount is 0. Run node scripts/build_api_docs --plugin [yourplugin] --stats comments for more detailed information.

id	before	after	diff
`@kbn/config-schema`	127	133	+6

Async chunks

Total size of all lazy-loaded chunks that will be downloaded as the user navigates the app

id	before	after	diff
`aiops`	792.4KB	794.1KB	+1.6KB

Public APIs missing exports

Total count of every type that is part of your API that should be exported but is not. This will cause broken links in the API documentation system. Target amount is 0. Run node scripts/build_api_docs --plugin [yourplugin] --stats exports for more detailed information.

id	before	after	diff
`@kbn/config-schema`	17	18	+1

Unknown metric groups

API count

id	before	after	diff
`@kbn/config-schema`	129	135	+6

ESLint disabled line counts

id	before	after	diff
`enterpriseSearch`	19	21	+2
`securitySolution`	398	401	+3
total			+5

Total ESLint disabled count

id	before	after	diff
`enterpriseSearch`	20	22	+2
`securitySolution`	478	481	+3
total			+5

History

💔 Build #125153 failed 132cfed
💛 Build #125115 was flaky 532f5ee
💛 Build #124902 was flaky 8d5abe3
💛 Build #124542 was flaky ac50762
💔 Build #124375 failed f3cd265

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

cc @mikecote

pgayvallet

LGTM


mikecote added 3 commits April 19, 2023 14:59

Initial commit

0302a5d

Cleanup code

262805f

Update POC

f3cd265

mikecote self-assigned this Apr 28, 2023

kibanamachine and others added 4 commits May 1, 2023 12:39

Merge branch 'main' into config-schema/poc-extendsDeep

ac50762

Merge branch 'main' of github.com:elastic/kibana into config-schema/poc-extendsDeep

0be20c9

Add tests

db1c624

Merge branch 'config-schema/poc-extendsDeep' of github.com:mikecote/kibana into config-schema/poc-extendsDeep

8d5abe3

pgayvallet reviewed May 3, 2023


mikecote and others added 4 commits May 3, 2023 07:41

Set updated schema into variables

fc31e32

[CI] Auto-commit changed files from 'node scripts/precommit_hook.js --ref HEAD~1..HEAD --fix'

532f5ee

Share OptionsForUnknowns

0f1357b

Merge branch 'config-schema/poc-extendsDeep' of github.com:mikecote/kibana into config-schema/poc-extendsDeep

132cfed

Merge branch 'main' into config-schema/poc-extendsDeep

dedf230

mikecote changed the title ~~[POC] Making config-schema extensible for handling of unknown fields~~ Make config-schema extensible for handling of unknown fields May 4, 2023

mikecote marked this pull request as ready for review May 4, 2023 17:42

mikecote requested a review from a team as a code owner May 4, 2023 17:42

mikecote added the Team:Core, release_note:skip, and v8.9.0 labels May 4, 2023

pgayvallet approved these changes May 5, 2023


mikecote merged commit 1cab306 into elastic:main May 5, 2023

kibanamachine added the backport:skip label May 5, 2023

pgayvallet mentioned this pull request Sep 19, 2023

Warn instead of failing to start when there is an unknown value in kibana.yml #166481

Closed
