-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
KSQL should not register a schema when it's only consuming from a topic #5553
Comments
I agree we shouldn't be registering a schema here. A related issue, (linked above), is that we shouldn't be blindly overwriting otherwise compatible schemas either - but I think that's left to another discussion. Our whole handling of schemas needs a rethink. However, the challenge here is that users could produce into this topic using @agavra has the most context / knowledge here... Just another example of why |
There's also the I think there are lots of places where reliance on schema registry might not make total sense for us - but the problem is that our serdes require schema registry. Perhaps we could mock a schema registry client that actually just looks things up in our metastore instead of looking into schema registry all the time (and only falling back to schema registry for topics that aren't in our metastore - like changelog topics, which fwiw we can also change to avoid using schema registry by having our own internal topic formats). I'm not totally sure I know how I would solve this problem within the scope of 0.11 though... that's up for debate |
yeah, the so that too shouldn't be an issue longer term... Maybe we should just prioritise those KLIPs :D |
I don't disagree, but I'm not sure that's on-topic. |
Looks like this is actually a regression from the previous release caused by #4717 - I will change this to make sure that |
This issue will be fixed in the next release. Thanks for pointing it out @CHR-LeeOlsen |
Describe the bug
When creating a stream over an existing topic with value_format=avro, ksql will register a dynamically generated schema. When using producers that don't generate the schema, this can cause it to attempt to use a schema that is incompatible with our data model.
To Reproduce
Steps to reproduce the behavior, include:
Expected behavior
There should be no reason for ksql to register a schema here since it is only a consumer of the topic and would be using whatever schema was used to serialize the data.
Actual behaviour
It generates the following schema
Additional context
The auto generated schema is never used in any way because we produce to this topic ourselves. KSQL is only consuming from it. But this can break a producer under some use cases where the producer is not in control of the schema but rather using the latest version from the schema registry. There is really no value to KSQL generating this schema at all. It should only generate a schema for a topic it is writing to.
The text was updated successfully, but these errors were encountered: