removing resync and updating some logs #292
Conversation
This is to further assist with issues we've seen. Based upon our deeper understanding of resync, it does not need to run at all - removing it eliminates a potential source of events. The kafkaCluster log level is too high for the information it provides, and the resource events should show their resource version, which will help correlate them to the state seen in the yaml output of the resources.
@@ -74,33 +74,33 @@ void onStart(@Observes StartupEvent ev) {
         // TODO: should we make the resync time configurable?
         kafkaSharedIndexInformer =
-                sharedInformerFactory.sharedIndexInformerFor(Kafka.class, KafkaList.class, operationContext, 60 * 1000L);
+                sharedInformerFactory.sharedIndexInformerFor(Kafka.class, KafkaList.class, operationContext, 0);
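For context, a minimal sketch of how the informer setup around the changed line could look, assuming fabric8 kubernetes-client 5.x and an event handler field named kafkaEventHandler (that handler name is illustrative, not necessarily what this repo uses). With a resync period of 0, the informer only delivers events driven by the watch stream; no periodic resync is scheduled.

```java
import io.fabric8.kubernetes.client.informers.SharedIndexInformer;

// Sketch only: register the informer with no resync period and attach the handler.
SharedIndexInformer<Kafka> kafkaSharedIndexInformer =
        sharedInformerFactory.sharedIndexInformerFor(Kafka.class, KafkaList.class, operationContext, 0);
kafkaSharedIndexInformer.addEventHandler(kafkaEventHandler);

// Start all informers; each lists once, then keeps its local cache up to date via the watch.
sharedInformerFactory.startAllRegisteredInformers();
```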
Without the resync, how does it work in case of lost events?
I need to look more at what the behavior is on 5.0.0 with the 0 setting, but based upon fabric8io/kubernetes-client#2812 they have changed the implementation so that resync is purely in memory - it won't contact the API server at all.
Missed events are assumed to not be possible as long as a watch is active. Once a watch is closed, a reconnect is performed. Once connected again, the logic performs the necessary catch-up - that is where we saw the delete bug, because the catch-up logic did not update the index in that case.
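To make the "purely in memory" point concrete, here is a rough sketch (my own illustration, not fabric8's actual internals) of what such a resync amounts to: the informer walks its local cache and re-delivers each cached object as an update, without ever calling the API server.

```java
import java.util.Map;
import io.fabric8.kubernetes.client.informers.ResourceEventHandler;

// Illustrative only - not the real fabric8 code path.
class InMemoryResyncSketch<T> {
    void resync(Map<String, T> localCache, ResourceEventHandler<T> handler) {
        for (T cached : localCache.values()) {
            // Old and new are the same cached object, so the resourceVersion is
            // identical and nothing is actually refreshed from the cluster.
            handler.onUpdate(cached, cached);
        }
    }
}
```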
I don't know the fabric8 internals, but when you say "resync is purely in memory"... what is it resyncing with if it doesn't contact the API server? My understanding was that resync meant refreshing the in-memory cache by contacting the API server.
They made a mistake in 5.0.0 compared to the Go client logic: fabric8 was performing a relist on the resync interval. They are now removing that based upon the linked issue. So the expectation is that resync simply generates update events for everything that exists in the cache - that's it. Since we are ignoring these events anyway (with the resource version difference checks), we don't need any resync.
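As a sketch of the resource version difference check mentioned here (class and method names are illustrative, not the exact code in this repo, and the Kafka type is assumed to be the strimzi-api custom resource), an onUpdate that drops events whose resourceVersion has not changed makes resync-generated updates a no-op:

```java
import java.util.Objects;
import io.fabric8.kubernetes.client.informers.ResourceEventHandler;
import io.strimzi.api.kafka.model.Kafka;

class KafkaEventHandlerSketch implements ResourceEventHandler<Kafka> {
    @Override
    public void onAdd(Kafka kafka) {
        emitChange(kafka);
    }

    @Override
    public void onUpdate(Kafka oldKafka, Kafka newKafka) {
        // Resync re-delivers the cached object, so old and new share the same
        // resourceVersion; skip those to avoid emitting spurious events.
        if (Objects.equals(oldKafka.getMetadata().getResourceVersion(),
                           newKafka.getMetadata().getResourceVersion())) {
            return;
        }
        emitChange(newKafka);
    }

    @Override
    public void onDelete(Kafka kafka, boolean deletedFinalStateUnknown) {
        emitChange(kafka);
    }

    private void emitChange(Kafka kafka) {
        // update internal state / notify listeners (application-specific)
    }
}
```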
:-o ... so this fabric8 resync sounds more dangerous than useful imho
At this point, yes it is. We thought we were relying on behavior that was a mistake... So it's best to turn it off now.
(cherry picked from commit ea96db7)