Extract and optimize group member calculation #1937

tnqn · 2021-03-03T17:13:29Z

Previously grouping member calculation was coupled with the NetworkPolicyController, and there were two kinds of heavy calculation in the process:
For each event like Pod add, it needed to find all affected groups by matching the new Pod's labels with all groups that can potentially select it. If it was an update event, it even needed to perform the process one more time for its old labels. Then it triggered the sync process of the affected groups.
For the sync process of a group, it needed to find all Pods or ExternalEntities by matching its selectors with all entities that can potentially match it.
For each pair of Pod and Group, they would be matched at least twice. Besides, whenever the group was processed another time, it would again scan all Pods.

The repeated calculation can be eliminated by caching the result of the first matching process when processing Pod events. Then the matching process will only occur when there's a new Pod or a new Group.
Besides, most Pods actually share same labels with others when they are managed by same controller. The cache can be further optimized to maintain the relationship between label set and label selector, making Pods that having same labels and Groups that having same selectors share the cache.

This patch introduces an interface which is responsible for group member calculation and an implementation implementing it with the above optimization. The interface is then consumed by NetworkPolicyController and features like EndpointQuerier and ClusterGroup to retrieve Pods/ExternalEntities selected by a given Group or Groups that select a given Pod. Besides, other controllers that have the group logic like EgressPolicy can consume it directly without having redundant code with NetworkPolicyController.

Performance impact:

name                                       old time/op    new time/op    delta
InitXLargeScaleWithSmallNamespaces-48         3.65s ±14%     4.35s ± 7%  +19.20%  (p=0.016 n=5+5)
InitXLargeScaleWithLargeNamespaces-48         5.34s ± 2%     1.48s ± 5%  -72.34%  (p=0.008 n=5+5)
InitXLargeScaleWithOneNamespace-48            3.22s ± 8%     3.46s ± 2%     ~     (p=0.056 n=5+5)
InitXLargeScaleWithNetpolPerPod-48            45.9s ±11%     25.6s ± 4%  -44.18%  (p=0.008 n=5+5)
InitXLargeScaleWithClusterScopedNetpol-48     1.40s ± 3%     1.18s ± 5%  -16.08%  (p=0.008 n=5+5)

name                                       old alloc/op   new alloc/op   delta
InitXLargeScaleWithSmallNamespaces-48         824MB ± 0%     900MB ± 0%   +9.22%  (p=0.008 n=5+5)
InitXLargeScaleWithLargeNamespaces-48         629MB ± 0%     321MB ± 0%  -49.00%  (p=0.008 n=5+5)
InitXLargeScaleWithOneNamespace-48            921MB ± 0%     922MB ± 0%   +0.13%  (p=0.008 n=5+5)
InitXLargeScaleWithNetpolPerPod-48           3.39GB ± 0%    0.13GB ± 0%  -96.19%  (p=0.008 n=5+5)
InitXLargeScaleWithClusterScopedNetpol-48     279MB ± 0%     244MB ± 0%  -12.59%  (p=0.008 n=5+5)

name                                       old allocs/op  new allocs/op  delta
InitXLargeScaleWithSmallNamespaces-48         12.0M ± 0%     12.6M ± 0%   +5.35%  (p=0.008 n=5+5)
InitXLargeScaleWithLargeNamespaces-48         4.29M ± 0%     4.48M ± 0%   +4.63%  (p=0.008 n=5+5)
InitXLargeScaleWithOneNamespace-48            1.39M ± 0%     1.40M ± 0%   +0.74%  (p=0.008 n=5+5)
InitXLargeScaleWithNetpolPerPod-48            1.66M ± 0%     1.75M ± 0%   +5.52%  (p=0.008 n=5+5)
InitXLargeScaleWithClusterScopedNetpol-48     3.07M ± 0%     3.20M ± 0%   +4.04%  (p=0.008 n=5+5)

Explanation:

InitXLargeScaleWithSmallNamespaces has only 4 Pods in each Namespace and all NetworkPolicies are namespace scoped, so even without the index, each policy just needs to scan 4 Pods at most. Maintaing the index adds few overhead, hence the minor increment in time and memory.
InitXLargeScaleWithLargeNamespaces has 1000 Pods and 100 NetworkPolicies in each Namespace, each NetworkPolicy applies to 10 Pods. Pods that have same labels will only be scaned once so many calculation is saved. When syncing groups, the result can be got directly without listing all Pods in the Namespace first, so calculation and memory is saved.
InitXLargeScaleWithOneNamespace just has many Pods but only 1 namespace, 1 AppliedToGroup and 1 AddressGroup in total, so having the index doesn't help anything.
InitXLargeScaleNetpolPerPod has only 1 namespace, 10000 Pods and 1 NetworkPolicy per Pod. It can benefit from the index greatly for the same reason as InitXLargeScaleWithLargeNamespaces.

tnqn · 2021-03-03T17:53:26Z

@jianjuns @antoninbas @Dyanngg Please let me know if the change makes sense to you. And whether it can help nested group feature or make it harder? @Dyanngg
I will add performance impact later.

codecov-io · 2021-03-03T18:01:19Z

Codecov Report

Merging #1937 (0b7d0d4) into main (e80ab3b) will increase coverage by 2.34%.
The diff coverage is 83.56%.

@@            Coverage Diff             @@
##             main    #1937      +/-   ##
==========================================
+ Coverage   64.35%   66.69%   +2.34%     
==========================================
  Files         193      197       +4     
  Lines       16967    17174     +207     
==========================================
+ Hits        10919    11455     +536     
+ Misses       4899     4551     -348     
- Partials     1149     1168      +19

Flag	Coverage Δ
e2e-tests	`26.55% <37.11%> (?)`
kind-e2e-tests	`56.39% <67.34%> (+1.18%)`	⬆️
unit-tests	`42.17% <82.86%> (+0.52%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
pkg/controller/networkpolicy/store/group.go	`80.00% <ø> (+8.00%)`	⬆️
pkg/controller/types/group.go	`0.00% <0.00%> (ø)`
pkg/controller/grouping/controller.go	`64.64% <64.64%> (ø)`
pkg/controller/networkpolicy/clustergroup.go	`87.79% <87.50%> (+0.22%)`	⬆️
pkg/controller/grouping/group_entity_index.go	`94.58% <94.58%> (ø)`
pkg/controller/networkpolicy/crd_utils.go	`88.60% <100.00%> (-1.61%)`	⬇️
pkg/controller/networkpolicy/endpoint_querier.go	`91.42% <100.00%> (+4.94%)`	⬆️
...ntroller/networkpolicy/networkpolicy_controller.go	`84.61% <100.00%> (+0.49%)`	⬆️
pkg/apis/controlplane/sets.go	`33.33% <0.00%> (-12.83%)`	⬇️
pkg/apiserver/certificate/cacert_controller.go	`55.83% <0.00%> (-10.01%)`	⬇️
... and 33 more

Dyanngg · 2021-03-03T22:12:43Z

@jianjuns @antoninbas @Dyanngg Please let me know if the change makes sense to you. And whether it can help nested group feature or make it harder? @Dyanngg
I will add performance impact later.

I'm still reviewing the GroupEntityIndex interface, but after the first look I don't think it will make nested group computation easier or more complicated. It's just that the nested group member calc logic would be entirely re-written, since right now I rely on the groupMember field of the internalGroup to union nested groupMembers, which is removed by this PR.

antoninbas

I am fine with the approach. What's the memory usage impact of the change when running the benchmarks in networkpolicy_controller_perf_test.go?

pkg/controller/grouping/controller.go

pkg/controller/networkpolicy/clustergroup.go

pkg/controller/grouping/group_entity_index.go

pkg/controller/grouping/group_entity_index_test.go

pkg/controller/networkpolicy/networkpolicy_controller_perf_test.go

tnqn · 2021-03-04T01:59:59Z

@jianjuns @antoninbas @Dyanngg Please let me know if the change makes sense to you. And whether it can help nested group feature or make it harder? @Dyanngg
I will add performance impact later.

I'm still reviewing the GroupEntityIndex interface, but after the first look I don't think it will make nested group computation easier or more complicated. It's just that the nested group member calc logic would be entirely re-written, since right now I rely on the groupMember field of the internalGroup to union nested groupMembers, which is removed by this PR.

I removed the field as it was consumed by GetGroupMembers, populateAddressGroupMemberSet and getAppliedToWorkloads when they need to know the members of a clustergroup but with the change it can be achieved by querying the GroupEntityIndex directly, so the internalGroup itself doesn't have to store the redundant data. I can imagine that in nested group case, you want to avoid calculate the union of the members again when querying its member, right? If I understand correctly, I can add this field back or let's see whether there is more efficient way.

tnqn

@Dyanngg and @antoninbas thanks for your quick review and feedback.

pkg/controller/grouping/controller.go

pkg/controller/grouping/group_entity_index_test.go

pkg/controller/networkpolicy/clustergroup.go

pkg/controller/networkpolicy/networkpolicy_controller_perf_test.go

jianjuns

I also wonder the memory usage.

And do you think any in-memory store implementation can help (compared to defining all maps case by case)?

pkg/controller/grouping/group_entity_index.go

jianjuns · 2021-03-05T01:19:53Z

pkg/controller/grouping/group_entity_index.go

+	labelItems map[string]*labelItem
+	// labelIndex is nested map from entityType to Namespace to keys of labelItems.
+	// It's used to filter potential labelItems when matching a namespace scoped selectorItem.
+	labelIndex map[entityType]map[string]sets.String


Is it more efficient to create two separate maps for Pods and ExternalEntities?

Or to share code, we can convert them to one internal type?

The code of matching Pods and ExtenalEntities is already shared and they are treated in same way when stored in labelItems.
Separating them in labelIndex is to reduce the potential labelItems a selector needs to match. This is because a selector can either select ExternalEntity or Pod. If it's a Pod selector, this is no need to try ExternalEntity labelItems. The same reason applies to selectorIndex.

tnqn · 2021-03-10T17:37:29Z

@jianjuns @antoninbas I updated benchmark test result in PR description. To get accurate cpu and memory metrics, I used benchmark test and execute the processing in serial manner.

antoninbas

the changes look good to me, and so do the benchmark results

antoninbas · 2021-03-10T20:47:06Z

pkg/controller/grouping/group_entity_index.go

+	// GetEntities returns the selected Pods or ExternalEntities for the given group.
+	GetEntities(groupType string, name string) ([]*v1.Pod, []*v1alpha2.ExternalEntity)
+	// GetGroupsForPod returns the groups that select the given Pod.
+	GetGroupsForPod(namespace, name string) (map[string][]string, bool)


do you think we could define a new type for the group type: type GroupType string? It would help clarify what the returned values are if we have map[GroupType][]string instead of map[string][]string.

Maybe an enum?

I defined GroupType and listed all values here at the begining, then found it makes consumers must add their own type here. Maybe I should define GroupType (using string) here and let consumers define their own values in their own packages, does it make sense to you?

I feel enum (int) also works, but I guess your concern is harder to track the int values when they are not in the same package?

Maybe I should define GroupType (using string) here and let consumers define their own values in their own packages, does it make sense to you?

That works for me as a middle ground between using a plain string and defining named constants for all group types in this package.

@jianjuns Yes, if using enum(int), we should define them together in this package to avoid conflict.

I think it is just like string is easier to remember and has less chance to conflict (but can still). I do not like long string lookup, but I understand your point and no strong opinion.

pkg/controller/grouping/group_entity_index.go

pkg/controller/networkpolicy/networkpolicy_controller_perf_test.go

Dyanngg

LGTM overall

pkg/controller/grouping/group_entity_index.go

jianjuns · 2021-03-11T03:54:50Z

pkg/controller/networkpolicy/networkpolicy_controller.go

@@ -81,6 +82,10 @@ const (
 	PriorityIndex = "priority"
 	// ClusterGroupIndex is used to index ClusterNetworkPolicies by ClusterGroup names.
 	ClusterGroupIndex = "clustergroup"
+
+	appliedToGroupType = "appliedToGroup"


Then do you like to change this one to enum too?

jianjuns · 2021-03-11T04:04:41Z

pkg/controller/grouping/group_entity_index.go

+	// GetEntities returns the selected Pods or ExternalEntities for the given group.
+	GetEntities(groupType string, name string) ([]*v1.Pod, []*v1alpha2.ExternalEntity)
+	// GetGroupsForPod returns the groups that select the given Pod.
+	GetGroupsForPod(namespace, name string) (map[string][]string, bool)


Maybe an enum?

pkg/controller/grouping/group_entity_index.go

tnqn · 2021-03-11T17:37:19Z

/test-all

tnqn · 2021-03-16T15:12:23Z

/test-all

tnqn · 2021-03-16T15:53:35Z

/test-all

tnqn · 2021-03-16T16:32:41Z

@jianjuns @antoninbas @Dyanngg I added more unit tests and a method HasSynced to avoid intermediate result after restarting since your last review and have addressed all comments if I'm not missing one. Could you take another look?

Dyanngg

LGTM

antoninbas

Thanks for all the work on this. I took another quick look and it looks good to me. I found a couple nits.

pkg/controller/grouping/controller_test.go

pkg/controller/grouping/group_entity_index.go

pkg/controller/networkpolicy/networkpolicy_controller_perf_test.go

tnqn · 2021-03-17T17:41:22Z

/test-all

antoninbas

LGTM

Dyanngg · 2021-03-17T22:41:01Z

/test-e2e

jianjuns

Just a few nits.

pkg/controller/grouping/group_entity_index.go

tnqn · 2021-03-18T03:04:42Z

/test-all

pkg/controller/grouping/group_entity_index.go

Previously grouping member calculation was coupled with the NetworkPolicyController, and there were two kinds of heavy calculation in the process: For each event like Pod add, it needed to find all affected groups by matching the new Pod's labels with all groups that can potentially select it. If it was an update event, it even needed to perform the process one more time for its old labels. Then it triggered the sync process of the affected groups. For the sync process of a group, it needed to find all Pods or ExternalEntities by matching its selectors with all entities that can potentially match it. For each pair of Pod and Group, they would be matched at least twice. Besides, whenever the group was processed another time, it would again scan all Pods. The repeated calculation can be eliminated by caching the result of the first matching process when processing Pod events. Then the matching process will only occur when there's a new Pod or a new Group. Besides, most Pods actually share same labels with others when they are managed by same controller. The cache can be further optimized to maintain the relationship between label set and label selector, making Pods that having same labels and Groups that having same selectors share the cache. This patch introduces an interface which is responsible for group member calculation and an implementation implementing it with the above optimization. The interface is then consumed by NetworkPolicyController and features like EndpointQuerier and ClusterGroup to retrieve Pods/ExternalEntities selected by a given Group or Groups that select a given Pod. Besides, other controllers that have the group logic like EgressPolicy can consume it directly without having redundant code with NetworkPolicyController.

tnqn · 2021-03-18T04:30:25Z

/test-conformance
/test-all-features-conformance
/test-windows-conformance
/test-windows-networkpolicy

tnqn · 2021-03-18T05:16:58Z

/test-conformance
/test-e2e

tnqn · 2021-03-18T05:54:04Z

/test-networkpolicy
/test-e2e
/test-conformance

vmwclabot added the cla-not-required label Mar 3, 2021

tnqn force-pushed the grouping branch from 979b9db to 1584071 Compare March 3, 2021 17:48

tnqn marked this pull request as draft March 3, 2021 17:49

antoninbas reviewed Mar 3, 2021

View reviewed changes

tnqn commented Mar 4, 2021

View reviewed changes

jianjuns reviewed Mar 5, 2021

View reviewed changes

tnqn force-pushed the grouping branch 2 times, most recently from fcf9851 to a9126a2 Compare March 10, 2021 17:34

antoninbas reviewed Mar 10, 2021

View reviewed changes

Dyanngg reviewed Mar 10, 2021

View reviewed changes

pkg/controller/grouping/group_entity_index.go Outdated Show resolved Hide resolved

jianjuns reviewed Mar 11, 2021

View reviewed changes

tnqn mentioned this pull request Mar 11, 2021

Add nested ClusterGroup support #1920

Merged

tnqn force-pushed the grouping branch from 81440d2 to d65c0a2 Compare March 11, 2021 17:28

tnqn changed the title ~~[WIP] Extract grouping logic to a generic module~~ Extract grouping logic to a generic module Mar 11, 2021

tnqn marked this pull request as ready for review March 11, 2021 17:37

tnqn force-pushed the grouping branch 4 times, most recently from def159c to 17f15eb Compare March 16, 2021 15:08

tnqn changed the title ~~Extract grouping logic to a generic module~~ Extract and optimize group member calculation Mar 16, 2021

tnqn force-pushed the grouping branch from 17f15eb to 0fb2af1 Compare March 16, 2021 15:23

Dyanngg previously approved these changes Mar 17, 2021

View reviewed changes

antoninbas previously approved these changes Mar 17, 2021

View reviewed changes

tnqn dismissed stale reviews from antoninbas and Dyanngg via 6cd31c6 March 17, 2021 17:38

tnqn force-pushed the grouping branch from 0fb2af1 to 6cd31c6 Compare March 17, 2021 17:38

antoninbas previously approved these changes Mar 17, 2021

View reviewed changes

Dyanngg previously approved these changes Mar 17, 2021

View reviewed changes

jianjuns previously approved these changes Mar 17, 2021

View reviewed changes

pkg/controller/grouping/group_entity_index.go Show resolved Hide resolved

pkg/controller/grouping/group_entity_index.go Outdated Show resolved Hide resolved

pkg/controller/grouping/group_entity_index.go Outdated Show resolved Hide resolved

tnqn dismissed stale reviews from jianjuns, Dyanngg, and antoninbas via 1ea74f5 March 18, 2021 03:01

tnqn force-pushed the grouping branch from 6cd31c6 to 1ea74f5 Compare March 18, 2021 03:01

jianjuns reviewed Mar 18, 2021

View reviewed changes

pkg/controller/grouping/group_entity_index.go Outdated Show resolved Hide resolved

tnqn force-pushed the grouping branch from 1ea74f5 to 0b7d0d4 Compare March 18, 2021 03:17

jianjuns approved these changes Mar 18, 2021

View reviewed changes

tnqn merged commit 46a2fc5 into antrea-io:main Mar 18, 2021

tnqn mentioned this pull request Mar 19, 2021

[WIP] Skip processing ADD events of init Pods and Namespaces #636

Closed

tnqn deleted the grouping branch April 9, 2021 08:35

tnqn mentioned this pull request Jul 12, 2021

Network Policies are not working in a large cluster after restarting antrea-controller #2378

Closed

Extract and optimize group member calculation #1937

Extract and optimize group member calculation #1937

Conversation

tnqn commented Mar 3, 2021 • edited Loading

tnqn commented Mar 3, 2021

codecov-io commented Mar 3, 2021 • edited Loading

Codecov Report

Dyanngg commented Mar 3, 2021

antoninbas left a comment

Choose a reason for hiding this comment

tnqn commented Mar 4, 2021

tnqn left a comment

Choose a reason for hiding this comment

jianjuns left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tnqn commented Mar 10, 2021

antoninbas left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tnqn Mar 12, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Dyanngg left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tnqn commented Mar 11, 2021

tnqn commented Mar 16, 2021

tnqn commented Mar 16, 2021

tnqn commented Mar 16, 2021

Dyanngg left a comment

Choose a reason for hiding this comment

antoninbas left a comment

Choose a reason for hiding this comment

tnqn commented Mar 17, 2021

antoninbas left a comment

Choose a reason for hiding this comment

Dyanngg commented Mar 17, 2021

jianjuns left a comment

Choose a reason for hiding this comment

tnqn commented Mar 18, 2021

tnqn commented Mar 18, 2021

tnqn commented Mar 18, 2021

tnqn commented Mar 18, 2021

tnqn commented Mar 3, 2021 •

edited

Loading

codecov-io commented Mar 3, 2021 •

edited

Loading

tnqn Mar 12, 2021 •

edited

Loading