Routing logic to take both NamespaceID and workflowID into account #629

samarabbas · 2020-07-29T18:40:11Z

What changed?
Updated the routing logic to compute the routing hash from both NamespaceID and workflowID.

Why?
Cadence uses only workflowID as the key to compute the hash that routes requests to the
shard hosting the workflow execution. This is a problem because multiple namespaces can use the same
workflowID, and all of those executions would be routed to the same shard, which can create hot partitions.
This change updates the routing logic to use both namespaceID and workflowID to compute the hash key.
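
The difference between the two keying schemes can be sketched as follows. This is a minimal illustration, not the code in this PR: FNV-1a from the Go standard library stands in for the farm hash used by the server (an assumption made only to keep the sketch dependency-free), shard IDs are 0-based here, and the function names are hypothetical.

```go
package main

import "hash/fnv"

// hashString is a stand-in for the server's fingerprint function.
// Assumption: FNV-1a is used here only so the sketch needs no third-party packages.
func hashString(s string) uint32 {
	h := fnv.New32a()
	h.Write([]byte(s)) // Write on a hash.Hash never returns an error
	return h.Sum32()
}

// shardByWorkflowID models the old scheme: only workflowID feeds the hash,
// so the namespace has no influence on the shard.
func shardByWorkflowID(workflowID string, numberOfShards int) int {
	return int(hashString(workflowID) % uint32(numberOfShards))
}

// shardByNamespaceAndWorkflowID models the new scheme: the hash key is the
// namespaceID-workflowID pair.
func shardByNamespaceAndWorkflowID(namespaceID, workflowID string, numberOfShards int) int {
	return int(hashString(namespaceID+"_"+workflowID) % uint32(numberOfShards))
}

// distinctShards counts how many different shards the same workflowID maps to
// across the given namespaces under the new scheme.
func distinctShards(namespaceIDs []string, workflowID string, numberOfShards int) int {
	seen := make(map[int]struct{})
	for _, ns := range namespaceIDs {
		seen[shardByNamespaceAndWorkflowID(ns, workflowID, numberOfShards)] = struct{}{}
	}
	return len(seen)
}
```

Under the old scheme, a popular workflowID such as a fixed cron name maps to one shard no matter how many namespaces use it. Under the new scheme, `distinctShards` will typically report several shards for the same workflowID.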

How did you test it?
Unit and integration tests.

Potential risks
This change is not backwards compatible, which is why we are making it before our first production release.

samarabbas · 2020-07-29T18:44:44Z

service/worker/indexer/esProcessor.go

@@ -237,7 +237,10 @@ func (p *esProcessorImpl) hashFn(key interface{}) uint32 {
 		return 0
 	}
 	numOfShards := p.config.IndexerConcurrency()
-	return uint32(common.WorkflowIDToHistoryShard(id, numOfShards))
+


There was no need to use WorkflowIDToHistoryShard as the hashing function here. All this logic needs is a hashing function that takes a string and computes a hash key within a range, which is then used to route the call to the worker goroutine processing ES messages.
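
A generic hash of that kind could look like the sketch below. It is not the replacement line from this diff, which is not shown above. The function name `workerIndex` is hypothetical, and FNV-1a from the standard library is an assumed stand-in for whatever hash the server actually uses.

```go
package main

import "hash/fnv"

// workerIndex maps a string key to one of numWorkers worker goroutines.
// Non-string keys and non-positive worker counts fall back to index 0,
// mirroring the "return 0" guard visible in the diff context.
func workerIndex(key interface{}, numWorkers int) uint32 {
	id, ok := key.(string)
	if !ok || numWorkers <= 0 {
		return 0
	}
	h := fnv.New32a()
	h.Write([]byte(id)) // Write on a hash.Hash never returns an error
	return h.Sum32() % uint32(numWorkers)
}
```

The only requirements are that the result is deterministic for a given key and falls within the worker range, so messages for the same key are always handled by the same goroutine. Nothing here depends on history shard semantics.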

alexshtin · 2020-07-30T20:10:53Z

proto/internal/temporal/server/api/historyservice/v1/request_response.proto

@@ -403,7 +403,8 @@ message DescribeHistoryHostRequest {
    //ip:port
    string host_address = 1;
    int32 shard_id_for_host = 2;
-    temporal.api.common.v1.WorkflowExecution execution_for_host = 3;
+    string namespace_id = 3;


Why is it namespace above and namespace_id here?

I guess for consistency.

They represent two different identifiers for a namespace. namespace is used whenever the API takes the namespace name, and namespace_id is used whenever the API takes the internal UUID that represents the namespace. Internally, the system uses namespace_id as the identifier.

alexshtin · 2020-07-30T20:13:06Z

common/util.go

-	hash := farm.Fingerprint32([]byte(workflowID))
+// WorkflowIDToHistoryShard is used to map namespaceID-workflowID pair to a shardID
+func WorkflowIDToHistoryShard(namespaceID, workflowID string, numberOfShards int) int {
+	idBytes := []byte(namespaceID + "_" + workflowID)


What does the separator provide? We are hashing it right away.
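
One common reason for a separator in a composite hash key, offered here as a general observation rather than the author's stated rationale, is that plain concatenation is ambiguous: different ID pairs can produce the same byte string and therefore the same hash. The helper names below are hypothetical.

```go
package main

// keyWithoutSeparator concatenates the two IDs directly.
// ("ab", "c") and ("a", "bc") both yield "abc", so they would always hash alike.
func keyWithoutSeparator(namespaceID, workflowID string) string {
	return namespaceID + workflowID
}

// keyWithSeparator matches the key layout shown in the diff above.
// The two pairs now yield "ab_c" and "a_bc", which are different inputs to the hash.
func keyWithSeparator(namespaceID, workflowID string) string {
	return namespaceID + "_" + workflowID
}
```

A separator removes the ambiguity only if it cannot appear in the first component. Since namespace_id is described in this thread as an internal UUID, the usual hyphenated hex form contains no underscore, which would make "_" safe in that position.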

Routing logic to take both NamespaceID and workflowID into account

d4735d6

Update the routing logic to use both NamespaceID and workflowID to compute the hash used for routing logic.

samarabbas requested a review from alexshtin July 29, 2020 18:40

samarabbas commented Jul 29, 2020

samarabbas added 4 commits July 29, 2020 22:38

fix build break

2974bde

Merge branch 'master' into workflow-request-routing

face1dc

Merge branch 'master' into workflow-request-routing

ddf6818

Merge branch 'master' into workflow-request-routing

2d7bb57

alexshtin reviewed Jul 30, 2020

alexshtin approved these changes Jul 30, 2020

samarabbas added 5 commits July 30, 2020 15:19

Merge branch 'master' into workflow-request-routing

09b97ea

Merge branch 'master' into workflow-request-routing

c053377

resolve merge conflict on generated proto

0559d42

resolve merge conflict with shardId starts from 1 change

7de6a8a

Merge branch 'master' into workflow-request-routing

c55ecdf

samarabbas merged commit e299cdd into temporalio:master Jul 31, 2020

samarabbas deleted the workflow-request-routing branch July 31, 2020 01:22
