Cluster - Moved/Ask Redirection Not Working Properly. #1693

Tbone542 · 2020-01-03T16:44:47Z

Expected behaviour

Running a cluster and using the slot cache. When I migrate keys I would expect the slot cache to update if it receives an ASK or MOVE.

Actual behaviour

Upon requesting the data from a key on a migrated slot, the cache does not seem to update and continually requests keys from the wrong node. If I restart apache the slot cache is refreshed and works fine.

Update: It appears ASK and MOVE redirects are not being handled properly.

I'm seeing this behaviour on

OS: linux
Redis: 5.05
PHP: 7.1
phpredis: 5.0.2 and 5.1.1

Steps to reproduce, backtrace or example script

I've checked

[ x ] There is no similar issue from other users
[ x ] Issue isn't fixed in develop branch

The text was updated successfully, but these errors were encountered:

michael-grunder · 2020-01-03T17:34:11Z

Thanks for the report. I do have logic to invalidate the cache in the event we receive any MOVE or ASK response, but perhaps it's broken.

Hopefully I can replicate the problem.

Tbone542 · 2020-01-03T18:02:14Z

Thank you Michael. I tried to replicate locally and was unsuccessful. The issue became apparent when I performed a small test migration on production. Maybe it has to do with the number of connected clients? It was difficult to emulate locally having hundreds of simultaneous clients.

Tbone542 · 2020-01-06T19:57:44Z

Turns out the boxes with the issue had 5.0.2 not 5.1.1. Could this be part of the problem?

I didn't see anything in the change log that seemed pertinent.

Thanks again,
Ted

michael-grunder · 2020-01-06T20:25:41Z

There were a couple of changes to the caching code between 5.0.2 and 5.1.1 although I don't recall if they would've been likely to prevent cache invalidation.

This is good to know though, as I'll make sure to check if invalidation is broken in 5.0.2 and not in 5.1.1. Might get lucky and make investigation simpler! 😄

Tbone542 · 2020-01-07T17:51:25Z

Thanks again. After another try with migrating a few slots, it appeared as if the issue arose on only some of the httpd processes and not all. Does that make any sense? i.e. Most requests had no issues finding the right slots, however a few would have problems contacting the cluster.

michael-grunder · 2020-01-07T20:04:08Z

Sure that makes sense. I assume it's a cache invalidation problem that only happens sometimes, which honestly means it might be trickier to track down.

It's been a while since I've used apache. Are you using prefork or workers (in case I need to use a VM to replicate the problem).

Also a uname -a might end up being helpful.

Tbone542 · 2020-01-07T21:04:29Z

It is on an Amazon Linux distribution, using prefork.

Is the cluster cache shared among children?

When the call fails, it is returning a (boolean) false.
i.e.
return $this->redis->hgetall('somekey');

Thanks again for your help.

michael-grunder · 2020-01-08T20:07:34Z

Is the cluster cache shared among children?

No I think each fork will have its own persistent_list hash table where we store cached slot information and any persistent connections.

Can you provide the output from phpinfo() so I can set up the VM with the same settings?

Tbone542 · 2020-01-08T20:43:58Z

Which sections do you need?

PHP Version => 7.0.33

redis

Redis Support => enabled
Redis Version => 5.0.2
Available serializers => php, json

Directive => Local Value => Master Value
redis.arrays.algorithm => no value => no value
redis.arrays.auth => no value => no value
redis.arrays.autorehash => 0 => 0
redis.arrays.connecttimeout => 0 => 0
redis.arrays.consistent => 0 => 0
redis.arrays.distributor => no value => no value
redis.arrays.functions => no value => no value
redis.arrays.hosts => no value => no value
redis.arrays.index => 0 => 0
redis.arrays.lazyconnect => 0 => 0
redis.arrays.names => no value => no value
redis.arrays.pconnect => 0 => 0
redis.arrays.previous => no value => no value
redis.arrays.readtimeout => 0 => 0
redis.arrays.retryinterval => 0 => 0
redis.clusters.auth => no value => no value
redis.clusters.cache_slots => 1 => 1
redis.clusters.persistent => 0 => 0
redis.clusters.read_timeout => 0 => 0
redis.clusters.seeds => no value => no value
redis.clusters.timeout => 0 => 0
redis.pconnect.connection_limit => 0 => 0
redis.pconnect.pooling_enabled => 1 => 1
redis.session.lock_expire => 0 => 0
redis.session.lock_retries => 10 => 10
redis.session.lock_wait_time => 2000 => 2000
redis.session.locking_enabled => 0 => 0

michael-grunder · 2020-01-08T21:09:57Z

That should work. Last question, are you using persistent connections with RedisCluster?

// e.g. passing `true` to the persistent argument
$obj_cluster = new RedisCluster(NULL, ["host:7000", "host:7001"], 1.5, 1.5, true);

Tbone542 · 2020-01-08T22:26:37Z

Yes, persistent connections are being used.

Tbone542 · 2020-01-09T20:20:28Z

So, I did a bit of testing and was able to produce a similar result locally.

I did it using jMeter and a local VM. It was tried with both 5.0.2 and 5.1.1 and each yielded similar results, with and without slot caching enabled. Most of my trials involved 50 -100 http threads and most often 100% of the threads would return a (boolean)false during the slot migration of a particular key.

What I found is in almost every case, that the time during which a slot is being migrated, I am receiving a (boolean)false, when trying to retrieve a key from said slot.

Below is a sample. It shows the Redis monitor output from the original node, then the output from my clients during the migration and then the Redis monitor output from the receiving node, once it has migrated completely. Noticed the microtime of the failures seem to align with the timestamps in the Redis monitor output, during migration.

All of the failures seem to occur after the receiving node gets the RESTORE-ASKING command.

After reviewing the code, it appears there are facilities to deal with ASK redirection, however I am not seeing it work in practice.

Hopefully this information is of some use.

[Node 7001] 1578593676.648111 [0 192.168.56.101:57862] "HGETALL" "mykey"
[Node 7001] 1578593676.648115 [0 192.168.56.101:58466] "HGETALL" "mykey"
[Node 7001] 1578593676.648119 [0 192.168.56.101:58012] "HGETALL" "mykey"
[Node 7001] 1578593676.648122 [0 192.168.56.101:57820] "HGETALL" "mykey"
[Node 7001] 1578593676.648126 [0 192.168.56.101:57872] "HGETALL" "mykey"
[Node 7001] 1578593676.648132 [0 192.168.56.101:58422] "HGETALL" "mykey"
[Node 7001] 1578593676.648136 [0 192.168.56.101:58054] "HGETALL" "mykey"
[Node 7001] 1578593676.648141 [0 192.168.56.101:58136] "MIGRATE" "192.168.56.101" "7002" "" "0" "60000" "KEYS" "mykey"
[Node 7002] 1578593676.648237 [0 192.168.56.101:51574] "RESTORE-ASKING" "mykey" "0" "data payload"

######## Failures Start

0.67937000 1578593676-FAIL...
bool(false)
0.68299400 1578593676-FAIL...
bool(false)

0.68035300 1578593676-FAIL...
bool(false)
0.68348600 1578593676-FAIL...
bool(false)

0.67933800 1578593676-FAIL...
bool(false)
0.68297400 1578593676-FAIL...
bool(false)

######## Failures End

[Node 7002] 1578593676.684348 [0 192.168.56.101:51186] "HGETALL" "mykey"
[Node 7002] 1578593676.684426 [0 192.168.56.101:51210] "HGETALL" "mykey"
[Node 7002] 1578593676.684475 [0 192.168.56.101:51142] "HGETALL" "mykey"
[Node 7002] 1578593676.684564 [0 192.168.56.101:51186] "HGETALL" "mykey"
[Node 7002] 1578593676.684570 [0 192.168.56.101:51210] "HGETALL" "mykey"
[Node 7002] 1578593676.684574 [0 192.168.56.101:51142] "HGETALL" "mykey"
[Node 7002] 1578593676.684579 [0 192.168.56.101:51070] "HGETALL" "mykey"
[Node 7002] 1578593676.684583 [0 192.168.56.101:51636] "HGETALL" "mykey"
[Node 7002] 1578593676.684589 [0 192.168.56.101:51760] "HGETALL" "mykey"
[Node 7002] 1578593676.684593 [0 192.168.56.101:51042] "HGETALL" "mykey"
[Node 7002] 1578593676.684598 [0 192.168.56.101:51762] "HGETALL" "mykey"

michael-grunder · 2020-01-09T20:52:08Z

This is really useful, thanks!

So maybe the problem is actually just that the ASKING redirection logic is broken. I'm not sure it's been changed since I first wrote it, but a bug could have been hiding in there since this is likely a rare event.

I'm going to set up a test environment to replicate this issue this afternoon after work.

Thanks again!

Tbone542 · 2020-01-09T21:05:27Z

Excellent...

Thanks again,
Ted

Tbone542 · 2020-01-10T20:07:45Z

I have been doing some more digging.

Turns out I am actually receiving (boolean)false back when I receive the MOVED error.

If MOVED is received, should it attempt to get the key from the new node before returning false?

Thanks,
Ted

Tbone542 · 2020-01-12T17:32:48Z

More notes:
I am also getting the same behavior on the GET (have been trying HGETALL previously) command, where it is returning a boolean false.

michael-grunder · 2020-01-13T04:58:04Z

I was able to investigate the issue this evening and can replicate the problem.

It does appear there is an issue with ASK redirection, as I'm seeing errors like ASK 3580 127.0.0.1:7000 when phpredis returns false during a slot migration.

The good news is I can trigger the error with a simple cli script, which with any luck means the problem will be straightforward to solve.

I'll try to get a fix up tomorrow or at the very least provide more information about what's actually happening.

Tbone542 · 2020-01-13T21:44:34Z

That sounds fantastic.

Thanks again.
Ted

michael-grunder · 2020-01-14T05:42:15Z

Quick update. I've tracked down the issue and have a preliminary fix working. That said, I want to spend some more time with it to make sure the fix doesn't have breaking side effects.

Again, mostly untested but here is the change.

diff --git cluster_library.c cluster_library.c
index bce78bc..01e49bd 100644
--- cluster_library.c
+++ cluster_library.c
@@ -1206,8 +1206,7 @@ static int cluster_set_redirection(redisCluster* c, char *msg, int moved)
  *
  * This function will return -1 on a critical error (e.g. parse/communication
  * error, 0 if no redirection was encountered, and 1 if the data was moved. */
-static int cluster_check_response(redisCluster *c, REDIS_REPLY_TYPE *reply_type
-                                 )
+static int cluster_check_response(redisCluster *c, REDIS_REPLY_TYPE *reply_type)
 {
     size_t sz;

@@ -1232,7 +1231,7 @@ static int cluster_check_response(redisCluster *c, REDIS_REPLY_TYPE *reply_type
         }

         // Check for MOVED or ASK redirection
-        if ((moved = IS_MOVED(inbuf)) || IS_ASK(inbuf)) { // Set our redirection information
+        if ((moved = IS_MOVED(inbuf)) || IS_ASK(inbuf)) {
             /* We'll want to invalidate slot cache if we're using one */
             c->redirections++;

@@ -1380,8 +1379,7 @@ static int cluster_sock_write(redisCluster *c, const char *cmd, size_t sz,
     /* If in ASK redirection, get/create the node for that host:port, otherwise
      * just use the command socket. */
     if (c->redir_type == REDIR_ASK) {
-        redis_sock = cluster_get_asking_sock(c);
-        if (cluster_send_asking(redis_sock) < 0) {
+        if (cluster_send_asking(c->cmd_sock) < 0) {
             return -1;
         }
     }
@@ -1627,10 +1625,13 @@ PHP_REDIS_API short cluster_send_command(redisCluster *c, short slot, const char
                return -1;
            }

-           /* Update mapping if the data has MOVED */
            if (c->redir_type == REDIR_MOVED) {
+               /* For MOVED redirection we want to update our cached mapping */
                cluster_update_slot(c);
                c->cmd_sock = SLOT_SOCK(c, slot);
+           } else if (c->redir_type == REDIR_ASK) {
+               /* For ASK redirection we want to redirect but not update slot mapping */
+               c->cmd_sock = cluster_get_asking_sock(c);
            }
         }

diff --git cluster_library.h cluster_library.h
index af69ad4..784087e 100644
--- cluster_library.h
+++ cluster_library.h
@@ -26,7 +26,7 @@
 /* MOVED/ASK comparison macros */
 #define IS_MOVED(p) (p[0]=='M' && p[1]=='O' && p[2]=='V' && p[3]=='E' && \
                      p[4]=='D' && p[5]==' ')
-#define IS_ASK(p)   (p[0]=='A' && p[1]=='S' && p[3]=='K' && p[4]==' ')
+#define IS_ASK(p)   (p[0]=='A' && p[1]=='S' && p[2]=='K' && p[3]==' ')

 /* MOVED/ASK lengths */
 #define MOVED_LEN (sizeof("MOVED ")-1)

I'll do proper testing tomorrow and get a formal branch up.

Tbone542 · 2020-01-14T14:39:23Z

Seems to be working properly and not returning FALSE on MOVED and ASK...

Thank you very much,
Ted

michael-grunder · 2020-01-14T23:24:16Z

Thanks for reporting it!

I'm going to mock up a fake redis cluster script so I generate ASK and MOVED redirections at will to make sure the changes are robust and then get them merged.

Tbone542 · 2020-01-15T21:23:53Z

Awesome. Not to be a pain, but do you have an ETA? :>)

-Ted

michael-grunder · 2020-01-16T19:04:50Z

I'm not sure about a specific ETA but it'll certainly be in the next official release.

Tbone542 · 2020-01-16T19:48:31Z

OK. As it dealt with a bug in a core feature of the cluster, I wasn't sure if it would warrant a more urgent bug release.

I was under the impression its release was imminent. I was hopeful to get a 'blessed' version before performing a needed migration.

Has the patch you pasted been tested formally, per your testing protocols? Trying to decide how to proceed.

Thanks again,
Ted

michael-grunder · 2020-01-16T23:04:24Z

I think it warrants moving to create a new release with more urgency than before the bug was identified but I tend to be cautious about releasing a core modification without a decent amount of testing.

I'll get the change into the develop branch and then we can cut a release candidate shortly thereafter.

Tbone542 · 2020-01-22T14:34:07Z

Understood. Thank you for the update.

Let me know if I can be of any assistance.

Tbone542 · 2020-01-29T14:02:55Z

Any updates on a bugfix release?

Thanks again,
Ted

michael-grunder · 2020-01-29T18:22:28Z

Yes, sorry about that. This will be merged into develop today then we can cut a release candidate.

Tbone542 · 2020-01-29T18:28:57Z

Awesome...

Thank you.

See #1693

michael-grunder · 2020-01-31T06:54:15Z

Sorry about the delay. I've opened a PR with the fix here

I also put together a mock redis cluster mechanism that I can instruct to randomly issue ASK and/or MOVE redirections to better test the logic and it appears to be working well.

Also I think it's probably a decent idea to add perform some tests while actively resharding a cluster.

See #1693

Tbone542 · 2020-02-11T16:19:00Z

Checking in to see if there are any updates on the 5.1.2 Milestone.

Thanks,
Ted

michael-grunder · 2020-02-12T04:46:26Z

Yes, release candidate coming tomorrow. I'm just putting together the changelog.

michael-grunder · 2020-02-16T18:44:43Z

5.2.0RC1 is up (both on GitHub and PECL) so I'm going to close this issue.

As always, if it's actually not fixed feel free to reopen.

michael-grunder self-assigned this Jan 3, 2020

michael-grunder added this to the 5.1.2 milestone Jan 16, 2020

Tbone542 changed the title ~~Cluster Slot Cache Not Updating on Migration.~~ Cluster - Moved/Ask Redirection Not working properly. Jan 29, 2020

Tbone542 changed the title ~~Cluster - Moved/Ask Redirection Not working properly.~~ Cluster - Moved/Ask Redirection Not Working Properly. Jan 29, 2020

michael-grunder added a commit that referenced this issue Jan 31, 2020

Initial commit of ASK redirection fix

8502562

See #1693

michael-grunder mentioned this issue Jan 31, 2020

Initial commit of ASK redirection fix #1705

Merged

michael-grunder added a commit that referenced this issue Jan 31, 2020

Initial commit of ASK redirection fix

ba73fbe

See #1693

michael-grunder closed this as completed Feb 16, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cluster - Moved/Ask Redirection Not Working Properly. #1693

Cluster - Moved/Ask Redirection Not Working Properly. #1693

Tbone542 commented Jan 3, 2020 •

edited

Loading

michael-grunder commented Jan 3, 2020

Tbone542 commented Jan 3, 2020

Tbone542 commented Jan 6, 2020

michael-grunder commented Jan 6, 2020

Tbone542 commented Jan 7, 2020

michael-grunder commented Jan 7, 2020

Tbone542 commented Jan 7, 2020

michael-grunder commented Jan 8, 2020

Tbone542 commented Jan 8, 2020 •

edited

Loading

michael-grunder commented Jan 8, 2020

Tbone542 commented Jan 8, 2020

Tbone542 commented Jan 9, 2020 •

edited

Loading

michael-grunder commented Jan 9, 2020

Tbone542 commented Jan 9, 2020

Tbone542 commented Jan 10, 2020

Tbone542 commented Jan 12, 2020

michael-grunder commented Jan 13, 2020 •

edited

Loading

Tbone542 commented Jan 13, 2020

michael-grunder commented Jan 14, 2020

Tbone542 commented Jan 14, 2020

michael-grunder commented Jan 14, 2020

Tbone542 commented Jan 15, 2020

michael-grunder commented Jan 16, 2020

Tbone542 commented Jan 16, 2020 •

edited

Loading

michael-grunder commented Jan 16, 2020

Tbone542 commented Jan 22, 2020

Tbone542 commented Jan 29, 2020

michael-grunder commented Jan 29, 2020

Tbone542 commented Jan 29, 2020

michael-grunder commented Jan 31, 2020

Tbone542 commented Feb 11, 2020

michael-grunder commented Feb 12, 2020

michael-grunder commented Feb 16, 2020

Cluster - Moved/Ask Redirection Not Working Properly. #1693

Cluster - Moved/Ask Redirection Not Working Properly. #1693

Comments

Tbone542 commented Jan 3, 2020 • edited Loading

Expected behaviour

Actual behaviour

I'm seeing this behaviour on

Steps to reproduce, backtrace or example script

I've checked

michael-grunder commented Jan 3, 2020

Tbone542 commented Jan 3, 2020

Tbone542 commented Jan 6, 2020

michael-grunder commented Jan 6, 2020

Tbone542 commented Jan 7, 2020

michael-grunder commented Jan 7, 2020

Tbone542 commented Jan 7, 2020

michael-grunder commented Jan 8, 2020

Tbone542 commented Jan 8, 2020 • edited Loading

michael-grunder commented Jan 8, 2020

Tbone542 commented Jan 8, 2020

Tbone542 commented Jan 9, 2020 • edited Loading

michael-grunder commented Jan 9, 2020

Tbone542 commented Jan 9, 2020

Tbone542 commented Jan 10, 2020

Tbone542 commented Jan 12, 2020

michael-grunder commented Jan 13, 2020 • edited Loading

Tbone542 commented Jan 13, 2020

michael-grunder commented Jan 14, 2020

Tbone542 commented Jan 14, 2020

michael-grunder commented Jan 14, 2020

Tbone542 commented Jan 15, 2020

michael-grunder commented Jan 16, 2020

Tbone542 commented Jan 16, 2020 • edited Loading

michael-grunder commented Jan 16, 2020

Tbone542 commented Jan 22, 2020

Tbone542 commented Jan 29, 2020

michael-grunder commented Jan 29, 2020

Tbone542 commented Jan 29, 2020

michael-grunder commented Jan 31, 2020

Tbone542 commented Feb 11, 2020

michael-grunder commented Feb 12, 2020

michael-grunder commented Feb 16, 2020

Tbone542 commented Jan 3, 2020 •

edited

Loading

Tbone542 commented Jan 8, 2020 •

edited

Loading

Tbone542 commented Jan 9, 2020 •

edited

Loading

michael-grunder commented Jan 13, 2020 •

edited

Loading

Tbone542 commented Jan 16, 2020 •

edited

Loading