Add support for Sentinel and RE Discovery Service #302

tanguylebarzic · 2012-10-10T09:47:18Z

Hi,

Are there plans to make node_redis Sentinel-aware? I'm thinking of being able to pass a list of sentinels at startup, then discover and connect to the masters, and subscribe to the sentinels' messages to take master changes into account. If not, I would be happy to start working on it!

Tanguy

joaojeronimo · 2012-10-15T16:33:56Z

+1. Why not make a wrapper on top of this node_redis client? If you start working on it, please let me know, because I'd like to star that repository.

tanguylebarzic · 2012-10-16T12:48:49Z

Hi,

I've started working on this, the result can be checked at https://github.com/tanguylebarzic/node_redis
An example of use:

var RedisMetaClient = require("./../node_redis/sentinel").RedisMetaClient;
// Sentinels to ask for the current master
var sentinels = [
    {host: "127.0.0.1", port: 26382},
    {host: "127.0.0.1", port: 26383},
    {host: "127.0.0.1", port: 26384}
];
// "mymaster" is the name of the master to look up
var redisMetaClient = new RedisMetaClient("mymaster", sentinels);
var i = 0;

// Similar to the client returned by redis.createClient()
var client = redisMetaClient.createMasterClient();
client.on('error', function(error){
    console.log(error);
});

// Write a key every 500 ms; the counter only advances on success
setInterval(function(){
    client.set('test:' + i, i, function(error, result){
        if(!error){
            console.log('success, ' + i);
            ++i;
        }
        else {
            console.log('error, ' + i);
        }
    });
}, 500);

Basically, you first create a 'RedisMetaClient', configured with the name of the master and a list of sentinels. Then, calling redisMetaClient.createMasterClient() gives you a client similar to the one returned by the usual redis.createClient().

What works:

Basic functionality, i.e. setup of the correct master, failover... If the master changes, all the current masterClients (created by redisMetaClient.createMasterClient()) will be updated to point to the new master, so that one doesn't need to listen for client.on('error') to update the connection.
I have tried to preserve the idea of an offline queue, i.e. being able to send commands before the master is found and configured.
I have tested with simple requests, pub/sub... but not with multi (it may work, I'm just not sure).

How it was done:
I've tried to modify the code and behaviour of the single client (what's in 'client.js') as little as possible. sentinel.js contains a kind of wrapper around client.js, to deal with sentinels.

What remains to be done:

Dealing with slaves (for now, nothing is done around them).
Testing (both by hand and automatic) of multiple scenarios
Documentation

Known issues:

The 'ready' event is triggered multiple times for a masterClient if a failover has happened, which can result in unexpected behaviour depending on how it is used
many others that I don't know about

benbuckman · 2012-12-03T18:46:05Z

tanguylebarzic, thanks for starting to tackle sentinel support. I'm confused by your approach, however: You seem to be implementing the sentinel functionality (e.g. identifying the master and slaves, handling a quorum) in your node.js code. But in the sentinel docs that is all set up in the redis-sentinel daemon's config. Node shouldn't care about who the master or slaves are for a given sentinel, it should simply connect to the sentinel, which is handling all that already. (And that SentinelClient should behave like a regular RedisClient, so it's backwards-compatible with connect-redis session store, etc.)

Can you clarify your approach? Thanks!

benbuckman · 2012-12-05T01:32:21Z

I've started implementing an alternate approach with a RedisSentinelClient that behaves transparently like a RedisClient: https://github.com/DocuSignDev/node_redis

Always interested in feedback, suggestions, and/or collaborators.

tanguylebarzic · 2012-12-14T14:40:47Z

newleafdigital: sorry for the late reply. Indeed, I noticed it was not clean to implement this the way I did (too much logic in the client). I've changed it to better match antirez's guidelines, although there are still some open questions IMO. Your approach looks interesting (and adds cluster awareness!); I'm going to look at it!

jochenonline · 2013-02-13T16:51:55Z

@newleafdigital: I have tried your fork together with socket.io. When I set up the RedisStore:

io.set('store', new socketio.RedisStore({
    redis: redis,
    redisPub: redis.createClient( 26379, "127.0.0.1", {sentinel: true} ),
    redisSub: redis.createClient( 26379, "127.0.0.1", {sentinel: true} ),
    redisClient: redis.createClient( 26379, "127.0.0.1", {sentinel: true} )
}));

My app crashes on startup:

 Error: Redis connection to 127.0.0.1:6379 failed - connect ECONNREFUSED
     at RedisClient.on_error (/home/green/debug/node_modules/redis/index.js:168:24)
     at Socket.RedisClient.initialize_stream_listeners.stream.on.self.should_buffer (/home/green/debug/node_modules/redis/index.js:93:14)
     at Socket.EventEmitter.emit (events.js:96:17)
     at Socket._destroy.self.errorEmitted (net.js:328:14)
     at process.startup.processNextTick.process._tickCallback (node.js:244:9)
 [Wed, 13 Feb 2013 16:46:56 GMT] INFO worker 6622 died

Please note that the error message says the connection failed on port 6379, instead of 26379!

Any idea what could be the problem? When I connect directly to the redis-server (master) everything is fine.

benbuckman · 2013-02-13T16:55:41Z

Try using a single hash for the parameters:

redis.createClient({ port: 26379, host: "127.0.0.1", sentinel: true } );

When you connect to the sentinel via redis-cli -p 26379 (not to the master; sentinel != master), does it connect?

jochenonline · 2013-02-13T17:12:28Z

Yes, connecting to the sentinel via redis-cli -p 26379 connects successfully.
Using the alternate syntax doesn't crash on connect but returns an error on the first get of the redisSentinelClient:

[Error: ERR unknown command 'get']

In that case I used the normal redis functionality (not socket.io).

jochenonline · 2013-02-13T17:38:11Z

Hmm. Maybe this is not a problem with your module... When I connect to the sentinel it connects successfully, but it does not know any Redis commands:

green@mycomp1:~/redis/redis$ src/redis-cli -p 26379
redis 127.0.0.1:26379> set test 1
(error) ERR unknown command 'set'
redis 127.0.0.1:26379>

What could be the reason for this? Just to be clear: I am connecting to the sentinel, neither to the master nor to one of the slaves. Correct?

ADDENDUM: Reading the docs in more detail, I saw that this behavior is normal. But why does your client do the same? Shouldn't it behave like a "normal" RedisClient?

benbuckman · 2013-02-13T21:55:12Z

The sentinel itself shouldn't handle get/set/etc, but the sentinel client should handle it, by delegating it to the master.
Could you put some debugging code in the SentinelClient constructor, to make sure it's instantiating a SentinelClient and not a regular client?
There's also a sentinel test suite that we added, run with Mocha -- maybe run that (changing the port) and see if those pass on your system?
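
For readers following along, the delegation described above starts with asking a sentinel where the master is. Redis Sentinel answers `SENTINEL get-master-addr-by-name <name>` with a two-element array of strings (IP and port), or a nil reply if it does not know that master. The sketch below only shows how such a reply could be turned into connection options; `parseMasterAddr` is a hypothetical helper name and is not part of node_redis or of either fork discussed here.

```javascript
// Sketch only: convert the reply of `SENTINEL get-master-addr-by-name <name>`
// into { host, port }. `parseMasterAddr` is a hypothetical helper, not an
// existing node_redis function.
function parseMasterAddr(reply) {
    // A nil reply (unknown master name) or any unexpected shape yields null.
    if (!Array.isArray(reply) || reply.length !== 2) {
        return null;
    }
    var port = parseInt(reply[1], 10);
    if (isNaN(port)) {
        return null;
    }
    return { host: String(reply[0]), port: port };
}
```

A sentinel-aware client would then open a regular connection to the returned host and port and forward get/set/etc. to it, rather than sending them to the sentinel itself.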

Thanks

jochenonline · 2013-02-15T11:38:48Z

@newleafdigital: Is it possible that you expect the sentinel to be running on the same machine as redis? I have my sentinel running on a different machine (the one on which node runs) and I doubt that this scenario works with the current implementation. I still get the error

 Error: Redis connection to 127.0.0.1:6379 failed - connect ECONNREFUSED

and that leads me to the conclusion that it tries to connect to a redis instance on 127.0.0.1, i.e. this machine.

jochenonline · 2013-02-15T14:04:50Z

This seems to be a specific problem of using your client with socket.io. Did you ever test the two together?

jochenonline · 2013-02-15T15:07:49Z

@newleafdigital: Now I seem to have found out what is going on. I did not deeply dig into your code but I can quite exactly describe the behaviour:

In my current configuration (node-app on machine1, redis_master on machine2, redis_slaves on machines3+4, redis_sentinels on machines2+3+4) your code only works if I connect to one of the slave_sentinels on machine3+4. It does not work if I connect to the sentinel on the master (machine2).

When I connect to one of the two slave_sentinels the systems logs correctly:

debug connected to sentinel listener
debug connected to sentinel talker
debug new master info { host: 'the.ip.of.master', port: '6379' }
debug Changing master from 127.0.0.1:9999 to the.ip.of.master:6379
debug New master is ready [Yippieee!]

When I connect to the sentinel on the master it says (and does not work afterwards):

debug connected to sentinel listener
debug connected to sentinel talker
debug new master info { host: '127.0.0.1', port: '6379' } 
debug Changing master from 127.0.0.1:9999 to 127.0.0.1:6379

and no debug New master is ready. As you can see, when I connect to the master's sentinel, the master is not addressed correctly (127.0.0.1 instead of the correct ip).

If the sentinels run on machines other than the redis machines (i.e. on the same machine as the node app, as in my very first configuration), it likewise only works if the sentinel is connected to one of the slaves (and not the master).

And together with socket.io/RedisStore it does not work at all (with my configuration), even if I connect to the sentinel on one of the slaves.
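
One possible reading of the logs above, offered as an assumption rather than a diagnosis: the sentinel on machine2 monitors the master under a loopback address and reports that address verbatim, so a remote client ends up dialing its own 127.0.0.1. The usual fix would be to give the sentinel a routable address in its `sentinel monitor` line. A client-side workaround could look like the sketch below; `resolveReportedHost` is a hypothetical helper, not something either fork is known to implement.

```javascript
// Hypothetical workaround: if a sentinel reports the master on a loopback
// address, assume the master runs on the same machine as that sentinel and
// use the address we reached the sentinel on instead.
function resolveReportedHost(masterHost, sentinelHost) {
    var loopback = ['127.0.0.1', 'localhost', '::1'];
    var masterIsLoopback = loopback.indexOf(masterHost) !== -1;
    var sentinelIsLoopback = loopback.indexOf(sentinelHost) !== -1;
    // Only rewrite when the sentinel itself was reached over the network;
    // if both are loopback, everything really is local.
    if (masterIsLoopback && !sentinelIsLoopback) {
        return sentinelHost;
    }
    return masterHost;
}
```

This guess is only valid when the master and the reporting sentinel share a machine, so it is a stopgap at best.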

DTrejo · 2013-02-24T05:34:34Z

+1 this is great as a separate module as it is much more likely to change.

If you'd like to have your module mentioned in the wiki, come up with a tag for it, e.g. #redis-sentinel, and then I'll add a link in the README to all packages with that tag, e.g. https://npmjs.org/browse/keyword/redis-sentinel

benbuckman · 2013-02-26T00:13:53Z

Thanks for all the feedback here, and sorry I haven't had a chance to reply in depth. I've allocated myself time for this next week (March 4).

@DTrejo, you're right about making this its own module, I think I'll do that (next week as well).

jamessharp · 2013-03-11T20:03:05Z

Hi guys

I've also been having a look at this (I need to be able to get the master from a list of sentinels, connect to it, and reconnect to a new master if the old one goes down).

I've taken a slightly different approach from @newleafdigital's: I just use the existing RedisClient reconnection logic and change the desired host/port, so that when reconnection happens we reconnect to a new instance (so, for all intents and purposes, it's as if the client has just done a normal reconnect).

The advantages to this method are that it is just a wrapper over node_redis so there is no need to get down and dirty messing with that code. Also it should mean that since the client returned is a (slightly extended) RedisClient then it should slot in nicely with existing code. And finally it just feels simpler than the existing code

My code is here: https://github.com/ortoo/node-redis-sentinel - it's fairly limited at the moment but feels pretty extensible. Any thoughts/assertions that I'm going off on the wrong track would be appreciated.

Current functionality:

Specify a list of sentinel endpoints and master name
Get a RedisClient back that will connect to the master
If the current master dies then RedisClient will attempt to reconnect (resolving the master from the sentinels)
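
The "resolving the master from the sentinels" step in the list above can be sketched as trying each sentinel endpoint in order until one answers. This is an illustration under stated assumptions, not the code in node-redis-sentinel: `resolveMaster` and the injected `askSentinel(sentinel, masterName, cb)` function are hypothetical names, and `askSentinel` is expected to call back with `(err, {host, port})`.

```javascript
// Sketch: ask each sentinel in turn until one returns a master address.
// `askSentinel` is a hypothetical, caller-supplied function that queries a
// single sentinel and calls cb(err, { host, port }).
function resolveMaster(sentinels, masterName, askSentinel, callback) {
    var index = 0;

    function tryNext(lastError) {
        if (index >= sentinels.length) {
            // Every sentinel failed (or the list was empty).
            return callback(lastError || new Error('no sentinels configured'));
        }
        var sentinel = sentinels[index++];
        askSentinel(sentinel, masterName, function (err, master) {
            if (err || !master) {
                // Unreachable sentinel or unknown master: move on.
                return tryNext(err || new Error('sentinel does not know ' + masterName));
            }
            callback(null, master);
        });
    }

    tryNext(null);
}
```

On reconnect, a wrapper could run the same lookup again and point the client at whatever host and port come back.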

brycebaril · 2013-03-12T04:19:13Z

👍 to all of the people working on sentinel libraries! I haven't had a chance to take a look yet, but I'm eager to try it out.

benbuckman · 2013-04-25T20:53:31Z

@jamessharp, nice work on node-redis-sentinel. I'm trying to figure out if I should adopt yours or split my SentinelClient into its own module. One thing I can't quite figure out with yours, is it meant to create a single client that transparently handles failover/reconnect? Or does the app need to instantiate 3 clients (master, slave, sentinel) and switch between the active client when it fails over?

From your description above, it sounds like the former, but looking at the code and trying to use it, it seems more like the latter.

Thanks!

jamessharp · 2013-04-25T21:39:26Z

Thanks. It's meant to create a single client that transparently handles failover/reconnect. The three clients that you can have (master/slave/sentinel) are meant to give you a persistent connection to each of the different types. So if you are happy to do reads from a slave then you can get a slave connection (which will attempt to transparently reconnect to a new slave instance if the one you're accessing goes down). If you need a permanent connection to the master (whichever server that happens to be) then you can just use the master client and if you want a direct connection to the sentinel instance (that again should transparently failover) you can grab the sentinel client.

Hope that makes sense.

Of course there's still quite a lot of work that needs doing but it works well enough for me at the moment...

benbuckman · 2013-04-26T19:09:17Z

Thanks for the quick reply @jamessharp. What are the big missing pieces?

I wrote a test here to see how it works - https://gist.github.com/newleafdigital/5469571. Before running the script, you start up a master+slave+sentinel; the script connects to the master, and every second adds an incremental key to a hash. After 5 seconds, the master is killed, and the idea is to see if anything is lost. It seems that the master client never actually connects to the new master, so I'm not sure if I'm doing it wrong, or if it's not working.

During a failover, is it supposed to buffer the data to avoid loss, or is I/O done during the failover supposed to be lost?

Thanks

jamessharp · 2013-04-26T19:20:47Z

A couple of thoughts as to why it may not be working:

the default sentinel configuration won't actually failover until the master has been down for 30 seconds (I think)
the time between reconnection attempts of the client backs off exponentially so there could be a minute before the reconnection actually happens
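
To make the second point concrete, here is a small sketch of how exponentially growing waits add up. The initial delay and the multiplier are plain parameters here; the values a given node_redis version actually uses are not taken from this thread and should be checked in its source. `backoffDelays` and `totalWait` are hypothetical helper names.

```javascript
// Sketch: list the waits between reconnection attempts when each wait is the
// previous one multiplied by a constant factor (rounded down).
function backoffDelays(initialDelayMs, factor, attempts) {
    var delays = [];
    var delay = initialDelayMs;
    for (var i = 0; i < attempts; i++) {
        delays.push(delay);
        delay = Math.floor(delay * factor);
    }
    return delays;
}

// Sum of the waits that precede the first `attempts` reconnection attempts.
function totalWait(initialDelayMs, factor, attempts) {
    return backoffDelays(initialDelayMs, factor, attempts).reduce(function (sum, d) {
        return sum + d;
    }, 0);
}
```

Because the waits grow geometrically, a client that started retrying when the master died may be in a long wait at the moment the sentinel finally promotes a new master, which would delay the reconnection further.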

I'm not sure whether the data is buffered. I'd hope so but the behaviour will be the same as whatever happens when the underlying client tries to reconnect.

The main missing piece is detecting when the master is changed without it going down (I.e. a manual failover). But some more work could be done on making the reconnection as snappy as possible.
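
For the manual-failover case, sentinels publish a `+switch-master` event over Pub/Sub whose payload is `<master name> <old ip> <old port> <new ip> <new port>`. A client subscribed to a sentinel could parse that payload and redirect itself without waiting for the old master to drop the connection. The parser below is only a sketch; `parseSwitchMaster` is a hypothetical name, and wiring it to an actual subscription is left out.

```javascript
// Sketch: parse the payload of a sentinel `+switch-master` message, which has
// the form "<master name> <old ip> <old port> <new ip> <new port>".
// `parseSwitchMaster` is a hypothetical helper.
function parseSwitchMaster(message) {
    var parts = String(message).trim().split(/\s+/);
    if (parts.length !== 5) {
        return null; // not a well-formed +switch-master payload
    }
    var oldPort = parseInt(parts[2], 10);
    var newPort = parseInt(parts[4], 10);
    if (isNaN(oldPort) || isNaN(newPort)) {
        return null;
    }
    return {
        name: parts[0],
        previous: { host: parts[1], port: oldPort },
        current: { host: parts[3], port: newPort }
    };
}
```

A wrapper would typically ignore events whose `name` differs from the master it is tracking and reconnect to `current` otherwise.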

benbuckman · 2013-04-26T21:42:29Z

I added a test with our implementation to compare. The results are in the gist. All the I/O done during the failover is lost with node-redis-sentinel, and no data is lost with our (much bulkier) solution. We're buffering in ours; the basic redisClient drains its buffer on disconnect or errors (I don't remember exactly), so it can't easily buffer through a failover.
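
The buffering idea can be illustrated with a minimal queue that holds commands while no master is available and replays them in order once one is ready. This is a sketch of the general technique, not the code in either implementation; `BufferedSender` and the injected `send` function are hypothetical names.

```javascript
// Sketch of a command buffer: commands issued while no master is connected
// are queued and replayed in order once a master becomes ready.
// `send` is a hypothetical function that writes one command to the currently
// connected master.
function BufferedSender(send) {
    this.send = send;
    this.ready = false;
    this.queue = [];
}

BufferedSender.prototype.command = function (name, args, callback) {
    var entry = { name: name, args: args, callback: callback };
    if (this.ready) {
        this.send(entry);
    } else {
        this.queue.push(entry); // hold until the next master is ready
    }
};

// Call when the current master is lost (failover started).
BufferedSender.prototype.masterDown = function () {
    this.ready = false;
};

// Call when a (new) master is connected and ready.
BufferedSender.prototype.masterReady = function () {
    this.ready = true;
    // Replay queued commands in the order they were issued.
    while (this.ready && this.queue.length > 0) {
        this.send(this.queue.shift());
    }
};
```

Note that this only covers commands issued while disconnected. Commands already written to the old master but not yet answered when it died would need separate tracking to avoid loss.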

I like the simplicity of your solution, but I wonder if it's sufficient for the goals we were hoping to achieve:

Transparent, drop-in replacement for redisClient
handles all redisClient operations include pub/sub transparently
no loss of data during failover

For now, I'll assume our implementation adds some value, and I'll separate it into its own module. If it turns out your much simpler alternative will suffice, we'll switch to that.

Thanks for enriching the space!

jamessharp · 2013-04-28T12:06:29Z

Yeh your goals are spot on. However a lot of them would actually be useful in the core node_redis client. When it comes down to it there shouldn't be any difference between a reconnection to a single instance (i.e. what node_redis currently handles) and a failover to a new master/slave in a sentinel controlled cluster.

Rather than putting in the logic for no loss of data during failover and pub/sub persistence in a separate module, I reckon we should put it in the core client and then have a separate module for the sentinel specific logic (along the lines of what I've done in node-redis-sentinel). Its win-win. It makes for a better node_redis client and a simpler sentinel module.

brycebaril · 2013-04-28T16:19:17Z

I've started work on a refactor of the core node_redis client to accomplish a couple things: break up the current index.js so it is a bit easier to work on, but also to add in a mechanism for plugins such as this. I agree that due to the way that node_redis manages command replays and offline queueing for connection interruptions, the sentinel behavior probably works best working directly with the core client.

My goal is that the refactor results in code where you can drop in a 3rd party connection manager library much the way you can currently swap in a parser, and it will expose the appropriate hooks such that something like a sentinel library could use the core client's command queueing & bookkeeping, but replace the connection management portions.

I'll try to get a branch pushed soon for feedback.

benbuckman · 2013-04-28T17:35:30Z

That all sounds good but pretty heavy. Our fork of node_redis already handles the reconnection steps needed for sentinel to work. I'm not sure a pluggable connection manager is necessary.

benbuckman · 2013-04-28T17:37:24Z

Actually, correction: The fork there does not have all the necessary pieces; they exist in another copy which I need to merge into that fork. I will do that on Monday (tomorrow).

brycebaril · 2013-04-28T18:51:36Z

I'm not entirely sure what you mean by heavy -- the goal is to encourage a lighter codebase by making it easier to plug features in without forcing them to be a part of the core library.

benbuckman · 2013-04-30T17:51:35Z

I separated our sentinel client implementation into a new module, redis-sentinel-client:

https://github.com/DocuSignDev/node-redis-sentinel-client
https://npmjs.org/package/redis-sentinel-client

It still depends on some minor changes to node_redis (so it uses our fork); I will pare down the fork to only those necessary changes, and submit PRs for them.

(@brycebaril, my original thought was that a pluggable connection manager would be redundant/overkill. I'm probably wrong about that. I'll see what changes are actually necessary to node_redis to support a sentinel client in a little bit.)

benbuckman · 2013-04-30T19:39:54Z

I've submitted two pull requests to support redis-sentinel-client:

#428 - Export utils and commands to share (with sentinel client, or others) - this is non-breaking/low-impact and makes node_redis better overall regardless of this particular use.

#429 - Flexible connections for sentinel support - this is potentially more breaking/controversial and related to the thread above about pluggable connection managers.

Thanks to @tanguylebarzic's early work on sentinel support, and to everyone who works on node_redis!

nlfiedler · 2022-02-01T22:14:52Z

Guess we will switch to ioredis, this effort appears to have died.

leibale · 2023-01-25T19:35:08Z

So... after more than 10 years, I'm starting to work on that now... 🎉 better late than never 😆

nguyenpc · 2023-05-31T03:43:55Z

> So... after more than 10 years, I'm starting to work on that now... 🎉 better late than never 😆

Great to hear that! It would be nice to have an optimistic ETA for that, @leibale.

leibale · 2023-05-31T09:57:38Z

@nguyenpc I'm not too sure how much time it's gonna take, but the current roadmap is:
V5 with support for RESP3 + some more features
Sentinel

V5 should be ready "soon" (a month or so), then I'll start working on Sentinel :)

JustDoItSascha · 2024-02-06T16:40:42Z

Is it now implemented?

leibale · 2024-02-06T17:44:19Z

It's in the v5 branch waiting for some tests and last-minute changes, then we will release a "next"/beta version with it

jagdeepak-prat · 2024-02-18T21:48:28Z

Hope this comes soon to the main. :)

gianDiazM · 2024-06-05T06:49:32Z

Hi @leibale! I'd like to know if there are any plans to release version V5 soon. I've noticed that the working branch is quite active, but I was wondering if there's a target date for releasing a stable or beta version that includes the Sentinel package? :)

gianDiazM · 2024-06-19T09:23:25Z

> Hi @leibale! I'd like to know if there are any plans to release version V5 soon. I've noticed that the working branch is quite active, but I was wondering if there's a target date for releasing a stable or beta version that includes the Sentinel package? :)

Hi again, my team has been closely following the development of the V5 version and is very interested in using the new features you are implementing.
Additionally, our team is willing to work with you to speed up the development process so we can use the new version as soon as possible. If we can help, please let us know. We are open to performing tests, providing feedback, or contributing in any other way you find useful.

Thank you in advance

leibale · 2024-06-20T15:23:07Z

@gianDiazM first, sorry for the huge delay, I completely missed your message from 2 weeks ago.
ATM we are working on:

Client-side caching
Finishing sentinel
Aligning the commands

If you guys want to help with one of those, do performance tests, or just play around with the client, that would be very helpful.
Do you need sentinel specifically to try it out? Can you try it directly from git or do you need an "npm version"?

gianDiazM · 2024-06-21T07:44:49Z

> @gianDiazM first, sorry for the huge delay, I completely missed your message from 2 weeks ago. ATM we are working on:
>
> Client-side caching
>
> Finishing sentinel
>
> Aligning the commands
>
> If you guys want to help with one of those, do performance tests, or just play around with the client, that would be very helpful. Do you need sentinel specifically to try it out? Can you try it directly from git or do you need an "npm version"?

Hi @leibale, no worries, and thank you for getting back to me. We will try to address the points you mentioned.

Regarding your question, "Can you try it directly from git or do you need an 'npm version'?":

A Redis npm version that includes the client/sentinel package, which we could integrate into our development, would be great.

Thanks =)

Loki-Afro · 2024-07-29T12:38:10Z

@leibale we are in a very similar situation to @gianDiazM.

What exactly does "finishing" Sentinel mean?

And we kind of need a new tag.

Bonfims · 2024-09-17T20:45:13Z

Hey, is there any workaround? How can I use my Sentinel configuration? Is redis-sentinel a good option?

benbuckman mentioned this issue Apr 30, 2013

Flexible connections for sentinel support #429

Closed

kontrollanten mentioned this issue Jul 23, 2022

Redis Sentinel support Chocobozzz/PeerTube#5141

Closed

onedr0p mentioned this issue Nov 13, 2022

[Feature]: Support for redis sentinel immich-app/immich#473

Closed

leibale self-assigned this Feb 24, 2023

leibale mentioned this issue Mar 30, 2023

Lack of sentinel support needs to be prominent #2454

Open

leibale mentioned this issue Apr 11, 2023

How to handle read & write operations in different endpoint using Redis Client #2470

Open

jamesholcomb mentioned this issue Apr 11, 2023

chore: add ioredis dep, test feathersjs-ecosystem/feathers-sync#191

Open

3 tasks

uglide changed the title ~~Sentinel aware~~ Add support for Sentinel and RE Discovery Service Jun 28, 2023

uglide added this to Node-Redis V6 Roadmap Jul 6, 2023

kashalls mentioned this issue Jul 28, 2023

Migrate from node package "redis" to "ioredis" for redis high availability support. OneUptime/oneuptime#555

Closed

gmemstr mentioned this issue Aug 21, 2023

#26565 - Enable Redis Sentinel configuration mastodon/mastodon#26571

Closed

PatelVishalJ mentioned this issue Oct 27, 2023

Sentinel support redis/redis-om-node#217

Open

chasemcdo mentioned this issue Nov 23, 2023

ioredis vs node-redis #2658

Open

leibale assigned sjpotter and unassigned leibale Nov 27, 2023

leibale linked a pull request Feb 5, 2024 that will close this issue

Sentinel Support #2664

Merged

3 tasks

silverwind mentioned this issue Apr 16, 2024

Is this package still actively maintained? redis/ioredis#1870

Open
