Sentinel: Master didn't update after sentinel failover without losing connection to old master #435

reclosedev · 2014-02-05T17:54:44Z

I have a Sentinel-managed Redis instance:
redis = sentinel.master_for(...)
If a failover happens without the connection to the old master being closed, e.g. when triggered from redis-cli on a sentinel:
> sentinel failover service_name
then the next write (redis.set('test', 'test')) fails with
ResponseError("READONLY You can't write against a read only slave.")

The reason is that execute_command reconnects only on ConnectionError.

I came up with the following workaround:

from redis import StrictRedis, ResponseError


class SentinelAwareStrictRedis(StrictRedis):
    """Handles the case when failover happened without losing the connection to the old master."""
    def execute_command(self, *args, **options):
        try:
            return super(SentinelAwareStrictRedis, self).execute_command(*args, **options)
        except ResponseError as e:
            # str(e) works on Python 2 and 3; e.message exists on Python 2 only
            if str(e).startswith("READONLY"):
                # Ask Sentinel for the current master, then retry the command once
                self.connection_pool.get_master_address()
                return super(SentinelAwareStrictRedis, self).execute_command(*args, **options)
            raise

usage:

...
sentinel.master_for(..., redis_class=SentinelAwareStrictRedis)

If this solution is fine, I think it could become the default redis_class for Sentinel.


abe-winter · 2014-04-14T17:41:49Z

This is an important fix -- routine network blips in AWS cause sentinel re-elections about once a week in my cluster. When that happens, my redis-py clients go nuts and have to be reset.

reclosedev · 2014-04-14T17:52:38Z

After more testing, it appears that connection_pool.disconnect() has to be called.
Also, if you use pipelines, you need to handle ResponseError in the pipeline as well.
Code that I use now (logging removed):

from redis import StrictRedis, ResponseError, WatchError
from redis.client import StrictPipeline


class SentinelAwareStrictRedis(StrictRedis):
    """Handles the case when failover happened without losing the connection to the old master."""
    def execute_command(self, *args, **options):
        try:
            return super(SentinelAwareStrictRedis, self).execute_command(*args, **options)
        except ResponseError as e:
            # str(e) works on Python 2 and 3; e.message exists on Python 2 only
            if not str(e).startswith("READONLY"):
                raise
            # old_master/new_master were only used by the logging removed from this snippet
            old_master = self.connection_pool.master_address
            new_master = self.connection_pool.get_master_address()
            # Drop pooled connections so the retry connects to the new master
            self.connection_pool.disconnect()
            return super(SentinelAwareStrictRedis, self).execute_command(*args, **options)

    def pipeline(self, transaction=True, shard_hint=None):
        return SentinelAwareStrictPipeline(
            self.connection_pool,
            self.response_callbacks,
            transaction,
            shard_hint
        )


class SentinelAwareStrictPipeline(StrictPipeline):
    def execute(self, raise_on_error=True):
        # Keep a reference to the queued commands so they can be replayed after a failover
        stack = self.command_stack
        try:
            return super(SentinelAwareStrictPipeline, self).execute(raise_on_error)
        except ResponseError as e:
            # Pipeline errors are prefixed with command info, so search the whole message
            if "READONLY" not in str(e):
                raise
            if self.watching:
                raise WatchError("Sentinel failover occurred while watching one or more keys")
            # restore all commands
            self.command_stack = stack
            old_master = self.connection_pool.master_address
            new_master = self.connection_pool.get_master_address()
            self.reset()
            self.connection_pool.disconnect()
            return super(SentinelAwareStrictPipeline, self).execute(raise_on_error)

I'm not sure the SentinelAwareStrictPipeline implementation is correct: pipe.execute() sometimes returns an empty response, but I couldn't reproduce that in a test environment and had no time to debug it.

andymccurdy · 2014-05-07T05:52:35Z

@abe-winter @reclosedev

I believe the fix I just pushed should resolve your issue. It catches the ReadOnlyError in SentinelManagedConnection.send_command. You shouldn't have to modify the client or pipeline classes to make this work.

If you have any issues with this change, please feel free to re-open the issue.

reclosedev · 2014-06-02T08:24:55Z

I've just tried v2.10.0.

The ReadOnlyError exception is raised in connection.read_response(), so the fix from ef05364 doesn't force a query for the new master.

Overriding read_response() with "try/except readonly/disconnect" works.
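
To make the "try/except readonly/disconnect" idea concrete, here is a minimal, self-contained sketch of the pattern. It does not use redis-py: FakeSentinelPool, FakeConnection, FailoverAwareConnection, DroppedConnection, and this local ReadOnlyError are all hypothetical stand-ins invented for illustration, and execute() only imitates a client that retries on connection errors. The real class and method names in redis-py may differ from what is modeled here.

```python
class ReadOnlyError(Exception):
    """Stand-in for the READONLY reply from a demoted master."""


class DroppedConnection(Exception):
    """Stand-in for a connection error, the only error the client retries on."""


class FakeSentinelPool:
    """Hypothetical pool that knows which node Sentinel reports as master."""

    def __init__(self):
        self.current_master = "node-a"

    def get_master_address(self):
        return self.current_master


class FakeConnection:
    """Hypothetical connection that stays attached to one node until disconnected."""

    def __init__(self, pool):
        self.pool = pool
        self.address = None

    def connect(self):
        # Only ask the pool for the master when not already connected.
        if self.address is None:
            self.address = self.pool.get_master_address()

    def disconnect(self):
        self.address = None

    def read_response(self):
        # A node that is no longer master rejects writes when the reply is read.
        if self.address != self.pool.current_master:
            raise ReadOnlyError("READONLY You can't write against a read only slave.")
        return "OK"


class FailoverAwareConnection(FakeConnection):
    """Turns a READONLY reply into a disconnect plus a retryable error."""

    def read_response(self):
        try:
            return super().read_response()
        except ReadOnlyError:
            # Dropping the connection forces the next connect() to look up the master again.
            self.disconnect()
            raise DroppedConnection("previous master is now a slave")


def execute(conn):
    """Imitates a client that retries once, but only on connection errors."""
    conn.connect()
    try:
        return conn.read_response()
    except DroppedConnection:
        conn.connect()
        return conn.read_response()
```

With the plain FakeConnection, a failover leaves execute() raising ReadOnlyError because the stale connection is never dropped. With FailoverAwareConnection, the same call reconnects to the new master and succeeds.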

andymccurdy · 2014-06-02T20:31:27Z

@reclosedev You're right, the try/except readonly/disconnect logic should be in read_response(). Not sure why I put it in send_command().


andymccurdy · 2014-06-02T20:40:19Z

This is fixed in 2.10.1, which just went out on PyPI. I need to get a better test suite for sentinel logic.

abe-winter · 2014-06-03T03:55:27Z

I have some test cases for this which I wrote when the READONLY error first started happening. Do you want the code?

reclosedev · 2014-06-03T07:19:59Z

@andymccurdy Thanks!

abe-winter mentioned this issue Apr 11, 2014

ResponseError READONLY doesn't force SentinelConnectionPool connections to ask for new primary #460

Closed

andymccurdy closed this as completed in ef05364 May 7, 2014

andymccurdy added a commit that referenced this issue Jun 2, 2014

need to detect READONLY errors in read_response, now send_command. re…

96f08d7

…al fix for #435
