
WIP: 2797 should recover ips on peer loss #3171

Closed
wants to merge 2 commits

Conversation

bricef
Contributor

@bricef bricef commented Nov 14, 2017

Currently, weave will recover missing peers on relaunch, as per #3149. However, some issues remain with updates (see #3170).

Furthermore, weave does not currently deal properly with peers going down while it is running: the IP space is only recovered after a weave agent is relaunched.

In an ideal world, the IP space would be dynamically recovered and re-distributed. This PR includes a failing test to that end.

@bboreham
Contributor

"IP space dynamically recovered" is a niche benefit - we only really need to recover at the time we run out.

We could have some modest background task where any peer can say at any time "I perceive that I have 1% of the address space and someone else has 90%; I will ask for some more" - that would help with "re-distribute" and also lessen the impact of a delay in reclaiming.

We also want to avoid gratuitously fragmenting the overall space, although some fragmentation may be acceptable as a consequence of existing heuristics.
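For illustration, a minimal sketch of such a background check, assuming invented types and thresholds rather than weave's actual IPAM code:

```go
// Sketch of the "modest background task" idea. The ringView type, the
// thresholds, and the request callback are all hypothetical stand-ins;
// this is not weave's real IPAM code.
package main

import "fmt"

// ringView is a stand-in for one peer's view of address-space ownership.
type ringView struct {
	ownFraction  float64            // share of the space this peer owns
	peerFraction map[string]float64 // shares owned by other peers
}

// rebalance implements the heuristic: if I hold almost nothing and some
// other peer holds most of the space, ask that peer for more.
func rebalance(v ringView, request func(peer string)) {
	if v.ownFraction >= 0.01 {
		return // we have enough; do nothing this round
	}
	for peer, frac := range v.peerFraction {
		if frac >= 0.9 {
			request(peer)
			return
		}
	}
}

func main() {
	// In a real agent this would run periodically, not once.
	view := ringView{
		ownFraction:  0.005,
		peerFraction: map[string]float64{"peer-a": 0.9, "peer-b": 0.095},
	}
	rebalance(view, func(peer string) {
		fmt.Printf("asking %s for more address space\n", peer)
	})
}
```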

@bboreham bboreham changed the title from "2797 should recover ips on peer loss" to "WIP: 2797 should recover ips on peer loss" Nov 14, 2017
@bricef
Contributor Author

bricef commented Nov 15, 2017

I think I get your point. Unreachable or badly distributed addresses aren't a problem unless they affect function. In most autoscaling scenarios, the launch of a new instance would recover unreachable slices anyway.

I wonder if this cleanup and management should be triggered when weave is asked to provide a new address to a user service. We'd be doing work at that point anyway, it would be triggered by user action, and it would avoid the need for a background process. That way, weave can say (see the sketch after this list):

  1. I need a new address
  2. I don't have any available
  3. Are there unreachable hosts I can recover?
  4. If not, are there hosts with a slice I could steal?
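A rough sketch of that allocation path, with hypothetical names (the allocator type and its methods are stand-ins, not weave's real allocator API):

```go
// Sketch of allocation-time reclaim following the four steps above.
// Everything here is a hypothetical stand-in, not weave's real IPAM.
package main

import (
	"errors"
	"fmt"
)

var errNoSpace = errors.New("no free addresses anywhere")

type allocator struct {
	free int // count of free addresses in ranges this peer owns
}

// allocate: step 1 is the call itself; steps 2-4 follow in order.
func (a *allocator) allocate() (string, error) {
	if a.free > 0 { // step 2: an address is available locally
		a.free--
		return "10.32.0.1", nil // address selection elided
	}
	if a.reclaimUnreachable() { // step 3: recover space from dead peers
		return a.allocate()
	}
	if a.requestSliceFromPeer() { // step 4: ask a live peer for a slice
		return a.allocate()
	}
	return "", errNoSpace
}

// reclaimUnreachable would take over ranges owned by peers known to be
// unreachable; stubbed to "found nothing" here.
func (a *allocator) reclaimUnreachable() bool { return false }

// requestSliceFromPeer would ask a live peer to donate part of its
// range; stubbed to "received 8 addresses" here.
func (a *allocator) requestSliceFromPeer() bool { a.free = 8; return true }

func main() {
	a := &allocator{}
	addr, err := a.allocate()
	fmt.Println(addr, err)
}
```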

Maybe this would have too much of an effect on latency?

@bboreham
Contributor

Since the reclaim process is (currently) highly Kubernetes-specific, coupling step 3 to step 1 is problematic.

Steps 1, 2 and 4 are what IPAM does already, although it calls it "request" rather than "steal".

@brb
Contributor

brb commented Jan 6, 2018

Is it still WIP?

@bboreham
Contributor

I just realised I am pointing other issues at this one, but this is a PR, not an issue.

@bboreham
Contributor

bboreham commented Nov 1, 2018

Replaced by #3399

@bboreham bboreham closed this Nov 1, 2018
@bboreham bboreham added this to the n/a milestone May 16, 2019