
Shard Allocation Race Condition #34878

Closed
danielkasen opened this issue Oct 25, 2018 · 4 comments
Labels
:Distributed Coordination/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) feedback_needed

Comments

@danielkasen

Elasticsearch version: 6.3.2

Plugins installed: []

JVM version (java -version): 1.8.72

OS version (uname -a if on a Unix-like system): Ubuntu 14.04

Description of the problem including expected versus actual behavior:
A new index gets allocated into a yellow state instead of allocating a shard to each available node, when using a mixture of rack awareness and shard_allocation_per_node = 1.

Steps to reproduce:

  1. Create a 15-node cluster that spans 3 different racks
  2. Create an index with 7 shards and 1 replica (14 copies total) that can allocate only 1 shard per node
  3. Randomly hit a state where the index can't allocate one of the replicas, because its primary is in the same rack as every remaining free node.
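The setup above could be expressed with settings roughly like the following (a sketch against a hypothetical cluster; the index name `my_index`, the node attribute `rack_id`, and the rack values are assumptions, not taken from the report):

```sh
# Spread copies of a shard across racks; each node declares its rack in
# elasticsearch.yml, e.g.  node.attr.rack_id: rack_a
curl -XPUT 'localhost:9200/_cluster/settings' -H 'Content-Type: application/json' -d'
{
  "persistent": {
    "cluster.routing.allocation.awareness.attributes": "rack_id"
  }
}'

# 7 primaries + 7 replicas = 14 copies on 15 nodes, at most 1 copy per node
curl -XPUT 'localhost:9200/my_index' -H 'Content-Type: application/json' -d'
{
  "settings": {
    "number_of_shards": 7,
    "number_of_replicas": 1,
    "index.routing.allocation.total_shards_per_node": 1
  }
}'
```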

So basically you can get into a condition where, even though you still have 2 nodes without a shard, the index can't allocate the replica to either of them, because the primary and replica would then be in the same rack. To fix this you have to free up a node in a different rack by moving its primary or replica to one of the unused nodes, and then assign the original replica that couldn't be assigned to that newly freed node (moving 2 shards at once).
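The manual two-step fix described above can be expressed with the cluster reroute API; a sketch with hypothetical index, shard, and node names (not taken from the report):

```sh
# Step 1: free up a node in a different rack by moving one of its copies
# onto an unused node. The stranded replica (step 2) can then allocate to
# the freed node on its own, or be retried explicitly.
curl -XPOST 'localhost:9200/_cluster/reroute' -H 'Content-Type: application/json' -d'
{
  "commands": [
    { "move": { "index": "my_index", "shard": 3,
                "from_node": "node_in_rack_b", "to_node": "unused_node" } }
  ]
}'
```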

@dnhatn dnhatn added the :Distributed Coordination/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) label Oct 26, 2018
@elasticmachine
Collaborator

Pinging @elastic/es-distributed

@dnhatn
Member

dnhatn commented Oct 26, 2018

@danielkasen Thanks for reporting this. Could you provide the shard allocation filter that you used? Thanks.

@DaveCTurner
Contributor

I think this duplicates #12273. The shard allocator does not consider moving shards around to make more of them fit, and backs itself into a corner, especially if there's a limit per node.

(It's not a race condition, this all happens on a single thread.)
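The corner described in this comment can be reproduced in a toy model (plain Python, not Elasticsearch code; the node counts and rack layout mirror the reproduction steps above, everything else is an illustrative assumption): a first-fit allocator that never relocates placed copies strands replicas in exactly this 15-node / 3-rack / 7-shard configuration, even though a complete assignment exists.

```python
# Toy model of the allocation corner: 15 nodes in 3 racks of 5, 7 shards
# with 1 replica each, at most 1 copy per node, and primary/replica of a
# shard forced into different racks.
NODES = {f"n{i}": "ABC"[i // 5] for i in range(15)}  # node -> rack

def greedy_allocate(num_shards=7):
    """First-fit placement that never moves an already-placed copy."""
    used = {}          # node -> (shard, "p" | "r")
    unassigned = []
    for s in range(num_shards):
        for copy in ("p", "r"):
            for node, rack in NODES.items():
                if node in used:
                    continue
                if copy == "r":
                    # replica must avoid its primary's rack
                    prack = next(NODES[n] for n, v in used.items()
                                 if v == (s, "p"))
                    if rack == prack:
                        continue
                used[node] = (s, copy)
                break
            else:
                unassigned.append((s, copy))
    return used, unassigned

used, unassigned = greedy_allocate()
free = [n for n in NODES if n not in used]
print("unassigned copies:", unassigned)   # replicas of shards 5 and 6
print("free nodes:", free, "racks:", {NODES[n] for n in free})  # all rack C

# Yet a complete assignment exists: pick rack pairs per shard so that no
# rack holds more than its 5 nodes' worth of copies.
valid = {0: ("A", "B"), 1: ("A", "B"), 2: ("A", "B"),
         3: ("A", "C"), 4: ("A", "C"), 5: ("B", "C"), 6: ("B", "C")}
loads = {"A": 0, "B": 0, "C": 0}
for pr, rr in valid.values():
    assert pr != rr                        # awareness constraint holds
    loads[pr] += 1
    loads[rr] += 1
assert all(v <= 5 for v in loads.values())  # fits within 5 nodes per rack
print("feasible rack loads:", loads)
```

Reaching that feasible assignment from the stuck state requires moving an already-placed copy first, which is exactly what the allocator doesn't do.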

@danielkasen
Author

Ahh yes, I didn't see that other thread. This is basically what is happening, just with a different shard-to-node ratio.
