Seeded order is platform-dependent #971

threedaymonk · 2016-04-18T15:04:20Z

When using --order=random with a seed, the order is determined by Ruby's Array#shuffle method.

Seeded Random produces consistent results across different versions of Ruby and JRuby, but shuffle is implemented differently in different interpreters:

Ruby 2.2.4:

[1,2,3].shuffle(random: Random.new(4))
=> [2, 1, 3]
[1,2,3].shuffle(random: Random.new(5))
=> [1, 2, 3]

Ruby 1.9.3 and JRuby 1.7.23:

[1,2,3].shuffle(random: Random.new(4))
=> [1, 2, 3]
[1,2,3].shuffle(random: Random.new(5))
=> [3, 2, 1]

As I discovered whilst working on #970, this makes it difficult to write specifications that will work across all supported platforms. The current randomize.feature effectively works only by accident because it shuffles only two items.

This also means that it's not always possible to replicate a particular seeded order on a different interpreter.

RSpec has a similar random ordering feature that uses a platform-independent sort on a hash of the item index and the seed. This seems like a more robust thing for cucumber-ruby to do, although it would come at the cost of invalidating any existing use of seeds.


When you’re troubleshooting an order-dependent failure, you want to get the repro case down to a minimal run that loads and runs as few specs as possible. With the old random ordering implementation, that was hard to achieve: rerunning with a given seed produced the same order when the exact same set of examples was loaded, but the ordering would be completely different when a subset was loaded. Ordering by `hash(seed + example_id)` ensures that the relative order of any two examples stays consistent regardless of how many other examples are loaded. Jenkins or MD5 is significantly slower than `shuffle`, but I think the tradeoff is worth it here. This isn’t a hot spot.

myronmarston · 2016-04-20T16:41:16Z

FWIW, I didn't realize that Array#shuffle is platform-dependent. The reason we switched to sorting by hash(item + seed) is to provide stable random ordering. %w[ a b c d ].shuffle may order c before b, but %w[ b c d ].shuffle may order c after b even if you use the same random seed. When you are trying to isolate an ordering dependency, it's essential that the examples are always ordered the same relative to each other, even as the user runs smaller and smaller subsets of the whole suite. Our --bisect feature would be impossible without it.
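A minimal sketch of the idea, for illustration only: it sorts by an MD5 digest of the seed and the item, which is one way to get an ordering that does not depend on how the interpreter implements Array#shuffle. The helper name `stable_order` and the `seed:item` key format are assumptions made for this example, not the RSpec or Cucumber implementation.

```ruby
require "digest"

# Hypothetical helper: each item's position depends only on the seed and the
# item itself, never on which other items happen to be in the list.
def stable_order(items, seed)
  items.sort_by { |item| Digest::MD5.hexdigest("#{seed}:#{item}") }
end

full   = %w[a b c d]
subset = %w[b c d]
seed   = 4

full_order   = stable_order(full, seed)
subset_order = stable_order(subset, seed)

# Removing "a" from the full ordering leaves exactly the subset ordering,
# so b, c and d keep the same order relative to each other.
same_relative_order = (full_order - ["a"]) == subset_order
puts same_relative_order
```

Because every item gets its own sort key, any pair of items compares the same way in the full list and in a subset, which is the property a bisect-style search relies on.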

mattwynne · 2016-04-20T18:56:36Z

Thanks for the insight Myron. I covet your bisect feature, so this is worth bearing in mind. Thanks!

threedaymonk · 2016-04-21T10:08:38Z

That's really interesting. Thanks @myronmarston.

@mattwynne I'll add a scenario to cover stability. The scenario path/line is one obvious candidate as a basis for ordering, but it might need a bit of tweaking to ensure that it's not affected by the delimiter (i.e. that it works the same on Windows).
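As a rough sketch of the delimiter concern, assuming the ordering key is built from a path and line number: normalising backslashes to forward slashes before hashing would give the same key on Windows and elsewhere. The helper names `scenario_key` and `order_digest` are hypothetical and only illustrate the idea.

```ruby
require "digest"

# Hypothetical helper: builds a key from path and line that is the same
# whether the path uses "\" or "/" as its delimiter.
def scenario_key(path, line)
  "#{path.gsub("\\", "/")}:#{line}"
end

# Hypothetical helper: digest used as the sort key for a given seed.
def order_digest(path, line, seed)
  Digest::MD5.hexdigest("#{seed}:#{scenario_key(path, line)}")
end

windows_style = order_digest("features\\sub\\a.feature", 3, 4)
unix_style    = order_digest("features/sub/a.feature", 3, 4)

# Both spellings of the path produce the same digest for the same seed.
same_digest = windows_style == unix_style
puts same_digest
```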

mattwynne · 2016-04-21T19:33:30Z

That's great, thanks @threedaymonk

lock · 2018-10-25T00:07:08Z

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

threedaymonk mentioned this issue Apr 26, 2016

Make random order stable and platform-independent #974

Merged

danascheider closed this as completed in #974 May 15, 2016

lock bot locked as resolved and limited conversation to collaborators Oct 25, 2018
