Run tests with Gradle test runner instead of randomizedtesting.junit4-ant #31496

alpar-t · 2018-06-21T09:09:43Z

Currently we use https://github.com/randomizedtesting/ for running the tests.

The Gradle plugin calls into ant to do the heavy lifting.
It also replaces the Gradle Test task that the java plugin creates ( called test ) with it's own RandomizedTestingTask . This proved to be problematic, see #31324, and might no longer be possible in some future Gradle release.
Also, running the tests in parallel with the ant runner is independent from Gradle and does not honor Gradle --max-workers setting, which makes it hard to make other parts of Gradle run in parallel as it can lead to too many threads being created.

It is possible to use the reandmizedtesting runner i.e. @RunWith and run the tests with the Grade test runner. This proposal is to replace the ant part and not the runner.

The text was updated successfully, but these errors were encountered:

elasticmachine · 2018-06-21T09:09:45Z

Pinging @elastic/es-core-infra

alpar-t · 2018-06-21T09:29:45Z

Looking at what the ant runner implements and what we use it seems that we'll loose test class randomization and heartbeat. The former might eventually land in junit5 junit-team/junit5#13.
Gradle also measures how long tests take, so we should be fine without the heart beat.

It doesn't seem extremely difficult to implement random test order in Gradle, there is a precedent for it:
https://github.com/gradle/gradle/blob/93c34bdb6709f7ae9276c28e3d1b47b0495abb6a/subprojects/testing-base/src/test/groovy/org/gradle/api/internal/tasks/testing/processors/RunPreviousFailedFirstTestClassProcessorTest.groovy

However this is internal API so would need to convince Gradle it's useful and contribute it.

Other features of randomizedtesting, like stall thread detection are implemented in the runner.

rjernst · 2018-06-21T17:56:01Z

it seems that we'll loose test class randomization and heartbeat.

I don't think it is acceptable to lose any of the functionality from randomized runner. We would need to implement this functionality over the gradle runner. I think it is all doable, though, at least with some work.

alpar-t · 2018-06-22T06:33:22Z

I agree. Especially the w.r.t the ordering of tests. Gradle doesn't have a public API (or any API I could find for that matter) that we can use. Implementing this by making changes to Gradle isn't neither hard or time consuming, but will have to figure out if that's something Gradle will accept, or if we are willing to live with the overhead of maintaining a custom Gradle distribution.

Another distinction that we might have to make is: are we ok to go without randomizing test class order temporarily, but I think in order to have an informed decision we would need to measure the advantages in terms of opportunity of build time reduction this will give us, and cluster formation improvements (#30904) also come into play there.

I think the first thing is to get to have a PoC run of the tests with Gradle so that we can at least have an initial measurement and comparison.

alpar-t · 2018-06-25T06:25:28Z

I isolated the hanging tests task in this repo: https://github.com/atorok/gradle_test_randomizedtesting_deadlock

alpar-t · 2018-06-25T06:52:48Z

The problem is in BootstrapForTesting if removed the tests run.

alpar-t · 2018-06-25T10:38:00Z

It's the security manage ./gradlew :server:test -Dtests.security.manager=false doesn't block and there's already gradle/gradle#3526

alpar-t · 2018-06-25T10:52:41Z

The Gradle runner seems to be a bit slower, running :server:test in 2m 6s vs 1m 52s (12.5%) both running on 6 forks, average of 3 runs each.

rjernst · 2018-06-25T14:35:34Z

Another distinction that we might have to make is: are we ok to go without randomizing test class order temporarily

No, losing this would be a deal breaker. Reproducibility is paramount above all other concerns. It doesn't matter if we could run all tests in 10 seconds; if we can't reproduce failures, the speed is worthless.

It is unfortunate (but not surprising) that gradle cannot handle running with security manager. This is also a deal breaker for using the gradle runner.

Another option I considered which would gain us speed, would be to have separate randomized runner tasks, one for each jvm we plan to spawn. We could then do the assignment of classes to jvms late in configuration (or just before execution in a common doFirst or dependsOn). This would allow us all the utility of randomized runner, but utilizing separate tasks would allow parallelization with gradle.

alpar-t · 2018-07-13T05:28:29Z

I created an issue some time ago to be able to randomize test class order with Gradle and just relized it's not mentioned here: gradle/gradle#5760

alpar-t · 2018-09-05T10:31:23Z

It seems that Gradle won't be adding direct support for this as the direction is to move to junit 5 platform which deals with test execution order. On the bright side, our tests do seem to work with junit-vintage-engine without any code changes. We could implement our own vintage engine that also happens to randomize the order. Looking at the size of the engine I don't think maintenance would be too bad.

mark-vieira · 2019-02-14T07:33:34Z

Sounds like controlling ordering is now supported in JUnit 4.13. This would still require a contribution to Gradle core to allow deferring ordering to the JUnit runner but that might be a more manageable way forward than migrating to JUnit 5, even with the legacy support. Probably worth spiking both options to get a better idea of what might be involved.

mark-vieira · 2019-03-14T20:40:03Z

@atorok I've removed the stalled label as I intend to progress this next week using your branch as a potential starting point.

alpar-t · 2019-03-15T07:01:23Z

For context we discussed this in the team and came to the conclusion that randomizing the class order might be not as important as we initially taught, so we will try to propose an implementation that doesn't take this into account.
- the test ordering right now is not fully deterministic based on the seed, there's balancing across JVMs based on historical timing data. This means that the order of classes might not be the same between runs even if the same seed is passed in. We would rather not have randomization and have reproducible order instead.
- ephemeral workers in CI miss out on the historical timing data entirely
- the performance benefit of the balancing is not significant enough to continue investing in this feature.
- the class of problems the test class order randomization was meant to solve ( static fields and initialization ) was largely eliminated and we are not overly concerned about it keeping back.

alpar-t added the :Delivery/Build Build or test infrastructure label Jun 21, 2018

alpar-t assigned rjernst and alpar-t Jun 21, 2018

alpar-t mentioned this issue Jun 21, 2018

Understand Gradle 4.8 test failure around setup of security policies #31324

Closed

alpar-t added the stalled label Jun 29, 2018

alpar-t mentioned this issue Sep 3, 2018

[Build] Gradle 5.0 deprecation: should not remove tasks #33343

Closed

alpar-t mentioned this issue Sep 5, 2018

Add support for JUnit5 randomizedtesting/randomizedtesting#256

Open

alpar-t added a commit to alpar-t/elasticsearch that referenced this issue Sep 5, 2018

Remove randomizedtesting plugin elastic#31496

7f77dea

alpar-t added a commit to alpar-t/elasticsearch that referenced this issue Sep 5, 2018

Alter test security policy to work with Gradle elastic#31496

792457a

alpar-t mentioned this issue Oct 15, 2018

Replace groovy build-tools (buildSrc) with java #34459

Open

31 tasks

alpar-t mentioned this issue Dec 6, 2018

Converting randomized testing to create a separate unitTest task instead of replacing the builtin test task #36311

Merged

mark-vieira self-assigned this Mar 14, 2019

mark-vieira removed the stalled label Mar 14, 2019

mark-vieira mentioned this issue Mar 27, 2019

Replace usages RandomizedTestingTask with built-in Gradle Test #40564

Merged

alpar-t closed this as completed Apr 12, 2019

dweiss mentioned this issue Jan 4, 2020

LUCENE-9077 Print repro line for failed tests apache/lucene-solr#1138

Closed

7 tasks

mark-vieira added the Team:Delivery Meta label for Delivery team label Nov 11, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Run tests with Gradle test runner instead of randomizedtesting.junit4-ant #31496

Run tests with Gradle test runner instead of randomizedtesting.junit4-ant #31496

alpar-t commented Jun 21, 2018

elasticmachine commented Jun 21, 2018

alpar-t commented Jun 21, 2018

rjernst commented Jun 21, 2018

alpar-t commented Jun 22, 2018

alpar-t commented Jun 25, 2018

alpar-t commented Jun 25, 2018

alpar-t commented Jun 25, 2018

alpar-t commented Jun 25, 2018 •

edited

Loading

rjernst commented Jun 25, 2018

alpar-t commented Jul 13, 2018

alpar-t commented Sep 5, 2018

mark-vieira commented Feb 14, 2019

mark-vieira commented Mar 14, 2019

alpar-t commented Mar 15, 2019

Run tests with Gradle test runner instead of randomizedtesting.junit4-ant #31496

Run tests with Gradle test runner instead of randomizedtesting.junit4-ant #31496

Comments

alpar-t commented Jun 21, 2018

elasticmachine commented Jun 21, 2018

alpar-t commented Jun 21, 2018

rjernst commented Jun 21, 2018

alpar-t commented Jun 22, 2018

alpar-t commented Jun 25, 2018

alpar-t commented Jun 25, 2018

alpar-t commented Jun 25, 2018

alpar-t commented Jun 25, 2018 • edited Loading

rjernst commented Jun 25, 2018

alpar-t commented Jul 13, 2018

alpar-t commented Sep 5, 2018

mark-vieira commented Feb 14, 2019

mark-vieira commented Mar 14, 2019

alpar-t commented Mar 15, 2019

alpar-t commented Jun 25, 2018 •

edited

Loading