-
Notifications
You must be signed in to change notification settings - Fork 3.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WebDriver community stability jobs are frequently timing out #31499
Comments
I have PR #31499 open to run some jobs to investigate the issues. |
Fixing the slow running tests here is a good idea for general perf/resource usage reasons. But the stability jobs are running each test 10x in total, so if we can't run all the tests in under 12 minutes we aren't going to see the jobs complete. In practice the fact that changes to
In paratice I think we need to do both to fix this issue. |
This work fine now with a recent Firefox Nightly build. The calls take under 1s now:
@jgraham, I would propose that we run the stability jobs in |
I think for all CI jobs we're currently not using headless. I'm happy to use headless for Firefox stability jobs in particular, if you think that improves performance (for non-stability jobs correctness is more important, for stability jobs the tradeoff is more delicate). Currently we hardcode |
This is a problem that we frequently hit when changes are done that affect a large portion of WebDriver tests, eg. when changing fixtures or helpers. The stability jobs as triggered by the sink job will then most likely fail because of the 120 minutes timeout.
It would be great to get this problem investigated because it's strange that it takes that long for the jobs to run. When we run wdspec jobs in our own CI for Firefox each of the 3 chunks take approximately 20 minutes which means that we end-up with 60 minutes. In headless mode we even have only 2 chunks that take around 15 minutes.
Seeing a log of such a stability jobs for Firefox I noticed the following:
We do not run in headless mode. Maybe it might be good to change that to reduce the duration of the full job? If it cannot be done for Chrome maybe we could change it for Firefox?
For Firefox a lot of timeouts are visible for tests related to
test_no_top_browsing_context
. These are failures that we do not see in our own CI and are causing delays of 30s or 3 minutes for each instance of this test for different WebDriver commands - based on if thetimeout=long
meta tag is present or not. With the amount of these tests all sum up to around 22 minutes of extra time.For now I would like to get started with the above two issues. CC'ing @jgraham, @juliandescottes, @foolip for their ideas and feedback.
The text was updated successfully, but these errors were encountered: