Reenable multithreading tests #850

vyasr · 2022-11-03T15:53:56Z

Description

Tests of multithreaded support for synced collections were added as part of the development effort. The tests were generally reliable, but we saw sporadic failures on Windows for unknown reasons, so they were marked as conditionally expected to fail on Windows (but done so incorrectly, see below). The tests were changed to be unconditionally xfailed due to development difficulties in #777. This PR adds them back in with a more target scope for skipping. This change also allows us to enable xfail_strict in our testing.

Motivation and Context

I finally have access to a Windows machine myself, so I looked into these again this morning. I cannot reproduce the failures locally on WSL no matter what I try (with/without pytest-xdist, many repetitions with pytest-repeat, etc), but I can reproduce the failures in normal Windows. The error is consistently a PermissionError encountered in the os.replace call. There is a Python bug clearly indicating that Python intentionally avoids documenting such incompatibilities, but that this is a known problem with os.replace in multithreaded settings on Windows due to the underlying filesystem instruction. As such, we should not expect this to work on Windows, and the original exclusion Windows-based skip prior to #777 is correct, not the current blanket xfail.

Our previous thread justifying the change in testing behavior identified that there was a bug in our old filtering. We were using "win" in platform.python_implementation() as a way to identify whether we were on Windows, and using that to determine whether or not to run the tests. platform.python_implementation() indicates what Python implementation we are using (CPython, Jython, PyPy, etc), not the operating system, so this filter was clearly incorrect. However, the other claim in that thread is also obviously false:

Apparently this was getting WINDOWS = True because of that bug, and hiding failures on other platforms too:

At the time we ran tests on Pypy and CPython, and both "win" in "CPython" and "win" in PyPy are always False, not always True. The effect of this error therefore could not have been to hide failures on non-Windows platforms, all it could have done is run tests on Windows that we expected to fail. Therefore, my suspicion is that what was observed in that PR were only Windows-specific errors. There were additional reports of failures in #776, but they appear to have been scattered among various other failures that occurred during the CI transition, so it is unclear whether we ever had reliable examples of failures on other operating systems.

This PR reenables these tests on the platforms where we should expect them to succeed. Given the upcoming release of signac 2.0, if we start seeing failures in these tests again we should reinvestigate our claims of multithreading support. The tests don't affect any behavior that signac relies on, only otherwise unused features of synced collections, so this is mostly relevant for documenting synced collections when we split those into a separate package post 2.0.

Checklist:

I am familiar with the Contributing Guidelines.
I agree with the terms of the Contributor Agreement.
My name is on the list of contributors.
The changes introduced by this pull request are covered by existing or newly introduced tests.
The package documentation and framework documentation in signac-docs are up to date with these changes.
I have updated the changelog and added any related issue and pull request numbers for future reference.

vyasr · 2022-11-03T15:55:07Z

I'm happy to rerun tests on this PR a few times to see if we can reproduce the failures again before we merge.

codecov · 2022-11-03T16:06:24Z

Codecov Report

Merging #850 (370be08) into master (c6c0b48) will increase coverage by 0.35%.
The diff coverage is 92.85%.

@@            Coverage Diff             @@
##           master     #850      +/-   ##
==========================================
+ Coverage   86.27%   86.63%   +0.35%     
==========================================
  Files          49       49              
  Lines        4577     4585       +8     
  Branches      905      908       +3     
==========================================
+ Hits         3949     3972      +23     
+ Misses        451      431      -20     
- Partials      177      182       +5

Impacted Files	Coverage Δ
signac/contrib/job.py	`89.73% <66.66%> (-0.07%)`	⬇️
...ac/_synced_collections/backends/collection_json.py	`98.46% <100.00%> (-1.54%)`	⬇️
...ed_collections/buffers/file_buffered_collection.py	`95.76% <0.00%> (-1.70%)`	⬇️
signac/contrib/linked_view.py	`88.00% <0.00%> (-1.60%)`	⬇️
signac/_synced_collections/numpy_utils.py	`100.00% <0.00%> (+12.00%)`	⬆️
..._synced_collections/backends/collection_mongodb.py	`90.90% <0.00%> (+40.90%)`	⬆️

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

signac/_synced_collections/backends/collection_json.py

csadorf · 2022-11-04T08:19:21Z

signac/_synced_collections/backends/collection_json.py

    """

    _backend = __name__  # type: ignore
-    _supports_threading = True
+    _supports_threading = _IsWindows()  # type: ignore


Suggested change

_supports_threading = _IsWindows() # type: ignore

_supports_threading = not _IsWindows() # type: ignore

I think I've been spending too much time in compiled languages and so I created _IsWindows just to have something that would be very obviously something evaluated at "runtime" (as opposed to a module-level constant evaluated at "compile-time" even though that obviously doesn't exist in Python). In practice I don't know that this adds any extra clarity and just increases boilerplate. What if I just inlined

_supports_threading = not ( sys.platform.startswith("win32") or sys.platform.startswith("cygin"))

Then your previous two comments also become moot.

I've done this for now, let me know if you decide you don't like it and we can go back.

I think having a dedicated private constant for this makes sense and makes the code more readable and overall more consistent. We could reuse that variable also in the tests, no?

We could. I wouldn't want to leak a constant like that into our public API, but that's the sort of gray area where I would be OK with importing internals into tests I suppose.

tests/test_synced_collections/test_json_buffered_collection.py

tests/test_synced_collections/test_json_collection.py

signac/contrib/job.py

csadorf · 2022-11-04T17:23:50Z

setup.cfg

@@ -34,6 +34,7 @@ omit =
 	*/signac/common/configobj/*.py

 [tool:pytest]
+xfail_strict = True


I consider this a rather drastic change in the behavior of the overall test suite. Should we consider to apply this a bit more granular, i.e., use strict=True where this is relevant?

Actually, wanting to enable this was my original goal and what made me remember that these multithreaded tests had been xfailed. IMO setting this can only be a good thing because it forces us to immediately deal with changes that incidentally and unexpectedly fix a bug or change behaviors in other unexpected ways that we only catch because a test suddenly starts xpassing. The value of strict passed to the decorator supersedes the value in the file, so I would rather have xfail_strict=True configured here and then manually mark with strict=False cases where for whatever reason we actually need to allow xfails, for instance because a test is flaky. I would prefer to force developers to jump through hoops in order to enable flaky tests.

I'm +1 for making the change at a global level, but I would be okay with doing it in a separate PR, or at least adding a separate changelog line to indicate this.

No need for separate PR IMO (albeit cleaner), but +1 for dedicated changelog entry.

signac/_synced_collections/backends/collection_json.py

csadorf · 2022-11-04T17:31:51Z

signac/_synced_collections/backends/collection_json.py

    """

    _backend = __name__  # type: ignore
-    _supports_threading = True
+    _supports_threading = _IsWindows()  # type: ignore


I think having a dedicated private constant for this makes sense and makes the code more readable and overall more consistent. We could reuse that variable also in the tests, no?

signac/contrib/job.py

bdice

Thank you for tracking this down and citing the Python bug. This looks like an appropriate fix to me.

bdice · 2022-11-06T15:33:56Z

@csadorf @vyasr I'll let you two sort out the rest of this PR and merge it. I'm going to focus on a few other items for 2.0.

…ltithreading

csadorf

Good to go! :)

signac/_synced_collections/backends/collection_json.py

tests/test_synced_collections/test_json_buffered_collection.py

…ab/signac into testing/reenable_multithreading

bdice · 2022-11-08T14:41:56Z

@vyasr Excluding both win32 and cygwin was correct. I tested it locally and reverted my previous changes. I will automerge this PR.

vyasr added 3 commits November 3, 2022 08:30

Disable threading support for JSONCollection on Windows.

3924d66

Change all xfailed tests to conditional skips on Windows.

996d9b4

Enable xfail strict.

eefc61c

vyasr self-assigned this Nov 3, 2022

vyasr requested review from a team as code owners November 3, 2022 15:53

vyasr requested review from csadorf and Tobias-Dwyer and removed request for a team November 3, 2022 15:53

vyasr added 6 commits November 3, 2022 18:24

Don't try to modify locks when threading support doesn't exist.

81bc73b

Reset the filename whether or not we need to play with locks.

c66443b

Remove debug prints.

d7d3ac1

Minor cleanup.

ea3bed9

Merge branch 'master' into testing/reenable_multithreading

bae5f9f

Merge branch 'master' into testing/reenable_multithreading

0be3d61

csadorf requested changes Nov 4, 2022

View reviewed changes

vyasr added 3 commits November 4, 2022 08:23

Address PR comments.

8a61983

Fix typo in comment.

0405431

Setting filename also requires the lock.

401b5d4

vyasr requested a review from csadorf November 4, 2022 15:58

csadorf reviewed Nov 4, 2022

View reviewed changes

bdice approved these changes Nov 5, 2022

View reviewed changes

vyasr added 2 commits November 6, 2022 07:41

Merge remote-tracking branch 'origin/master' into testing/reenable_mu…

6f61b76

…ltithreading

Centralize ON_WINDOWS.

c6fd88c

vyasr requested a review from csadorf November 6, 2022 15:44

csadorf approved these changes Nov 7, 2022

View reviewed changes

bdice reviewed Nov 7, 2022

View reviewed changes

signac/_synced_collections/backends/collection_json.py Show resolved Hide resolved

tests/test_synced_collections/test_json_buffered_collection.py Outdated Show resolved Hide resolved

bdice added 5 commits November 7, 2022 07:56

Update changelog.

e3dfa24

No special behavior for cygwin.

17ffa15

Merge branch 'testing/reenable_multithreading' of github.com:glotzerl…

3f2334a

…ab/signac into testing/reenable_multithreading

Merge branch 'master' into testing/reenable_multithreading

d4697b5

Disable multithreading on Cygwin. Tests were failing.

370be08

bdice enabled auto-merge (squash) November 8, 2022 14:42

bdice merged commit e69e629 into master Nov 8, 2022

bdice deleted the testing/reenable_multithreading branch November 8, 2022 14:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reenable multithreading tests #850

Reenable multithreading tests #850

vyasr commented Nov 3, 2022

vyasr commented Nov 3, 2022

codecov bot commented Nov 3, 2022 •

edited

Loading

csadorf Nov 4, 2022

vyasr Nov 4, 2022 •

edited

Loading

vyasr Nov 4, 2022

csadorf Nov 4, 2022

vyasr Nov 6, 2022

vyasr Nov 6, 2022

csadorf Nov 4, 2022

vyasr Nov 5, 2022

bdice Nov 5, 2022

csadorf Nov 7, 2022

csadorf Nov 4, 2022

bdice left a comment

bdice commented Nov 6, 2022

csadorf left a comment

bdice commented Nov 8, 2022

	_supports_threading = _IsWindows() # type: ignore
	_supports_threading = not _IsWindows() # type: ignore

Reenable multithreading tests #850

Reenable multithreading tests #850

Conversation

vyasr commented Nov 3, 2022

Description

Motivation and Context

Checklist:

vyasr commented Nov 3, 2022

codecov bot commented Nov 3, 2022 • edited Loading

Codecov Report

Choose a reason for hiding this comment

vyasr Nov 4, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bdice left a comment

Choose a reason for hiding this comment

bdice commented Nov 6, 2022

csadorf left a comment

Choose a reason for hiding this comment

bdice commented Nov 8, 2022

codecov bot commented Nov 3, 2022 •

edited

Loading

vyasr Nov 4, 2022 •

edited

Loading