-
Notifications
You must be signed in to change notification settings - Fork 17.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
net: TestDialCancel is not compatible with new macOS ARM64 builders #52579
Comments
Hmm, I wonder if @ianlancetaylor, @neild, @bradfitz: do you think it would make sense to install |
Looks to me like |
Change https://go.dev/cl/402454 mentions this issue: |
@gopherbot, please backport to Go 1.17 and 1.18. This test is currently failing on the release branches too. |
Backport issue(s) opened: #52705 (for 1.17), #52706 (for 1.18). Remember to create the cherry-pick CL(s) as soon as the patch is submitted to master, according to https://go.dev/wiki/MinorReleases. |
My understanding of the state of this issue is that it can in Backlog milestone without a release-blocker label. That is, the known-failing test is skipped (via CL 402181) so it's not causing noise on the dashboard (#11811), and to me this test seem safe not to run on darwin/arm64 given it's run elsewhere (i.e., the chance of regression that would affect only this one particular port is low). @bcmills Should we remove release-blocker label, or can you clarify why it should be kept? Thanks. |
I agree that, given that the failure mode almost certainly indicates a mismatch in the test's assumptions rather than a bug in the However, the current skip is based on the builder name, so this test may still fail if someone who uses a similar network configuration imports the I suggest:
|
I don't see any immediate reason to think this is a platform-specific problem. It might be, but it seems much more likely that it's just a quirk of the network we're on. (I still hope to move them to a different network but it is taking a while.) I don't think skipping the tests is costing us much coverage, per Dmitri's comment, and I don't think there's anything release-blocking left now that we've backported to the skip to the release branches. I would like the net maintainers to make any further decisions on what to do here. (Perhaps there's a better way to write the test?) |
Is there a builder I can run the test on that will reliably fail? |
Both darwin-arm64-11 and -12 are exclusively machines in this
configuration.
|
I disagree: I think the remaining release-blocking action is for the That doesn't necessarily need to be a time-intensive process (a CL to skip a test only takes a couple of minutes to mail the change and set Auto-Submit+1), but it does need to happen before the release in order to tie up the loose ends of supporting |
I've been getting these failures since yesterday too, on the plan9-arm builders. It's not dependent on the platform, but on network connectivity. My internet provider's gateway machine is sending Network Unreachable ICMP messages for 198.18.0.254 - but TestDialCancel depends on connection timing out. |
Looking at RFC 6890, I don't think 198.18.0.0/15 should be globally routed - it seems to be a reserved address block for benchmarking within an organisation. Shouldn't a gateway router be dropping packets to this range, not sending them to the internet? My guess is that my internet provider has just been reconfigured to send Network Unreachable messages for this block instead of dropping them silently. I've set up a static route to black-hole this address range (by forwarding to a non-existent host). It looks like that makes the net test work again. |
Change https://go.dev/cl/496037 mentions this issue: |
TestDialCancel requires that traffic to 198.18.0.254 and 2001:2::254 be blackholed. The comment on slowDialTCP notes that "In some environments, the slow IPs may be explicitly unreachable". The network we're using for the new macOS ARM64 builders is one such environment. I skipped the test in https://go.dev/cl/402181, but it might be nice for someone more knowledgeable about this than me to look and decide what should be done.
The text was updated successfully, but these errors were encountered: