Fix/4732 #4759

jcnelson · 2024-05-08T01:21:42Z

This fixes #4732 by tracking pending connections in addition to completed connections, so that the state machine for StackerDB doesn't re-attempt a handshake if it already has one in-flight. It also disables disconnection from dead/broken StackerDB peers, since a StackerDB peer's inability to reply to a StackerDB request (itself a best-effort replication system) should not be grounds for disconnecting from and banning the node in its entirety.

…em; just don't talk to them). instead, track connections and connection attempts

…o do so

obycode

Makes sense to me. Does this seem to make a difference in practice?

…r its neighbor key

…r, not that the stackerdb sync state machine is

…s, and don't use the PoX bitvec length to determine when to retry an inv sync

…depends on bootstrap nodes staying connected)

… latter happens fast enough on its own (it only needs to be done once per reward cycle)

…we waste a *ton* of CPU)

jcnelson · 2024-05-09T16:56:45Z

Okay, I have this running on my node, and have tested it in both IBD and steady-state modes of operation. No duplicate connections have been established.

jcnelson · 2024-05-10T21:27:02Z

ping @kantai

jcnelson · 2024-05-10T21:27:15Z

(note that this is blocking the next point release)

kantai

These changes look fine to me, but reading through this PR, it's not really clear to me exactly what behavior is being changed, and why, and how it is tested. It seems like there isn't sufficient testing here to prevent a regression: the only testing change is an assertion about the number of connections, which I can imagine being related to some of these changes, but not all of them.

jcnelson · 2024-05-13T20:46:26Z

@kantai I added some regression tests for the new functionality

obycode · 2024-05-14T00:52:16Z

There are some build issues with the latest version.

blockstack-devops · 2024-10-31T00:22:22Z

This pull request has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

jcnelson requested review from kantai and obycode May 8, 2024 01:21

jcnelson added 4 commits May 7, 2024 21:27

chore: add test to see if a connection to a neighbor is pending

51ef7d4

chore: don't track dead or broken neighbors (don't disconnect from th…

9da1120

…em; just don't talk to them). instead, track connections and connection attempts

fix: don't attempt to connect to a neighbor if we're already trying t…

d5e0a7f

…o do so

chore: verify that we don't try to connect more often than we connect

e81d6e9

obycode previously approved these changes May 8, 2024

View reviewed changes

jcnelson dismissed obycode’s stale review via 464a335 May 8, 2024 18:31

jcnelson added 2 commits May 8, 2024 14:33

fix: determine that a neighbor is connecting by either its event ID o…

464a335

…r its neighbor key

fix: pub, not pub(crate), to remove warning about visibility

e036ef0

obycode previously approved these changes May 9, 2024

View reviewed changes

jcnelson dismissed obycode’s stale review via 032c486 May 9, 2024 02:19

jcnelson added 7 commits May 8, 2024 22:28

chore: check to see if the *network* is connecting to a given neighbo…

032c486

…r, not that the stackerdb sync state machine is

fix: when in IBD mode, verify that we're connecting to bootstrap peer…

4cee411

…s, and don't use the PoX bitvec length to determine when to retry an inv sync

fix: when in IBD, be aggressive about neighbor walks (since inv sync …

eb78eb0

…depends on bootstrap nodes staying connected)

fix: when in IBD, don't bother with antientropy or inv re-sync -- the…

bba307b

… latter happens fast enough on its own (it only needs to be done once per reward cycle)

fix: typo

2ae1c2d

fix: if we're not mining, then set the poll time to be 5s (otherwise …

1f989fb

…we waste a *ton* of CPU)

Merge branch 'develop' into fix/4732

1725915

obycode previously approved these changes May 9, 2024

View reviewed changes

Merge branch 'develop' into fix/4732

0f6b829

kantai reviewed May 11, 2024

View reviewed changes

jcnelson dismissed obycode’s stale review via 88517e4 May 13, 2024 20:46

jcnelson added 2 commits May 13, 2024 16:56

chore: address PR feedback by adding more regression tests

88517e4

Merge branch 'develop' into fix/4732

a71abf7

jcnelson added 3 commits May 14, 2024 14:19

chore: fix compile issues in stacks-node

e6021f5

Merge branch 'develop' into fix/4732

b65fa72

Merge branch 'develop' into fix/4732

253304a

jcnelson requested review from obycode and kantai May 15, 2024 02:40

kantai approved these changes May 15, 2024

View reviewed changes

jcnelson enabled auto-merge May 15, 2024 14:12

obycode approved these changes May 15, 2024

View reviewed changes

jcnelson added this pull request to the merge queue May 15, 2024

Merged via the queue into develop with commit 368541f May 15, 2024
1 check passed

blockstack-devops added the locked label Oct 31, 2024

stacks-network locked as resolved and limited conversation to collaborators Oct 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix/4732 #4759

Fix/4732 #4759

jcnelson commented May 8, 2024

obycode left a comment

jcnelson commented May 9, 2024

jcnelson commented May 10, 2024

jcnelson commented May 10, 2024

kantai left a comment

jcnelson commented May 13, 2024

obycode commented May 14, 2024

blockstack-devops commented Oct 31, 2024

Fix/4732 #4759

Fix/4732 #4759

Conversation

jcnelson commented May 8, 2024

obycode left a comment

Choose a reason for hiding this comment

jcnelson commented May 9, 2024

jcnelson commented May 10, 2024

jcnelson commented May 10, 2024

kantai left a comment

Choose a reason for hiding this comment

jcnelson commented May 13, 2024

obycode commented May 14, 2024

blockstack-devops commented Oct 31, 2024