Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Avoid potential deadlock in getaddrinfo #848

Open
wants to merge 6 commits into
base: master
Choose a base branch
from
Open

Conversation

vickenty
Copy link
Contributor

What does this PR do?

Avoid deadlock in the child process in getaddrinfo if the application forks while a thread is trying to create a socket.

Description of the Change

Acquire socket lock before forking to prevent the client library from calling getaddrinfo at the same time.

Alternate Designs

Possible Drawbacks

Verification Process

Additional Notes

Release Notes

Review checklist (to be filled by reviewers)

  • Feature or bug fix MUST have appropriate tests (unit, integration, etc...)
  • PR title must be written as a CHANGELOG entry (see why)
  • Files changes must correspond to the primary purpose of the PR as described in the title (small unrelated changes should have their own PR)
  • PR must have one changelog/ label attached. If applicable it should have the backward-incompatible label attached.
  • PR should not have do-not-merge/ label attached.
  • If Applicable, issue must have kind/ and severity/ labels attached at least.

Test failed to create a sender thread due to a name clash between
sender the thread and the sender callback parameter. Sender thread
produces lots of expected errors (of "Connection refused" kind), so we
also silence errors from the client library.
time.sleep(0) allows python to switch to another thread, allowing main
thread to progress through stages quicker, while still being able to
catch deadlocks from getaddrinfo or improper locking in fork hooks.
Try sending metric from the child process. This makes sure that all
locks are still in a serviceable state after the fork.
getaddrinfo may use an internal lock that, in case of a concurrent
fork, may be left in locked state and cause child process to deadlock.
@vickenty vickenty added the changelog/Fixed Fixed features results into a bug fix version bump label Aug 19, 2024
pre_fork is now mutually exclusive with get_socket() until post_fork
is called, so the sequence of events that the test was testing is no
longer possible.
@vickenty vickenty marked this pull request as ready for review August 19, 2024 13:52
@vickenty vickenty requested review from a team as code owners August 19, 2024 13:52
carlosroman
carlosroman previously approved these changes Aug 23, 2024
Copy link

This issue has been automatically marked as stale because it has not had activity in the last 30 days.
Note that the issue will not be automatically closed, but this notification will remind us to investigate why there's been inactivity.

@github-actions github-actions bot added the stale Stale - Bot reminder label Sep 23, 2024
@vickenty vickenty removed the stale Stale - Bot reminder label Sep 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
changelog/Fixed Fixed features results into a bug fix version bump
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants