Make Repository.getRepositoryData an Async API #49299

Merged


@original-brownbear original-brownbear commented Nov 19, 2019

In most implementations this API call is IO-heavy and slow,
so an async signature is the more natural fit in the first place.
More concretely, this change is a prerequisite of #49060:
determining the repository generation from the cluster state
introduces situations where this call has to wait for other
operations to finish. Waiting in a blocking manner would break
SnapshotResiliencyTests and waste a thread.
It also makes it possible to use async IO in the future,
where the underlying Repository implementation provides it.

In a follow-up, SnapshotsService#getRepositoryData will be made async
as well (not done here, since that is another large change).
Note: for now this change does not alter the threading behaviour in any way (since Repository#getRepositoryData isn't forking) and is purely mechanical.

Note to reviewers: I did my best not to add new line breaks and the like, so this should hopefully be easy to review using ?w=1 (only +125 -80 lines in that case) :)
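
A minimal sketch of what moving from a blocking call to a listener-based one could look like, assuming a simplified callback interface. `Listener`, `BlockingRepository`, `AsyncRepository`, `wrap`, `describe` and the one-field `RepositoryData` below are illustrative stand-ins, not the actual Elasticsearch types touched by this PR. As in the description above, the conversion is mechanical: the work still runs on the calling thread, and only the way the result is delivered changes.

```java
import java.io.IOException;

public class AsyncRepositorySketch {

    /** Stand-in for the repository metadata (assumption: the real class is far richer). */
    static final class RepositoryData {
        final long generation;

        RepositoryData(long generation) {
            this.generation = generation;
        }
    }

    /** Hypothetical callback with a success path and a failure path. */
    interface Listener<T> {
        void onResponse(T value);

        void onFailure(Exception e);
    }

    /** Before: the caller's thread is held until the data has been read. */
    interface BlockingRepository {
        RepositoryData getRepositoryData() throws Exception;
    }

    /** After: the result or the failure is handed to the listener. */
    interface AsyncRepository {
        void getRepositoryData(Listener<RepositoryData> listener);
    }

    /** Mechanical conversion: no forking, the same thread does the same work. */
    static AsyncRepository wrap(BlockingRepository blocking) {
        return listener -> {
            RepositoryData data;
            try {
                data = blocking.getRepositoryData();
            } catch (Exception e) {
                listener.onFailure(e);
                return;
            }
            // Called outside the try block so an exception thrown by the
            // listener itself is not reported back to it as a load failure.
            listener.onResponse(data);
        };
    }

    /** Small helper that renders whichever callback fires as a string. */
    static String describe(AsyncRepository repository) {
        StringBuilder out = new StringBuilder();
        repository.getRepositoryData(new Listener<RepositoryData>() {
            @Override
            public void onResponse(RepositoryData data) {
                out.append("generation=").append(data.generation);
            }

            @Override
            public void onFailure(Exception e) {
                out.append("failed: ").append(e.getMessage());
            }
        });
        return out.toString();
    }

    public static void main(String[] args) {
        // prints "generation=42"
        System.out.println(describe(wrap(() -> new RepositoryData(42L))));
        // prints "failed: blob store unavailable"
        System.out.println(describe(wrap(() -> {
            throw new IOException("blob store unavailable");
        })));
    }
}
```

Because the wrapper does not fork, the listener has already fired by the time `getRepositoryData` returns. An implementation that has to wait for other operations could instead hold on to the listener and complete it later without parking a thread.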

In most implementations this API call is IO-heavy and slow,
so an async signature is the more natural fit in the first place.
More concretely, this change is a prerequisite of elastic#49060:
determining the repository generation from the cluster state
introduces situations where this call has to wait for other
operations to finish. Waiting in a blocking manner would break
`SnapshotResiliencyTests` and waste a thread.
It also makes it possible to use async IO in the future,
where the underlying Repository implementation provides it.

In a follow-up, `SnapshotsService#getRepositoryData` will be made async
as well (not done here, since that is another large change).
Note: for now this change does not alter the threading behavior in any way,
with the exception of blocking a GENERIC thread in `SnapshotsService#getRepositoryData`
(this should be fine since this API is only used by status APIs),
and is purely mechanical otherwise.
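
The blocking step mentioned in the note above might look roughly like the following sketch, which bridges a listener-based call back to a synchronous one. It uses `java.util.concurrent.CompletableFuture` purely as a stand-in for whatever future utility the codebase uses. `Listener`, `AsyncRepository` and `getRepositoryDataBlocking` are hypothetical names, and the repository data is reduced to a single `Long` generation for brevity.

```java
import java.util.concurrent.CompletableFuture;

public class BlockingBridgeSketch {

    /** Hypothetical callback with a success path and a failure path. */
    interface Listener<T> {
        void onResponse(T value);

        void onFailure(Exception e);
    }

    /** Hypothetical async repository; a Long generation stands in for the full data. */
    interface AsyncRepository {
        void getRepositoryData(Listener<Long> listener);
    }

    /**
     * Bridges the async call back to a blocking one. The calling thread is
     * parked in join() until the listener fires, possibly on another thread.
     * A failure surfaces as an unchecked CompletionException wrapping the cause.
     */
    static long getRepositoryDataBlocking(AsyncRepository repository) {
        CompletableFuture<Long> future = new CompletableFuture<>();
        repository.getRepositoryData(new Listener<Long>() {
            @Override
            public void onResponse(Long generation) {
                future.complete(generation);
            }

            @Override
            public void onFailure(Exception e) {
                future.completeExceptionally(e);
            }
        });
        return future.join();
    }

    public static void main(String[] args) {
        // Listener completed on the calling thread.
        System.out.println(getRepositoryDataBlocking(l -> l.onResponse(7L)));
        // Listener completed on a different thread; the caller waits for it.
        System.out.println(getRepositoryDataBlocking(
            l -> new Thread(() -> l.onResponse(9L)).start()));
    }
}
```

The thread that calls `join()` does nothing useful while it waits. That cost is acceptable when the only callers are status APIs, and it is what the follow-up removes by passing the listener through instead.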
@original-brownbear original-brownbear added >non-issue :Distributed/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs v8.0.0 v7.6.0 labels Nov 19, 2019
@elasticmachine

Pinging @elastic/es-distributed (:Distributed/Snapshot/Restore)


@ywelsch ywelsch left a comment

LGTM

@original-brownbear

Thanks Yannick!

@original-brownbear original-brownbear merged commit 4d659c4 into elastic:master Nov 19, 2019
@original-brownbear original-brownbear deleted the async-get-repositorydata branch November 19, 2019 13:39
original-brownbear added a commit to original-brownbear/elasticsearch that referenced this pull request Nov 19, 2019
original-brownbear added a commit that referenced this pull request Nov 19, 2019
original-brownbear added a commit to original-brownbear/elasticsearch that referenced this pull request Nov 19, 2019
Follow up to elastic#49299 removing the blocking step for the
snapshot status APIs as well.
original-brownbear added a commit that referenced this pull request Nov 20, 2019
* Make SnapshotsService#getRepositoryData Async

Follow up to #49299 removing the blocking step for the
snapshot status APIs as well.
original-brownbear added a commit to original-brownbear/elasticsearch that referenced this pull request Nov 20, 2019
original-brownbear added a commit that referenced this pull request Nov 20, 2019
* Make SnapshotsService#getRepositoryData Async (#49322)

Follow up to #49299 removing the blocking step for the
snapshot status APIs as well.
@original-brownbear original-brownbear restored the async-get-repositorydata branch January 6, 2021 14:08
Labels
:Distributed/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs >non-issue v7.6.0 v8.0.0-alpha1