Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parallel processing for downloading and unziping #146

Open
koleckt opened this issue Dec 15, 2020 · 3 comments
Open

Parallel processing for downloading and unziping #146

koleckt opened this issue Dec 15, 2020 · 3 comments
Labels
enhancement New feature or request

Comments

@koleckt
Copy link

koleckt commented Dec 15, 2020

We can save a lot of processing time, specially for large dataset of product (time serie) if the downloading and the unziping (exemple of S1 products) can be done in 2 differents threads.

Actually, when downloading, the CPU is not used at full capacity, and when unzip, the network is not use.

Doing both in parallel can speedup a lot the processing codes.

@koleckt koleckt changed the title Parallel processing for downloding and unziping Parallel processing for downloading and unziping Dec 15, 2020
@sbrunato
Copy link
Collaborator

Thanks for this suggestion, we will work on a new feature that provide a way to parallelize products downloading, and keep you informed of the progress in this issue

@sbrunato
Copy link
Collaborator

#120

@sbrunato sbrunato added the enhancement New feature or request label Jan 29, 2021
@sbrunato
Copy link
Collaborator

See if there is a way to provide to the downloading method a parameter referencing to a generic pool / worker scheduler, compatible with Dask or other solutions

Ask as parameter an executor like Dask.Client, or concurrent.futures that use the same interfaces. See https://distributed.dask.org/en/latest/client.html

Some serializing might be needed (already implemented in eodag):

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants