Support use of multiprocessing start methods other than "spawn" (e.g. "dragon") #554

applio · 2024-08-18T23:33:01Z

Currently, cubed.runtime.executors.local.async_execute_dag() hard codes the use of the "spawn" start method when employing multiprocessing / concurrent.futures processes. This PR proposes a means for the user to specify their preferred start method via the existing keyword argument use_processes.

This proposed change would permit users to select from the existing multiprocessing start methods of "fork", "spawn", and "forkserver" as well as the newer "dragon" HPC distributed execution start method provided by the Dragon project (https://github.com/dragonhpc/dragon). An example snippet showing how a different start method can now be specified:

cubed.to_zarr(
    some_data,
    store=zg,
    use_processes="dragon",
)

It probably makes sense to document this new functionality though it appears that the keyword argument, use_processes, does not yet appear anywhere in the documentation. The "Configuration" page (https://cubed-dev.github.io/cubed/configuration.html#processes) might be a good spot to describe use_processes in general along with this added control. I would be happy to propose some documentation text if others agree on where it ought to go.

…wn" (e.g. "dragon" or "fork").

tomwhite

Thanks for the PR @applio! It's great that this is the only change needed to run on Dragon!

I was wondering if there was some way of adding a unit test (without installing Dragon), but I'm not sure there is. So I'm happy to merge it as it stands.

The "Configuration" page (https://cubed-dev.github.io/cubed/configuration.html#processes) might be a good spot to describe use_processes in general along with this added control.

Yes that would be ideal - please add some docs to that page (either as a part of this or another PR).

TomNicholas · 2024-08-19T16:57:29Z

It's great that this is the only change needed to run on Dragon!

Cool! Thanks @applio.

adding a unit test (without installing Dragon)

I think we should add unit tests which do install Dragon, but we can leave those to follow-up PRs.

TomNicholas · 2024-08-19T17:01:59Z

It's great that it's this easy to get running on Dragon, but I do wonder whether this use_processes/spawn interface is just going to be confusing for users. I would like to be able to recommend different Executors to users based on the system they are trying to run on (i.e. "Cloud? Use lithops! Local machine? Use the local executor! HPC? Use the DragonExecutor!").

Whilst the use_processes is extremely neat for us developers I wonder if that subtlety should actually be hidden from the users behind a DragonExecutor abstraction, even if it uses similar codepaths under the hood.

tomwhite · 2024-08-20T10:19:43Z

I agree with adding a DragonExecutor in a new dragon.py module. It would also be a natural place to add Dragon-specific configuration in the future.

tomwhite · 2024-09-17T08:08:39Z

We discussed this in the meeting and decided to merge it as it may be generally useful for specifying multiprocessing start methods other than "spawn". @applio will still do a separate PR for the DragonExecutor as discussed above.

Permit specification of multiprocessing start methods other than "spa…

865aa6c

…wn" (e.g. "dragon" or "fork").

This was referenced Aug 18, 2024

Example using "dragon" start method #555

Open

Running dragon as an executor from within a larger python program DragonHPC/dragon#20

Open

tomwhite approved these changes Aug 19, 2024

View reviewed changes

TomNicholas mentioned this pull request Aug 19, 2024

Relieve users of cluster management on HPC? #557

Open

tomwhite merged commit 4ff032b into cubed-dev:main Sep 17, 2024
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support use of multiprocessing start methods other than "spawn" (e.g. "dragon") #554

Support use of multiprocessing start methods other than "spawn" (e.g. "dragon") #554

applio commented Aug 18, 2024

tomwhite left a comment

TomNicholas commented Aug 19, 2024

TomNicholas commented Aug 19, 2024

tomwhite commented Aug 20, 2024

tomwhite commented Sep 17, 2024

Support use of multiprocessing start methods other than "spawn" (e.g. "dragon") #554

Support use of multiprocessing start methods other than "spawn" (e.g. "dragon") #554

Conversation

applio commented Aug 18, 2024

tomwhite left a comment

Choose a reason for hiding this comment

TomNicholas commented Aug 19, 2024

TomNicholas commented Aug 19, 2024

tomwhite commented Aug 20, 2024

tomwhite commented Sep 17, 2024