HPC executor #467

tomwhite · 2024-05-21T15:05:10Z

Some possibilities:

Job arrays: https://slurm.schedmd.com/job_array.html (pointed out by @jeromekelleher)
funcX: https://funcx.org/
Parsl: https://parsl-project.org/

cc @TomNicholas

TomNicholas · 2024-05-21T15:12:17Z

The main NCAR machine I have access to uses PBS - perhaps there is an equivalent of slurm job arrays for PBS?

TomNicholas · 2024-06-10T13:23:56Z

I've been looking into this more, and all of these look potentially interesting to us!

perhaps there is an equivalent of slurm job arrays for PBS?

There is, and it looks very similar. Perhaps a PBSExecutor and SlurmExecutor could both inherit from an abstract JobArrayExecutor...

I actually started sketching this out, borrowing heavily from dask-jobqueue, which defines common base classes for different HPC cluster classes.

But got stuck when I realised that:

I need a way to propagate both the environment and the context of an arbitrary function (e.g. in apply_gufunc) to the bash scripts that are each job,
It might be really slow to have to wait for the queuing system to start running the tasks between every stage. Ideally once we have been allocated resources we don't really want to release them - with cloud serverless this doesn't matter so much as the time between requesting a worker and getting one is extremely short, but on a HPC queue it could literally be hours. A lot of real-world workloads are going to require the most resources at the start - could we imagine that e.g. a reduction with 3 rounds gets 100 workers, then keeps hold of 10 for the next round, then keeps hold of 1 for the final round?

funcX: https://funcx.org/

funcX has recently become Globus Compute, a new offering from the same people who run the widely-used Globus file transfer service. The Globus Compute Executor class is a subclass of Python’s concurrent.futures.Executor, so presumably could be used similarly to the existing ProcessesExecutor?.

Globus compute requires an endpoint to be set up (i.e. on the HPC system of interest), and it's pretty new so I don't know if it will be setup anywhere yet. Then it requires a Globus account (is that free?). There is a very small public tutorial endpoint we could try using for testing though.

Parsl

This seems quite similar to Globus Compute, except that it uses a decorator-centric python API, a bit like Modal stubs. It also requires setup, but it's already available on two large machines we use at [C]Worthy (Expanse and Perlmutter), which I might be able to get access to (full list of machines here). Perlmutter has a nice docs page about how to use Parsl on their system.

It solves the function context issue by restricting to only knowing about local variables within the function - see docs.

This feature also sounds really useful:

[Parsl] avoids long job scheduler queue delays by acquiring one set of resources for the entire program and it allows for scheduling of many tasks on individual nodes.

I would like to try one or two of these out. If we can get Cubed to run reasonably well on HPC that would be a big deal, and worth advertising on its own.

mgrover1 · 2024-07-12T22:27:25Z

After some discussion at SciPy, globus-compute seems like a solid option for executing functions on HPC.

Here is a link to an cookbook using it, remotely submitting jobs to an HPC cluster at a DOE lab

https://projectpythia.org/esgf-cookbook/notebooks/enso-globus.html

It might be helpful to use this in addition to parsl, to allow more function-based resource specification, as they show on the docs here

Happy to discuss more with you all - I hope this provides a good starting point!

TomNicholas · 2024-07-13T04:59:31Z

@jakirkham what was the library you were talking about earlier with the concurrent futures-like API but for HPC?

jakirkham · 2024-07-13T12:11:57Z

Was thinking of @applio's team's work on Dragon, which provides a multiprocessing style API

AIUI this could be used with concurrent.futures or Parsl (mentioned above). Though Davin would be able to say more about how to use it

https://github.com/DragonHPC/dragon

negin513 · 2024-07-13T12:21:26Z

Essentially there are two pieces needed:

an endpoint or ssh config setup to access your hpc
orchestration and interactions with schedulers. I think Dragon can do this part too. Does Dragon require docker? If so that needs to be changed to singularity/ podman for hpc,

TomNicholas · 2024-07-13T20:53:56Z

I chatted with @applio earlier and he also mentioned some kind of distributed dict, which we could potentially use as the storage layer via Zarr. Was that a part of dragon too @applio?

jakirkham · 2024-07-16T01:43:36Z

Ah forgot to mention that MPI4Py also contains concurrent.futures.Executors

tomwhite · 2024-07-18T10:34:34Z

I added some notes on how to write a new executor in #498.

tomwhite added the runtime label May 21, 2024

TomNicholas mentioned this issue Jul 15, 2024

In-memory rechunk #502

Open

This was referenced Aug 2, 2024

Various issues following installation instructions DragonHPC/dragon#19

Open

Running dragon as an executor from within a larger python program DragonHPC/dragon#20

Open

TomNicholas mentioned this issue Sep 6, 2024

Cubed vs dask.array: Convergent evolution? #570

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HPC executor #467

HPC executor #467

tomwhite commented May 21, 2024

TomNicholas commented May 21, 2024

TomNicholas commented Jun 10, 2024

mgrover1 commented Jul 12, 2024

TomNicholas commented Jul 13, 2024

jakirkham commented Jul 13, 2024

negin513 commented Jul 13, 2024

TomNicholas commented Jul 13, 2024

jakirkham commented Jul 16, 2024

tomwhite commented Jul 18, 2024

HPC executor #467

HPC executor #467

Comments

tomwhite commented May 21, 2024

TomNicholas commented May 21, 2024

TomNicholas commented Jun 10, 2024

mgrover1 commented Jul 12, 2024

TomNicholas commented Jul 13, 2024

jakirkham commented Jul 13, 2024

negin513 commented Jul 13, 2024

TomNicholas commented Jul 13, 2024

jakirkham commented Jul 16, 2024

tomwhite commented Jul 18, 2024