Skip to content

Commit

Permalink
Merge pull request #37 from yut23/perlmutter-gpu-oom-note
Browse files Browse the repository at this point in the history
Add note about amrex.the_arena_init_size=0 on Perlmutter
  • Loading branch information
zingale authored Sep 13, 2024
2 parents 6ccaa3f + 87326c0 commit cf1dd00
Showing 1 changed file with 7 additions and 0 deletions.
7 changes: 7 additions & 0 deletions sphinx_docs/source/nersc-workflow.rst
Original file line number Diff line number Diff line change
Expand Up @@ -22,6 +22,13 @@ includes the restart logic to allow for job chaining.
.. literalinclude:: ../../job_scripts/perlmutter/perlmutter.submit
:language: sh

.. note::

With large reaction networks, you may get GPU out-of-memory errors during
the first burner call. If this happens, you can add
``amrex.the_arena_init_size=0`` after ``${restartString}`` in the srun call
so AMReX doesn't reserve 3/4 of the GPU memory for the device arena.

Below is an example that runs on CPU-only nodes. Here ``ntasks-per-node``
refers to number of MPI processes (used for distributed parallelism) per node,
and ``cpus-per-task`` refers to number of hyper threads used per task
Expand Down

0 comments on commit cf1dd00

Please sign in to comment.