allow oversubscription in OpenMPI sanity check #2611

Merged
boegel merged 1 commit into easybuilders:develop on Dec 9, 2021

Conversation

ocaisa
Member

@ocaisa ocaisa commented Oct 28, 2021

Hyperthreading can sometimes cause the OpenMPI sanity check to fail unnecessarily.

Alternative to #2570, closes easybuilders/easybuild-easyconfigs#13945

@ocaisa
Member Author

ocaisa commented Oct 28, 2021

Test report by @ocaisa

Overview of tested easyconfigs (in order)

  • SUCCESS OpenMPI-4.1.1-GCC-11.2.0.eb

Build succeeded for 1 out of 1 (1 easyconfigs in total)
login1 - Linux rocky linux 8.4, x86_64, Intel(R) Xeon(R) CPU E5-2690 v3 @ 2.60GHz, Python 3.6.8
See https://gist.github.com/08193ef862eeeffed805539ec08beb58 for a full test report.

@ocaisa
Member Author

ocaisa commented Oct 28, 2021

You can see the results in the trace output (or the logs):

  >> running command 'OMPI_MCA_rmaps_base_oversubscribe=1 mpirun -n 8 /home/ocaisa/.local/easybuild/build/OpenMPI/4.1.1/GCC-11.2.0/mpi_test_hello_c' ...
  >> result for command 'OMPI_MCA_rmaps_base_oversubscribe=1 mpirun -n 8 /home/ocaisa/.local/easybuild/build/OpenMPI/4.1.1/GCC-11.2.0/mpi_test_hello_c': OK
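
As a rough illustration of how a command line like the one in the trace can be assembled, here is a minimal Python sketch. The helper name `mpirun_cmd` and the path in the usage example are hypothetical; this is not the code from the easyblock, only one way to prefix an `mpirun` call with an environment variable such as `OMPI_MCA_rmaps_base_oversubscribe`.

```python
import shlex


def mpirun_cmd(binary, nprocs, env=None):
    """Build a shell command string that runs `binary` under mpirun.

    `env` is an optional dict of environment variables that are set only
    for this command, using the shell's `VAR=value command` prefix form.
    (Hypothetical helper, for illustration only.)
    """
    env = env or {}
    # Quote values and the binary path so unusual characters stay intact.
    prefix = " ".join(f"{key}={shlex.quote(str(val))}" for key, val in env.items())
    cmd = f"mpirun -n {nprocs} {shlex.quote(binary)}"
    return f"{prefix} {cmd}" if prefix else cmd


if __name__ == "__main__":
    # Hypothetical path; the real test binary lives in the build directory.
    print(mpirun_cmd("/tmp/mpi_test_hello_c", 8,
                     env={"OMPI_MCA_rmaps_base_oversubscribe": 1}))
```

Setting the variable as a command prefix keeps it scoped to that single `mpirun` invocation, so the rest of the build environment is left unchanged.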

@smoors
Contributor

smoors commented Nov 29, 2021

if hyperthreading is the only issue here, maybe it's enough to set OMPI_MCA_hwloc_base_use_hwthreads_as_cpus=1?
see #2596

@boegel
Member

boegel commented Dec 7, 2021

> if hyperthreading is the only issue here, maybe it's enough to set OMPI_MCA_hwloc_base_use_hwthreads_as_cpus=1? see #2596

It's more than that: we also want the OpenMPI sanity check to pass when only a single core is available, for example.
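
To make the difference between the two settings concrete, here is a deliberately simplified Python model. It assumes that, without a hostfile, Open MPI offers one slot per core by default, one slot per hardware thread when hardware threads are counted as CPUs, and no slot limit when oversubscription is allowed. The function names are hypothetical and the real mapping logic in Open MPI is more involved.

```python
def available_slots(cores, threads_per_core=1, use_hwthreads_as_cpus=False):
    """Number of slots in this simplified model (assumption, not Open MPI code)."""
    if use_hwthreads_as_cpus:
        return cores * threads_per_core
    return cores


def launch_allowed(nprocs, cores, threads_per_core=1,
                   use_hwthreads_as_cpus=False, oversubscribe=False):
    """Return True if `mpirun -n nprocs` would be accepted in this model."""
    if oversubscribe:
        # Oversubscription lifts the slot limit entirely.
        return True
    return nprocs <= available_slots(cores, threads_per_core, use_hwthreads_as_cpus)


if __name__ == "__main__":
    # Single core, no hyperthreading: counting hardware threads does not help.
    print(launch_allowed(8, cores=1, use_hwthreads_as_cpus=True))
    # Allowing oversubscription lets the 8-process check run anyway.
    print(launch_allowed(8, cores=1, oversubscribe=True))
```

In this model, counting hardware threads only raises the slot count on machines that actually have hyperthreading, whereas oversubscription removes the limit regardless of the hardware.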

@boegel boegel changed the title from "Allow oversubscription in OpenMPI MPI execution check" to "Allow oversubscription in OpenMPI sanity check" Dec 7, 2021
@boegel boegel changed the title from "Allow oversubscription in OpenMPI sanity check" to "allow oversubscription in OpenMPI sanity check" Dec 7, 2021
@boegel
Member

boegel commented Dec 9, 2021

Test report by @boegel

Overview of tested easyconfigs (in order)

  • SUCCESS OpenMPI-3.1.1-GCC-7.3.0-2.30.eb
  • SUCCESS OpenMPI-4.0.3-GCC-9.3.0.eb
  • SUCCESS OpenMPI-4.1.2-GCC-11.2.0.eb

Build succeeded for 3 out of 3 (3 easyconfigs in total)
node2624.swalot.os - Linux centos linux 7.9.2009, x86_64, Intel(R) Xeon(R) CPU E5-2660 v3 @ 2.60GHz (haswell), Python 3.6.8
See https://gist.github.com/34f767f48d21a86b83a916d01694c40a for a full test report.

Member

@boegel boegel left a comment

lgtm

@boegel boegel merged commit 05456db into easybuilders:develop Dec 9, 2021
@ocaisa ocaisa deleted the ompi_oversubscribe branch December 9, 2021 08:43

Successfully merging this pull request may close these issues.

OpenMPI does not build on machine with six cores
3 participants