Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MPI fapl_mpio duplicate test failure on Fedora Rawhide #4925

Open
opoplawski opened this issue Oct 4, 2024 · 3 comments
Open

MPI fapl_mpio duplicate test failure on Fedora Rawhide #4925

opoplawski opened this issue Oct 4, 2024 · 3 comments
Assignees
Labels
Component - Parallel Parallel HDF5 (NOT thread-safety) Component - Testing Code in test or testpar directories, GitHub workflows Priority - 1. High 🔼 These are important issues that should be resolved in the next release Type - Bug Please report security issues to [email protected] instead of creating an issue on GitHub

Comments

@opoplawski
Copy link
Contributor

Describe the bug

Test log for testphdf5 
============================
===================================
PHDF5 TESTS START
===================================
MPI-process 4.MPI-process 1.MPI-process 0. hostname=8157f7de7c0d45a69bca387aecea5d54
For help use: /builddir/build/BUILD/hdf5-1.14.5-build/hdf5-hdf5_1.14.5/openmpi/testpar/.libs/testphdf5 -help
Linked with hdf5 version 1.14 release 5
*** Hint ***
You can use environment variable HDF5_PARAPREFIX to run parallel test files in a
different directory or to add file type prefix. e.g.,
   HDF5_PARAPREFIX=pfs:/PFS/user/me
   export HDF5_PARAPREFIX
*** End of Hint ***
Test filenames are:
    ParaTest.h5
Testing  -- fapl_mpio duplicate (mpiodup) 
Proc 0: *** Parallel ERROR ***
    VRFY (new and old nkeys equal) failed at line  104 in ../../testpar/t_ph5basic.c
aborting MPI processes
MPI-process 5. hostname=8157f7de7c0d45a69bca387aecea5d54
Proc 5: *** Parallel ERROR ***
    VRFY (new and old nkeys equal) failed at line  104 in ../../testpar/t_ph5basic.c
aborting MPI processes
MPI-process 3. hostname=8157f7de7c0d45a69bca387aecea5d54
Proc 3: *** Parallel ERROR ***
    VRFY (new and old nkeys equal) failed at line  104 in ../../testpar/t_ph5basic.c
aborting MPI processes
MPI-process 2. hostname=8157f7de7c0d45a69bca387aecea5d54
Proc 2: *** Parallel ERROR ***
    VRFY (new and old nkeys equal) failed at line  104 in ../../testpar/t_ph5basic.c
aborting MPI processes
 hostname=8157f7de7c0d45a69bca387aecea5d54
Proc 4: *** Parallel ERROR ***
    VRFY (new and old nkeys equal) failed at line  104 in ../../testpar/t_ph5basic.c
aborting MPI processes
 hostname=8157f7de7c0d45a69bca387aecea5d54
Proc 1: *** Parallel ERROR ***
    VRFY (new and old nkeys equal) failed at line  104 in ../../testpar/t_ph5basic.c
aborting MPI processes
--------------------------------------------------------------------------
MPI_ABORT was invoked on rank 3 in communicator MPI_COMM_WORLD
  Proc: [[61674,1],3]
  Errorcode: 1
NOTE: invoking MPI_ABORT causes Open MPI to kill all MPI processes.
You may or may not see output from other processes, depending on
exactly when Open MPI kills them.
--------------------------------------------------------------------------
�Command exited with non-zero status 1

A clear and concise description of what the bug is.

Expected behavior
No test failures

Platform (please complete the following information)

  • HDF5 version 1.14.5
  • OS and version - Fedora Rawhide
  • Compiler and version - gcc 14.2.1
  • Build system - autotools
  • Any configure options you specified
+ CFLAGS='-O2 -flto=auto -ffat-lto-objects -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS -specs=/usr/lib/rpm/redhat/redhat-hardened-cc1 -fstack-protector-strong -specs=/usr/lib/rpm/redhat/redhat-annobin-cc1  -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer '
+ export CFLAGS
+ CXXFLAGS='-O2 -flto=auto -ffat-lto-objects -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS -specs=/usr/lib/rpm/redhat/redhat-hardened-cc1 -fstack-protector-strong -specs=/usr/lib/rpm/redhat/redhat-annobin-cc1  -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer '
+ export CXXFLAGS
+ FFLAGS='-O2 -flto=auto -ffat-lto-objects -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS -specs=/usr/lib/rpm/redhat/redhat-hardened-cc1 -fstack-protector-strong -specs=/usr/lib/rpm/redhat/redhat-annobin-cc1  -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules '
+ export FFLAGS
+ FCFLAGS='-O2 -flto=auto -ffat-lto-objects -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS -specs=/usr/lib/rpm/redhat/redhat-hardened-cc1 -fstack-protector-strong -specs=/usr/lib/rpm/redhat/redhat-annobin-cc1  -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -mtls-dialect=gnu2 -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules '
+ export FCFLAGS
+ ../configure --build=x86_64-redhat-linux-gnu --host=x86_64-redhat-linux-gnu --program-prefix= --disable-dependency-tracking --prefix=/usr --exec-prefix=/usr --bindir=/usr/bin --sbindir=/usr/sbin --sysconfdir=/etc --datadir=/usr/share --includedir=/usr/include --libdir=/usr/lib64 --libexecdir=/usr/libexec --localstatedir=/var --runstatedir=/run --sharedstatedir=/var/lib --mandir=/usr/share/man --infodir=/usr/share/info --disable-silent-rules --enable-fortran --enable-hl --enable-shared --with-szlib --enable-cxx --enable-java --with-default-plugindir=/usr/lib64/hdf5/plugin
  • MPI library and version (parallel HDF5)

openmpi 5.0.5

Additional context
Working on trying to update the Fedora hdf5 package to the 1.14.X branch.

@jhendersonHDF
Copy link
Collaborator

See open-mpi/ompi#12742

@opoplawski
Copy link
Contributor Author

Thanks. With the autotools tests, is there an easy way to skip this particular test?

@vchoi-hdfgroup vchoi-hdfgroup added Component - Parallel Parallel HDF5 (NOT thread-safety) Component - Testing Code in test or testpar directories, GitHub workflows labels Oct 4, 2024
@jhendersonHDF
Copy link
Collaborator

Unfortunately there isn't really an easy way to do so with autotools without hacking on the source. Removing 'testphdf5' from https://github.com/HDFGroup/hdf5/blob/develop/testpar/Makefile.am#L36 should help, but testphdf5 is also the main parallel test so that may not be desirable.

@derobins derobins added Priority - 1. High 🔼 These are important issues that should be resolved in the next release Type - Bug Please report security issues to [email protected] instead of creating an issue on GitHub labels Oct 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Component - Parallel Parallel HDF5 (NOT thread-safety) Component - Testing Code in test or testpar directories, GitHub workflows Priority - 1. High 🔼 These are important issues that should be resolved in the next release Type - Bug Please report security issues to [email protected] instead of creating an issue on GitHub
Projects
None yet
Development

No branches or pull requests

4 participants