H5diffgentest error


#1

I am trying to get HDF5 1.8.21 to build and pass its tests. h5diffgentest fails, and I believe that failure is causing other test failures. I am using these modules:

Currently Loaded Modules:
  1) intel/18.0.5   2) szip/2.1.1   3) openmpi/3.1.4

The same modules were used to compile, build, and test HDF5 1.10.5 successfully, but the same error occurs when using GCC 8.2.0 and the same versions of szip and OpenMPI.

[bennetsw@gl-build h5diff]$ ./h5diffgentest 
HDF5-DIAG: Error detected in HDF5 (1.8.21) thread 0:
  #000: H5D.c line 413 in H5Dclose(): not a dataset
    major: Invalid arguments to routine
    minor: Inappropriate type
HDF5-DIAG: Error detected in HDF5 (1.8.21) thread 0:
  #000: H5D.c line 413 in H5Dclose(): not a dataset
    major: Invalid arguments to routine
    minor: Inappropriate type

This is on a GPFS parallel file system.

Is this something that can be made to work?


#2

At least for HDF5 1.10.5, this was a result of warnings being produced by OpenMPI about our fabric, which were being captured and included in the logs that HDF5 was comparing.

I found that using

$ export UCX_LOG_LEVEL=error

suppressed the warning messages, after which the h5diff tests passed.
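
For anyone repeating this, here is a rough sketch of the sequence, not a verified recipe. It assumes UCX's default log level is warn, so a stricter level such as error is what hides the fabric warnings. It also assumes the tests are rerun with make check from the HDF5 build directory, which is not stated above.

```shell
# Raise the UCX log threshold so warnings are no longer printed
# (assumption: the default level is "warn").
export UCX_LOG_LEVEL=error

# Confirm that child processes, such as the HDF5 test programs, inherit it.
sh -c 'echo "UCX_LOG_LEVEL=$UCX_LOG_LEVEL"'

# Then rerun the tests from the HDF5 build directory (assumed location), e.g.:
#   make check
```

The variable has to be exported in the same shell (or job script) that launches the tests, otherwise the test programs will not see it.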