help interpreting h5stat output

miller86 · February 5, 2010, 12:46am

I admit I haven't searched for nor read information on h5stat. So, if
someone wants to tell me to rtfm, thats fine. Just tell me what to read.

Otherwise, I have some questions interpreting this output from h5stat...

Dataset dimension information:
        Max. rank of datasets: 2
        Dataset ranks:
                # of dataset with rank 1: 188
                # of dataset with rank 2: 18

I assume 'rank' is the number of dimensions. right?

1-D Dataset information:
Max. dimension size of 1-D datasets: 9

Ok, this totaly throwse me. What does 'dimension size' mean? It cannot
possibly mean the maximum size of a dimension in 1D datasets as the file
contains 1D datasets of size 1 all the way up to several thousand.

        Small 1-D datasets:
                # of dataset dimensions of size 1: 45
                # of dataset dimensions of size 4: 3
                # of dataset dimensions of size 6: 3
                # of dataset dimensions of size 8: 3
                # of dataset dimensions of size 9: 1

Ditto here. I just don't understand what the above lines are telling me?
How is 'small' defined? Is that controlled by any command-line switch?

                Total small datasets: 55
        1-D Dataset dimension bins:
                # of datasets of size 1 - 9: 55
                # of datasets of size 10 - 99: 117
                # of datasets of size 100 - 999: 10
                # of datasets of size 1000 - 9999: 4
                # of datasets of size 10000 - 99999: 1
                # of datasets of size 1000000 - 9999999: 1
                Total # of datasets: 188

This is great. But, I have 2D datasets also. Why don't I see similar
output for 2D datasets?

···

Dataset storage information:
        Total raw data size: 144275687
Dataset layout information:
        Dataset layout counts[COMPACT]: 0
        Dataset layout counts[CONTIG]: 206
        Dataset layout counts[CHUNKED]: 0
        Number of external files : 0
Dataset filters information:
        Number of datasets with:
                NO filter: 206
                GZIP filter: 0
                SHUFFLE filter: 0
                FLETCHER32 filter: 0
                SZIP filter: 0
                NBIT filter: 0
                SCALEOFFSET filter: 0
                USER-DEFINED filter: 0

--
Mark C. Miller, Lawrence Livermore National Laboratory
================!!LLNL BUSINESS ONLY!!================
miller86@llnl.gov urgent: miller86@pager.llnl.gov
T:8-6 (925)-423-5901 M/W/Th:7-12,2-7 (530)-753-851

Quincey_Koziol · February 8, 2010, 4:23pm

Hi Mark,

I admit I haven't searched for nor read information on h5stat. So, if
someone wants to tell me to rtfm, thats fine. Just tell me what to read.

Otherwise, I have some questions interpreting this output from h5stat...

Dataset dimension information:
       Max. rank of datasets: 2
       Dataset ranks:
               # of dataset with rank 1: 188
               # of dataset with rank 2: 18

I assume 'rank' is the number of dimensions. right?

Yes.

1-D Dataset information:
Max. dimension size of 1-D datasets: 9

Ok, this totaly throwse me. What does 'dimension size' mean? It cannot
possibly mean the maximum size of a dimension in 1D datasets as the file
contains 1D datasets of size 1 all the way up to several thousand.

Hmm, it does mean that, so the h5stat tool may have a bug. Are you working with the latest (1.8.4) release?

       Small 1-D datasets:
               # of dataset dimensions of size 1: 45
               # of dataset dimensions of size 4: 3
               # of dataset dimensions of size 6: 3
               # of dataset dimensions of size 8: 3
               # of dataset dimensions of size 9: 1

Ditto here. I just don't understand what the above lines are telling me?
How is 'small' defined? Is that controlled by any command-line switch?

"small" is <10. It's not a user-tunable request (but I'll file an enhancement request for you

               Total small datasets: 55
       1-D Dataset dimension bins:
               # of datasets of size 1 - 9: 55
               # of datasets of size 10 - 99: 117
               # of datasets of size 100 - 999: 10
               # of datasets of size 1000 - 9999: 4
               # of datasets of size 10000 - 99999: 1
               # of datasets of size 1000000 - 9999999: 1
               Total # of datasets: 188

This is great. But, I have 2D datasets also. Why don't I see similar
output for 2D datasets?

Well, it seemed like it would be much more complex for 2D datasets, since there would be n^2 combinations possible, and n^3 combinations for 3D datasets, etc. I'll file another enhancement request to revisit this idea and see if we can come up with good ways to present the information for 2+ dimension datasets.

Quincey

···

On Feb 4, 2010, at 6:46 PM, Mark Miller wrote:

Dataset storage information:
       Total raw data size: 144275687
Dataset layout information:
       Dataset layout counts[COMPACT]: 0
       Dataset layout counts[CONTIG]: 206
       Dataset layout counts[CHUNKED]: 0
       Number of external files : 0
Dataset filters information:
       Number of datasets with:
               NO filter: 206
               GZIP filter: 0
               SHUFFLE filter: 0
               FLETCHER32 filter: 0
               SZIP filter: 0
               NBIT filter: 0
               SCALEOFFSET filter: 0
               USER-DEFINED filter: 0

--
Mark C. Miller, Lawrence Livermore National Laboratory
================!!LLNL BUSINESS ONLY!!================
miller86@llnl.gov urgent: miller86@pager.llnl.gov
T:8-6 (925)-423-5901 M/W/Th:7-12,2-7 (530)-753-851

_______________________________________________
Hdf-forum is for HDF software users discussion.
Hdf-forum@hdfgroup.org
http://mail.hdfgroup.org/mailman/listinfo/hdf-forum_hdfgroup.org

Attention! https://support.hdfgroup.org is the NEW home for documentation from The HDF Group. (Details)

help interpreting h5stat output