chunk cache

On my MacBook I create a chunked float array of shape [nz,ny,nx]=[500,1024,1024] (2 GBytes) with a chunk shape of [20,32,32] (81920 bytes per chunk). The array is as large as the machine's memory, so the measured times reflect true I/O; the kernel's file cache cannot hold all file pages. I'm using HDF5 1.8.3 to make use of the new H5Pset_chunk_cache function.
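In outline the setup looks roughly like this (just a sketch; the file and dataset names, the slot count, and the missing error handling are mine and differ from the real test program):

  #include "hdf5.h"

  int main(void)
  {
      hsize_t dims[3]  = {500, 1024, 1024};   /* [nz,ny,nx] */
      hsize_t chunk[3] = {20, 32, 32};        /* 20*32*32 floats = 81920 bytes per chunk */

      hid_t file  = H5Fcreate("cube.h5", H5F_ACC_TRUNC, H5P_DEFAULT, H5P_DEFAULT);
      hid_t space = H5Screate_simple(3, dims, NULL);

      /* chunked layout via the dataset creation property list */
      hid_t dcpl = H5Pcreate(H5P_DATASET_CREATE);
      H5Pset_chunk(dcpl, 3, chunk);

      /* per-dataset chunk cache of 1 chunk (81920 bytes); 521 slots is an
         arbitrary prime chosen for this sketch */
      hid_t dapl = H5Pcreate(H5P_DATASET_ACCESS);
      H5Pset_chunk_cache(dapl, 521, 81920, H5D_CHUNK_CACHE_W0_DEFAULT);

      hid_t dset = H5Dcreate2(file, "cube", H5T_NATIVE_FLOAT, space,
                              H5P_DEFAULT, dcpl, dapl);

      /* ... write the 500x1024x1024 floats, e.g. chunk by chunk ... */

      H5Dclose(dset); H5Pclose(dapl); H5Pclose(dcpl);
      H5Sclose(space); H5Fclose(file);
      return 0;
  }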

Creating the array takes about 60 seconds and reading it back chunk by chunk takes about 60 seconds as well. In both cases the cache size is set to 1 chunk (81920 bytes). These times are more or less as expected.

However, reading the data x-vector by x-vector (in chunk order) takes 160 seconds, although the cache is set up to hold 32 chunks (2621440 bytes). Its hash table has 3203 slots (the next prime larger than 100 times the number of chunks the cache can hold, as advised in the HDF5 documentation). Most of the time is spent in user time (over 100 seconds); the actual I/O time seems to be fine.
By reading x-vectors in chunk order I mean that the vectors are not read in strict y,z order; instead, all y,z indices of one chunk are processed before moving on to the next row of chunks. When reading in strict y,z order the cache would need to be much larger.
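The per-vector read is essentially the following (continuing the sketch above; the loop over all (z,y) positions in chunk order is omitted):

  /* cache sized for a row of 32 chunks along x; 3203 slots is the next
     prime above 100 * 32 chunks */
  hid_t dapl = H5Pcreate(H5P_DATASET_ACCESS);
  H5Pset_chunk_cache(dapl, 3203, 32 * 81920, H5D_CHUNK_CACHE_W0_DEFAULT);
  hid_t dset = H5Dopen2(file, "cube", dapl);

  /* select the x-vector at file position (z,y) */
  hsize_t z = 0, y = 0;                    /* example indices */
  hid_t   fspace   = H5Dget_space(dset);
  hsize_t start[3] = {z, y, 0};
  hsize_t count[3] = {1, 1, 1024};
  H5Sselect_hyperslab(fspace, H5S_SELECT_SET, start, NULL, count, NULL);

  /* rank-1 memory dataspace holding one x-vector */
  hsize_t mdims[1] = {1024};
  hid_t   mspace   = H5Screate_simple(1, mdims, NULL);

  float buf[1024];
  H5Dread(dset, H5T_NATIVE_FLOAT, mspace, fspace, H5P_DEFAULT, buf);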

I did another test with a cube of shape [10,512,512] and a chunk shape equal to the cube shape. In this case too, reading by x-vector took much more time than reading by chunk.
So the question is: what is HDF5 doing that costs so much user time for this task? Is it memcpy, hashing, or something else?

This is a very important issue for us. Our astronomical image cubes will be 3-dimensional with axes [freq,dec,ra]. Usually the data are retrieved as [dec,ra] planes, but sometimes as a frequency profile for a specific [dec,ra] point, so efficient access in all directions is important. We thought that chunking would help us here, but that is not all that clear.
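To be explicit, the two access patterns amount to hyperslab selections like these (a sketch; fspace is the dataset's file dataspace as in the earlier fragment, and the test cube's dimensions stand in for the real cube sizes):

  hsize_t f = 0, dec = 0, ra = 0;          /* example indices */

  /* one [dec,ra] plane at frequency channel f */
  hsize_t pstart[3] = {f, 0, 0};
  hsize_t pcount[3] = {1, 1024, 1024};
  H5Sselect_hyperslab(fspace, H5S_SELECT_SET, pstart, NULL, pcount, NULL);

  /* the frequency profile at one [dec,ra] position */
  hsize_t vstart[3] = {0, dec, ra};
  hsize_t vcount[3] = {500, 1, 1};
  H5Sselect_hyperslab(fspace, H5S_SELECT_SET, vstart, NULL, vcount, NULL);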

Cheers,
Ger van Diepen



Hi,

> However, reading the data x-vector by x-vector (in chunk order) takes 160 seconds, although the cache is set up to hold 32 chunks (2621440 bytes). Its hash table has 3203 slots (the next prime larger than 100 times the number of chunks the cache can hold, as advised in the HDF5 documentation). Most of the time is spent in user time (over 100 seconds); the actual I/O time seems to be fine.

I haven't had this particular issue, but I know there's a serious performance problem for chunked writing if the rank of your memory dataspace doesn't match the rank of the dataset. It results in a slowdown of 3x-6x in my case, and it manifests in much the same way (all user time). If you haven't already, you could try reading in with a memory dataspace of [1,1,x] instead of just [x].
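Roughly like this (a sketch, not tested here; it reuses the dataset handle, file selection, and read buffer from the fragment in the original post):

  /* rank-1 memory dataspace: shape [x] */
  hsize_t m1[1] = {1024};
  hid_t mem1 = H5Screate_simple(1, m1, NULL);

  /* rank-3 memory dataspace matching the dataset rank: shape [1,1,x] */
  hsize_t m3[3] = {1, 1, 1024};
  hid_t mem3 = H5Screate_simple(3, m3, NULL);

  /* same buffer and file selection; only the memory dataspace rank changes */
  H5Dread(dset, H5T_NATIVE_FLOAT, mem3, fspace, H5P_DEFAULT, buf);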

Andrew Collette



Hi Ger,


On Jun 15, 2009, at 12:08 AM, Ger van Diepen wrote:

> On my MacBook I create a chunked float array of shape [nz,ny,nx]=[500,1024,1024] (2 GBytes) with a chunk shape of [20,32,32] (81920 bytes per chunk). The array is as large as the machine's memory, so the measured times reflect true I/O; the kernel's file cache cannot hold all file pages. I'm using HDF5 1.8.3 to make use of the new H5Pset_chunk_cache function.
>
> Creating the array takes about 60 seconds and reading it back chunk by chunk takes about 60 seconds as well. In both cases the cache size is set to 1 chunk (81920 bytes). These times are more or less as expected.
>
> However, reading the data x-vector by x-vector (in chunk order) takes 160 seconds, although the cache is set up to hold 32 chunks (2621440 bytes). Its hash table has 3203 slots (the next prime larger than 100 times the number of chunks the cache can hold, as advised in the HDF5 documentation). Most of the time is spent in user time (over 100 seconds); the actual I/O time seems to be fine.
> By reading x-vectors in chunk order I mean that the vectors are not read in strict y,z order; instead, all y,z indices of one chunk are processed before moving on to the next row of chunks. When reading in strict y,z order the cache would need to be much larger.
>
> I did another test with a cube of shape [10,512,512] and a chunk shape equal to the cube shape. In this case too, reading by x-vector took much more time than reading by chunk.
> So the question is: what is HDF5 doing that costs so much user time for this task? Is it memcpy, hashing, or something else?
>
> This is a very important issue for us. Our astronomical image cubes will be 3-dimensional with axes [freq,dec,ra]. Usually the data are retrieved as [dec,ra] planes, but sometimes as a frequency profile for a specific [dec,ra] point, so efficient access in all directions is important. We thought that chunking would help us here, but that is not all that clear.

  Hmm, I would have thought that our recent changes in the 1.8.3 release would have improved this situation. Can you send us the test code that shows the problem, so we can try to find a little time to look into it?

  Quincey