Chunking and Parallel HDF5

Hello All,

It seems to me that having a chunk cache is rather important in ensuring performance with chunked storage. How do existing applications leverage chunking and parallel HDF5 without the chunk cache?

Also, the documentation seems to imply that there is some thought going into raw dataset chunk caching with MPI. Is this true?

Thanks!
David