Jason,
I hear you guys need funding regularly but I thought DOE/Whamcloud/Cray gave you guys a healthy chunk of change to fix alot of long standing issues that would be very beneficial to get done.
Unfortunately support for better handling of the VL datatypes (including compression) was not among the features they decided to fund. I'll let Quincey to elaborate on what DOE and Co. sponsored.
Things I've been noting during my usage of HDF5:
No support for filters on vlen types, in particular - compression
Architecture is already in place. We just need resources.
No atomic transactions for writing new data (ie you crash, your document can be corrupted because of metadata issues)
Good news is that this will be addressed by metadata journaling. Also, we are wrapping the SWMR feature (for data append-only) that addresses the issue too. Bad news is that I cannot tell you the exact day of the HDF5 1.10.0 release. We are targeting the end of the year and the features should be available in the snapshots during the year, but at this point no dates have been set. Data-append only SWMR prototype will be ready for users to try in early March 2014.
Usage of a global mutex for most any routine, which can cause severe performance degradation if multiple threads are concurrently doing IO... (I don't want to hear about studies saying about how IO itself is the bigger problem... there are ram filesystems, SSDs, and different storage locations to invalidate these claims, all easy things to come by in HPC)
Currently there are no plans to address this issue.
Mediocre examples that don't really show you how to get things done or clarify all that much. Reading source is often the only way to get an answer, I've found. It makes the learning curve appear huge to newbies who I've introduced to HDF.
Please share with us your ideas how we should improve, which important features are missing, etc. We also encourage this community to share the knowledge and the code.
I love HDF as it really addresses write once, use anywhere and gives me something to deal with long term storage of custom binary data formats. I hope it continues to spread throughout the world as the defacto data storage format for pretty much anything between embedded systems to HPC in all it's spaces. I also hope in the future more things will be leaving the powerpoint slides / R&D and become production ready.
Well… Your wishes coincide with our desire
I hope you guys get everything you need to improve areas HDF is still lacking in.
Thank you! We hope too!
Elena
···
On Jan 17, 2014, at 12:17 PM, Jason Newton <nevion@gmail.com> wrote:
-Jason
On Fri, Jan 17, 2014 at 8:46 AM, Elena Pourmal <epourmal@hdfgroup.org> wrote:
Andrea,
On Jan 16, 2014, at 3:37 PM, Andrea Bedini <andrea.bedini@gmail.com> wrote:
Hi,
in a post few years ago [1] Quincey Koziol explained that the VL data is stored in a "global heap" in
the file, which is not compressed. He also mentioned that a new "fractal heap" code was being developed (which, I assume, would allow compression of VL data).
Is there any news on this front? I there a way to compress VL data?
No news. We need funding to implement compression of VL data. If any organization is willing to sponsor the feature, please contact us at info@hdfgroup.org
Thank you!
Elena
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Elena Pourmal
Director of Technical Services and Operations
The HDF Group
1800 So. Oak St., Suite 203,
Champaign, IL 61820
www.hdfgroup.org
(217)531-6112 (office)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Thanks,
Andrea
[1] http://hdf-forum.184993.n3.nabble.com/hdf-forum-Compression-in-variable-length-datasets-not-working-td194091.html
--
Andrea Bedini <andrea.bedini@gmail.com>
_______________________________________________
Hdf-forum is for HDF software users discussion.
Hdf-forum@lists.hdfgroup.org
http://mail.lists.hdfgroup.org/mailman/listinfo/hdf-forum_lists.hdfgroup.org
_______________________________________________
Hdf-forum is for HDF software users discussion.
Hdf-forum@lists.hdfgroup.org
http://mail.lists.hdfgroup.org/mailman/listinfo/hdf-forum_lists.hdfgroup.org
_______________________________________________
Hdf-forum is for HDF software users discussion.
Hdf-forum@lists.hdfgroup.org
http://mail.lists.hdfgroup.org/mailman/listinfo/hdf-forum_lists.hdfgroup.org