HDF5 on Apache beam


#1

Hello,

I am working with the Google Cloud Apache Beam team to process HDF5 reader. I am working with bioinformatics data that is save as H5 file.
The developers of Apache Beam are happy to help.
Could you please let me know what would be the right way to proceed.

Bet regards,
Eila

Following is the email of the google team:

HDF group, nice to meet you. I’m happy to help write
a HDF5 source for Apache Beam Python SDK. I have mentioned few links below
that might be useful. Please let me know if you have any questions.
Documentation for Apache Beam: https://beam.apache.org/documentation/
Documentation for Apache Beam Python SDK:
https://beam.apache.org/documentation/sdks/python/
A document on writing new sources and sinks for Python SDK:
https://beam.apache.org/documentation/sdks/python-custom-io/
Apache Beam contribution guide:
https://beam.apache.org/contribute/contribution-guide/


#2

Hi Eila

I reached out to you directly to discuss.

Kind Regards,
Dax