MapReduce Archives - The HDF Group - ensuring long-term access and usability of HDF data and supporting users of HDF technologies

From HDF5 Datasets to Apache Spark RDDs

March 12, 2015

… HDF5 and Spark: Balancing the workload among tasks is a concern in any parallel environment. However, that does not mean that all datasets have to be the same size. HDF5 can help with partial I/O: instead of reading entire datasets, one could read just hyperslabs or other selections. Sampling is…
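As a rough illustration of the partial I/O idea in the excerpt, the sketch below splits the rows of one large dataset into near-equal contiguous slabs, so that each task reads only its own slice instead of the whole dataset. The helper names split_rows and read_slab, and the path and dataset arguments, are hypothetical and not taken from the post. read_slab assumes the third-party h5py package is installed and relies on h5py's slicing syntax to read a contiguous selection along the first axis.

```python
def split_rows(n_rows, n_tasks):
    """Return (start, count) pairs that cover range(n_rows) as evenly as possible.

    Hypothetical helper: slab sizes differ by at most one row, and empty
    slabs are dropped when there are more tasks than rows.
    """
    if n_rows < 0:
        raise ValueError("n_rows must be non-negative")
    if n_tasks <= 0:
        raise ValueError("n_tasks must be positive")
    base, extra = divmod(n_rows, n_tasks)
    slabs = []
    start = 0
    for i in range(n_tasks):
        # The first `extra` tasks each take one additional row.
        count = base + (1 if i < extra else 0)
        if count:
            slabs.append((start, count))
        start += count
    return slabs


def read_slab(path, dataset, start, count):
    """Read rows [start, start + count) of one dataset (assumes h5py is installed)."""
    import h5py  # third-party; imported lazily so split_rows works without it

    with h5py.File(path, "r") as f:
        # Slicing an h5py dataset reads only the selected rows from the file.
        return f[dataset][start:start + count]


if __name__ == "__main__":
    # Ten rows shared among three tasks.
    print(split_rows(10, 3))
```

In a Spark setting, one possible arrangement is to distribute the (start, count) pairs themselves, for example with SparkContext.parallelize, and have each task call read_slab on its pair. Whether row-wise slabs are a good fit depends on the dataset's chunk layout, which this sketch does not examine.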
