Blog - Page 24 of 24 - The HDF Group - ensuring long-term access and usability of HDF data and supporting users of HDF technologies

HDF5 Data Compression Demystified #1

April 23, 2015

Elena Pourmal, The HDF Group What happened to my compression? One of the most powerful features of HDF5 is the ability to compress or otherwise modify, or “filter,” your data during I/O. By far, the most common user-defined filters are ones that perform data compression. As you know, there are many compression options. There are […]

Putting some Spark into HDF-EOS

April 16, 2015

…we focus on how far we can push our personal computing devices with Spark. It consists of 7,850 HDF-EOS5 files covering 27 years and totals about 120 GB. We use a driver script, which reads a dataset of interest from each file in the collection, computes per-file quantities of interest, and gathers them in a CSV file for visualization. The processing time on our reference tablet machine for 3.5 years of data using 4 logical processors was about 10 seconds.

Parallel I/O – Why, How, and Where to?

April 9, 2015

Mohamad Chaarawi, The HDF Group First in a series: parallel HDF5 What costs applications a lot of time and resources rather than doing actual computation? Slow I/O. It is well known that I/O subsystems are very slow compared to other parts of a computing system. Applications use I/O to store simulation output for future use

HDF5 for the Web – HDF Server

April 2, 2015

HDF Group has just announced “HDF Server” – a freely available service that enables remote access to HDF5 content using a RESTful API. In our scenario, using HDF Server, we upload our Monopoly simulation results to the server and then interested parties can make requests for any desired content to the server – no file size issues, no downloading entire files…

HDF5 as a zero-configuration, ad-hoc scientific database for Python

March 25, 2015

Andrew Collette, Research Scientist with IMPACT, HDF Guest Blogger “…HDF5 is that rare product which excels in two fields: archiving and sharing data according to strict standardized conventions, and also ad-hoc, highly flexible and iterative use for local data analysis. For more information on using Python together with HDF5…” An enormous amount of effort has

HDF Earth Science Program

March 20, 2015

HDF has been the primary format for NASA Earth Observing System (EOS) satellite data products for more than fifteen years. The HDF Earth Science Program supports EOS and other significant earth science monitoring activities.

HDF at the 2015 Oil & Gas High Performance Computing Workshop

March 18, 2015

The workshop program has two main tracks, one on HPC-oriented technologies that support the industry, and one on oil & gas technologies and how they can leverage HPC.

From HDF5 Datasets to Apache Spark RDDs

March 12, 2015

… HDF% and Spark: Balancing the workload among tasks is a concern in any parallel environment. However, that does not mean that all datasets have to be the same size. HDF5 can help with partial I/O: Instead of reading entire datasets, one could just read hyperslabs or other selections. Sampling is…

Welcome to our blog

March 4, 2015

Welcome, again, to the new HDF Blog. Let this be the beginning of a lively and informative dialogue.

The HDF Group – who we are

March 4, 2015

The HDF Group’s mission is to provide high quality software for managing large complex data, to provide outstanding services for users of these technologies, and to insure effective management of data throughout the data life cycle.