HDF5 Data Compression Demystified #1

Elena Pourmal, The HDF Group. What happened to my compression? One of the most powerful features of HDF5 is the ability to compress, or otherwise modify ("filter"), your data during I/O. By far the most common user-defined filters are those that perform data compression. As you know, there are many compression options. There are…
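
As a rough taste of what the post covers, here is a minimal sketch of enabling a compression filter from Python with h5py; the file name, dataset name, and gzip level are invented for illustration:

```python
# Minimal sketch: requesting a compression filter at dataset creation.
# "demo.h5", "pressure", and the chunk shape are assumptions, not from the post.
import h5py
import numpy as np

data = np.random.rand(1000, 1000)

with h5py.File("demo.h5", "w") as f:
    f.create_dataset(
        "pressure",
        data=data,
        chunks=(100, 100),      # filters require chunked storage
        compression="gzip",     # the filter applied to each chunk during I/O
        compression_opts=4,     # gzip level; a common speed/size trade-off
    )

with h5py.File("demo.h5", "r") as f:
    print(f["pressure"].compression)  # prints "gzip" if the filter took effect
```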

Putting some Spark into HDF-EOS

…we focus on how far we can push our personal computing devices with Spark. The collection consists of 7,850 HDF-EOS5 files covering 27 years and totaling about 120 GB. We use a driver script that reads a dataset of interest from each file in the collection, computes per-file quantities of interest, and gathers them into a CSV file for visualization. The processing time on our reference tablet for 3.5 years of data using 4 logical processors was about 10 seconds.
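
The excerpt does not reproduce the driver script, but a hedged sketch of that read/compute/gather pattern in PySpark might look like the following; the file glob, dataset path, and choice of statistic are all assumptions:

```python
# Hypothetical sketch of a per-file read/compute/gather driver with Spark.
# The glob pattern and HDF-EOS5 dataset path below are invented.
import glob
import h5py
import numpy as np
from pyspark import SparkContext

def per_file_stat(path):
    # Each Spark task opens one HDF-EOS5 file and reduces it to a single row.
    with h5py.File(path, "r") as f:
        values = f["/HDFEOS/GRIDS/Grid/Data Fields/Temperature"][...]
    return (path, float(np.nanmean(values)))

sc = SparkContext(appName="hdfeos-summary")
files = sorted(glob.glob("data/*.he5"))
# numSlices=4 mirrors the 4 logical processors mentioned in the post.
rows = sc.parallelize(files, numSlices=4).map(per_file_stat).collect()

with open("summary.csv", "w") as out:
    out.write("file,mean_temperature\n")
    for path, mean in rows:
        out.write(f"{path},{mean}\n")
```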

Parallel I/O – Why, How, and Where to?

Mohamad Chaarawi, The HDF Group. First in a series on parallel HDF5. What costs applications a great deal of time and resources that could go toward actual computation? Slow I/O. It is well known that I/O subsystems are very slow compared to other parts of a computing system. Applications use I/O to store simulation output for future use…

HDF5 as a zero-configuration, ad-hoc scientific database for Python

Andrew Collette, Research Scientist with IMPACT, HDF Guest Blogger. "…HDF5 is that rare product which excels in two fields: archiving and sharing data according to strict standardized conventions, and also ad-hoc, highly flexible and iterative use for local data analysis. For more information on using Python together with HDF5…" An enormous amount of effort has…
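
As a small illustration of the "zero-configuration, ad-hoc" style the post describes, reading and growing an HDF5 file from Python via h5py needs nothing beyond the library itself; the file, group, and attribute names here are invented:

```python
# Sketch of ad-hoc, iterative use from Python with h5py.
# "analysis.h5", "run_001", and the attribute names are illustrative only.
import h5py
import numpy as np

with h5py.File("analysis.h5", "a") as f:          # creates the file if absent
    run = f.require_group("run_001")              # idempotent: reuses the group
    run.attrs["instrument"] = "spectrometer_A"    # self-describing metadata
    if "trace" not in run:
        run.create_dataset("trace", data=np.random.rand(4096))
    print(run["trace"][:10])                      # slice without loading it all
```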

From HDF5 Datasets to Apache Spark RDDs

… HDF5 and Spark: Balancing the workload among tasks is a concern in any parallel environment. However, that does not mean that all datasets have to be the same size. HDF5 can help with partial I/O: instead of reading entire datasets, one can read just hyperslabs or other selections. Sampling is…
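
To make the partial-I/O point concrete: in h5py, slicing a dataset issues a hyperslab selection, so each task can pull only its piece. A minimal sketch, with the file and dataset names assumed for illustration:

```python
# Sketch of partial I/O: read a hyperslab rather than the whole dataset.
# "big.h5" and "/measurements" are invented for illustration.
import h5py

with h5py.File("big.h5", "r") as f:
    dset = f["/measurements"]        # e.g. shape (1_000_000, 256)
    # Only these rows travel from disk to memory.
    block = dset[10_000:20_000, :]
    # Strided selections also work, which is handy for sampling.
    sample = dset[::1000, 0]

print(block.shape, sample.shape)
```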
