Blog - Page 4 of 25 - The HDF Group - ensuring long-term access and usability of HDF data and supporting users of HDF technologies

Answering biological questions using HDF5 and physics-based simulation data

August 21, 2015

David Dotson, doctoral student, Center for Biological Physics, Arizona State University; HDF Guest Blogger Recently I had the pleasure of meeting Anthony Scopatz for the first time at SciPy 2015, and we talked shop. I was interested in his opinions on MDSynthesis, a Python package our lab has designed to help manage the complexity of raw and derived […]

HDF5 and The Big Science of Nuclear Stockpile Stewardship

August 17, 2016

DOE has continued to partner with The HDF Group, supporting development of HDF5 through two generations of computing; sponsoring this development has benefited the entire HDF5 user community. Today, DOE supports current HDF5 R&D to ensure that the data challenges of third generation exascale computing …

Worried about your unlimited data plan bills? Cut them with OPeNDAP

May 15, 2015

Large, rich and complex collections of HDF data can be filtered and viewed with the help of OPeNDAP. HDF data can be provided in manageable servings, on demand in real time, inexpensively, even on the user’s desktop or mobile device.

Python & HDF5 – A Vision

September 18, 2015

Anthony Scopatz, Assistant Professor at the University of South Carolina, HDF guest blogger “Python is great and its ecosystem for scientific computing is world class. HDF5 is amazing and is rightly the gold standard for persistence for scientific data. Many people use HDF5 from Python, and this number is only growing due to pandas’ HDFStore.

Damaris: An Asynchronous Data Aggregator Middleware in Support of HDF5 Library

June 1, 2018

Damaris is a middleware that enriches existing HPC data format libraries (e.g. HDF5) with data aggregation and asynchronous data management capabilities. At the same time, it can be employed for in situ analysis and visualization purposes.

HDF5 Data Compression Demystified #2: Performance Tuning

May 24, 2017

Internal compression is one of several powerful HDF5 features that distinguish HDF5 from other binary formats and make it very attractive for storing and organizing data. Internal HDF5 compression saves storage space and I/O bandwidth and allows efficient partial access to data. Chunk storage must be used when HDF5 compression is enabled.

ExaHyPE goes HDF5

June 14, 2017

Tobias Weinzierl, Durham University, UK, Sven Köppel, FIAS, Germany, Michael Bader, TUM, Germany, HDF Guest Bloggers ExaHyPE develops a solver engine for hyperbolic differential equations solved on adaptive Cartesian meshes. It supports various HDF5 output formats. Exascale computing is expected to allow scientists and engineers to simulate, and ultimately understand, wave phenomena with unprecedented accuracy

Multiple Independent File (MIF, aka N:M) Parallel I/O With HDF5

March 21, 2017

Mark Miller, Lawrence Livermore National Laboratory, Guest Blogger The HDF5 library has supported the I/O requirements of HPC codes at Lawrence Livermore National Labs (LLNL) since the late 90’s. In particular, HDF5 used in the Multiple Independent File (MIF) parallel I/O paradigm has supported LLNL code’s scalable I/O requirements and has recently been gainfully used

Handling (and ingesting) data streams at 500K mess/s

August 24, 2017

By Francesc Alted. He is a freelance consultant and developing author of different open source libraries like PyTables, Blosc, bcolz and numexpr and an experienced programmer in Python and C. Francesc collaborates regularly with the The HDF Group in different projects. We explain our solution for handling big data streams using HDF5 (with a little help

C++ has come a long way and there’s plenty in it for users of HDF5

July 30, 2018

A few years ago, I was looking for a data format with low latency block and stream support. While protocol buffers offered streams, it was lacking indexed block access. Soon, I realized I was looking for a container with file system-like properties. When I examined HDF5, I found it was very close to what I needed to store massive financial engineering datasets….