User Forum – We Want to Hear from You!

David Pearah, The HDF Group

Hello again, HDF User Community! As I mentioned in my last blog post, HDF: The Next 30 Years (Part 2), we’re looking for ways to better engage our users, which includes providing better tools for you to get support from the HDF Community. We are looking for your input on three things: […]

Community development projects from The HDF Group

We are pleased to announce the launch of HDF Group’s new Support Portal and the HDF Forum. The new Support Portal, located at https://portal.hdfgroup.org, is the new home of materials previously found at support.hdfgroup.org. The old support site will remain online but will no longer be updated. The new HDF Forum can be found at

Citations for HDF Data and Software

The topic of software citation has been discussed in many forums recently, and several major discovery repositories (e.g. Zenodo and DataCite) support metadata for software in addition to datasets and other resource types. HDF5 straddles the boundary between the dataset and software worlds. It is most commonly thought of and referred to as a data format but, in any case, data written in the HDF formats cannot be read without HDF software. So the answer to the question "Is it a format or is it software?" is clearly: both.

The HDF Group will launch their new website Tuesday, October 17th

Greetings! The HDF Group is pleased to launch our new website on the evening of Tuesday, October 17th. Our new site features a new design, a new logo, and other improvements—all based on feedback from our users. You’ll find improved and simplified navigation with a better layout focused on the needs of our customers and

The HDF Group welcomes Ann Johnson as Director of Engineering

The HDF Group is pleased to announce Ann Johnson has joined as the new Director of Engineering, reporting to David Pearah, CEO. Ann was most recently the Vice President of Engineering at Reservoir Labs, responsible for global engineering operations, personnel, and project management. Prior to Reservoir Labs, Ann held several executive management positions at SiCortex,

BioSimulations: a platform for sharing and reusing biological simulations

The group at BioSimulations.org has been doing some very interesting work using HSDS on Kubernetes to store biomodelling data and visualizing the results using Vega, as described in the paper below. BioSimulations chose HSDS for its support for very large data sets, its REST API (for use with web applications), and its ability to run on Google Cloud as well as in on-premises installations.

The GFED Analysis Tool – An HDF Server Implementation

The HDF Server allows producers of complex datasets to share their results with a wide audience base. We used it to develop the Global Fire Emissions Database (GFED) Analysis Tool, a website which guides the user through our dataset. A simple webmap interface allows users to select an area of interest and produce data visualization charts.

Speed up cloud access using multiprocessing!

Accessing large data stores over the internet can be rather slow, but often you can speed things up using multiprocessing—i.e. running multiple processes that divvy up the work needed. Even if you run more processes than you have cores on your computer, since much of the time each process will be waiting on data, in many cases you’ll find things speed up nicely.
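As a rough illustration of the idea, here is a minimal Python sketch using the standard library's `multiprocessing.Pool`. The `fetch_chunk` function is a hypothetical stand-in that only sleeps to mimic network latency; it is not an HDF or HSDS call, and the chunk count and worker count are arbitrary assumptions chosen for the demonstration.

```python
import multiprocessing as mp
import time


def fetch_chunk(index):
    """Hypothetical stand-in for a slow remote read.

    A real version would request one slice of the data over the network;
    here we only sleep to mimic the time spent waiting on a response.
    """
    time.sleep(0.05)
    return index * index


def fetch_all(indices, processes):
    """Divide the chunk indices among a pool of worker processes.

    pool.map returns the results in the same order as the inputs.
    """
    with mp.Pool(processes=processes) as pool:
        return pool.map(fetch_chunk, indices)


if __name__ == "__main__":
    chunks = list(range(32))

    start = time.perf_counter()
    serial = [fetch_chunk(i) for i in chunks]
    serial_time = time.perf_counter() - start

    # Using more workers than cores is reasonable here because each
    # worker spends most of its time waiting rather than computing.
    workers = 4 * mp.cpu_count()
    start = time.perf_counter()
    parallel = fetch_all(chunks, workers)
    parallel_time = time.perf_counter() - start

    assert parallel == serial
    print(f"serial: {serial_time:.2f}s, {workers} processes: {parallel_time:.2f}s")
```

How much this helps in practice depends on the latency of each request, the cost of starting the worker processes, and any limits the remote service places on concurrent requests.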

Large wind dataset now available via HDF Cloud

50 TB of Wind Integration National Dataset (WIND) toolkit data is now available to anyone via HDF Cloud, thanks to the collaboration between John Readey, Sr. Architect at The HDF Group, and NREL (the National Renewable Energy Laboratory). Access the data now with a Jupyter Notebook or through the interactive web-based visualization tool. If you want
