The HDF5® Library & File Format

HDF5®

High-performance data management and storage suite

Utilize the HDF5 high performance data software library and file format to manage, process, and store your heterogeneous data. HDF5 is built for fast I/O processing and storage.

What is HDF5®?

Heterogeneous Data

HDF® supports n-dimensional datasets and each element in the dataset may itself be a complex object.

Easy Sharing

HDF® is portable, with no vendor lock-in, and is a self-describing file format, meaning everything all data and metadata can be passed along in one file.

Cross Platform

HDF® is a software library that runs on a range of computational platforms, from laptops to massively parallel systems, and implements a high-level API with C, C++, Fortran 90, and Java interfaces. HDF has a large ecosystem with 700+ Github projects.

Fast I/O

HDF® is high-performance I/O with a rich set of integrated performance features that allow for access time and storage space optimizations.

Big Data

There is no limit on the number or size of data objects in the collection, giving great flexibility for big data.

Keep Metadata with Data

HDF5® allows you to keep the metadata with the data, streamlining data lifecycles and pipelines.