kedro.io.HDFLocalDataSet

class kedro.io.HDFLocalDataSet(filepath, key, load_args=None, save_args=None, version=None)[source]

HDFLocalDataSet loads and saves data to a local hdf file. The underlying functionality is supported by pandas, so it supports all allowed pandas options for loading and saving hdf files.

Example:

from kedro.io import HDFLocalDataSet
import pandas as pd

data = pd.DataFrame({'col1': [1, 2], 'col2': [4, 5],
                     'col3': [5, 6]})
data_set = HDFLocalDataSet(filepath="test.hdf",
                           key="test_hdf_key",
                           load_args=None,
                           save_args=None)
data_set.save(data)
reloaded = data_set.load()

assert data.equals(reloaded)
__init__(filepath, key, load_args=None, save_args=None, version=None)[source]

Creates a new instance of HDFLocalDataSet pointing to a concrete filepath.

Parameters:
Return type:

None

Methods

__init__(filepath, key[, load_args, …]) Creates a new instance of HDFLocalDataSet pointing to a concrete filepath.
exists() Checks whether a data set’s output already exists by calling the provided _exists() method.
from_config(name, config[, load_version, …]) Create a data set instance using the configuration provided.
load() Loads data by delegation to the provided load method.
save(data) Saves data by delegation to the provided save method.