kedro.io.ParquetLocalDataSet

class kedro.io.ParquetLocalDataSet(filepath, engine='auto', load_args=None, save_args=None, version=None)

An AbstractDataSet implementation that loads and saves local Parquet files.

Example:

from kedro.io import ParquetLocalDataSet
import pandas as pd

data = pd.DataFrame({'col1': [1, 2], 'col2': [4, 5],
                     'col3': [5, 6]})
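# 'myFile' is a path relative to the current working directory
data_set = ParquetLocalDataSet('myFile')
data_set.save(data)              # write the DataFrame to disk as Parquet
loaded_data = data_set.load()    # read it back into a new DataFrame
assert data.equals(loaded_data)  # the round trip preserves the data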

__init__(filepath, engine='auto', load_args=None, save_args=None, version=None)

Creates a new instance of ParquetLocalDataSet pointing to a concrete filepath.

Parameters:
Return type: None
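
The constructor arguments can also be passed by keyword. The following is a minimal sketch, not a definitive recipe: it assumes a kedro release that still ships ParquetLocalDataSet, plus pandas and a Parquet engine (pyarrow or fastparquet). The helper name round_trip and the file name example.parquet are hypothetical, and the temporary directory is only there so the example leaves no files behind.

```python
import os
import tempfile


def round_trip(frame, filename="example.parquet"):
    """Save `frame` to a temporary Parquet file and return the reloaded copy."""
    # Imported lazily so this module can be imported even where kedro is absent.
    from kedro.io import ParquetLocalDataSet

    with tempfile.TemporaryDirectory() as tmp_dir:
        path = os.path.join(tmp_dir, filename)
        # engine='auto' is the default from the signature above; it is spelled
        # out here only to show where the argument goes.
        data_set = ParquetLocalDataSet(filepath=path, engine="auto")
        data_set.save(frame)
        return data_set.load()
```

Calling round_trip(data) with the DataFrame from the example above should return a frame for which data.equals(...) holds, provided the assumptions listed in the lead-in are met.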

Methods

__init__(filepath[, engine, load_args, …])
    Creates a new instance of ParquetLocalDataSet pointing to a concrete filepath.
exists()
    Checks whether a data set’s output already exists by calling the provided _exists() method.
from_config(name, config[, load_version, …])
    Create a data set instance using the configuration provided.
load()
    Loads data by delegation to the provided load method.
save(data)
    Saves data by delegation to the provided save method.
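
As a rough illustration of from_config() and exists(), the sketch below builds a configuration dictionary and creates a data set from it. Several things here are assumptions rather than documented behaviour: the helper parquet_config is hypothetical, the data set name "example" and the path example.parquet are placeholders, and the layout of the dictionary (a "type" key plus keys mirroring the constructor arguments) is assumed. The import is guarded because the class may be missing from the installed kedro release.

```python
def parquet_config(filepath, engine="auto", load_args=None, save_args=None):
    """Build a config dict whose keys mirror the constructor arguments (assumed layout)."""
    config = {"type": "ParquetLocalDataSet", "filepath": filepath, "engine": engine}
    if load_args is not None:
        config["load_args"] = dict(load_args)
    if save_args is not None:
        config["save_args"] = dict(save_args)
    return config


config = parquet_config("example.parquet")

try:
    from kedro.io import ParquetLocalDataSet
except ImportError:
    # kedro is not installed, or the installed release does not ship this class.
    ParquetLocalDataSet = None

if ParquetLocalDataSet is not None:
    # "example" is a hypothetical data set name.
    data_set = ParquetLocalDataSet.from_config("example", config)
    # exists() reports whether the target file is already on disk, which makes
    # it a cheap guard before calling load().
    print(data_set.exists())
```

Keeping the configuration in a plain dictionary makes it easy to inspect the settings before any file access happens.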