sktime.datasets.base.load_UCR_UEA_dataset

sktime.datasets.base.load_UCR_UEA_dataset(name, split=None, return_X_y=False, extract_path=None)

Load a dataset from the UCR/UEA time series classification repository. The dataset is downloaded and extracted if it has not already been downloaded.

Parameters
  • name (str) – Name of the dataset to load

  • split (None or str{"train", "test"}, optional (default=None)) – Whether to load the train or test partition of the problem. If None, both partitions are loaded.

  • return_X_y (bool, optional (default=False)) – If True, returns (features, target) separately instead of a single dataframe with columns for features and the target.

  • extract_path (str, optional (default=None)) – Path to which the dataset is extracted. If None, the default extract path sktime/datasets/data/ is used.

Returns

  • X (pandas DataFrame with m rows and c columns) – The time series data for the problem with m cases and c dimensions

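  • y (numpy array) – The class labels for each case in X. Returned as a separate object only if return_X_y is True; otherwise the labels are included as a column of the returned dataframe.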
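
Examples

The parameters above can be combined to fetch the train and test partitions separately. The following is a minimal sketch, not part of sktime: the helper name load_train_test and its loader argument are hypothetical and exist only so the helper can be exercised without sktime installed or network access. The dataset name "ArrowHead" in the commented call is assumed to be a valid name in the repository.

```python
def load_train_test(name, extract_path=None, loader=None):
    """Load the train and test partitions of a UCR/UEA problem separately.

    `loader` is a hypothetical injection point (not an sktime parameter).
    When it is None, sktime's load_UCR_UEA_dataset is used.
    """
    if loader is None:
        # Imported lazily so that sktime is only required for real loading.
        from sktime.datasets.base import load_UCR_UEA_dataset as loader

    # return_X_y=True asks for (features, target) as two separate objects.
    X_train, y_train = loader(
        name, split="train", return_X_y=True, extract_path=extract_path
    )
    X_test, y_test = loader(
        name, split="test", return_X_y=True, extract_path=extract_path
    )
    return (X_train, y_train), (X_test, y_test)


# Example call (requires sktime; the first call may download the data).
# "ArrowHead" is an assumed dataset name.
# (X_train, y_train), (X_test, y_test) = load_train_test("ArrowHead")
```

Leaving extract_path as None keeps the default location, sktime/datasets/data/. Passing a writable directory instead may be preferable when the installed package directory is read-only.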