Usage

Basic Usage

from chunking_pandas.core import ChunkingExperiment, ChunkingStrategy, FileFormat

# Create an experiment
experiment = ChunkingExperiment(
    "input.csv",
    "output.csv",
    n_chunks=3,
    chunking_strategy="rows"
)

Web Interface

To run the web interface programmatically:

from chunking_pandas import launch_interface
launch_interface()

Or to run it as a console script:

chunking-interface

Chunking Strategies

The package supports several chunking strategies:

  • rows: Split data by rows

  • columns: Split data by columns

  • tokens: Split data by approximate token count

  • blocks: Split data into block matrices (for 2D data)

  • parallel rows: Split data by rows using a parallel strategy

  • parallel columns: Split data by columns using a parallel strategy

  • none: Keep data as a single chunk
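
To make the difference between the rows, columns, and blocks strategies concrete, here is a minimal sketch that uses only pandas and NumPy. It does not show how the package implements these strategies internally; the helper names split_rows, split_columns, and split_blocks are hypothetical and exist only for this illustration. The tokens and parallel strategies are not covered here.

```python
import numpy as np
import pandas as pd


def split_rows(df: pd.DataFrame, n_chunks: int) -> list:
    """Hypothetical helper: split a DataFrame into contiguous row groups."""
    # np.array_split tolerates lengths that are not divisible by n_chunks.
    groups = np.array_split(np.arange(len(df)), n_chunks)
    return [df.iloc[idx] for idx in groups]


def split_columns(df: pd.DataFrame, n_chunks: int) -> list:
    """Hypothetical helper: split a DataFrame into contiguous column groups."""
    groups = np.array_split(np.arange(df.shape[1]), n_chunks)
    return [df.iloc[:, idx] for idx in groups]


def split_blocks(df: pd.DataFrame, n_row_chunks: int, n_col_chunks: int) -> list:
    """Hypothetical helper: split into a grid of blocks, rows first, then columns."""
    return [
        block
        for row_chunk in split_rows(df, n_row_chunks)
        for block in split_columns(row_chunk, n_col_chunks)
    ]


# Small 10 x 4 example table.
df = pd.DataFrame(np.arange(40).reshape(10, 4), columns=list("abcd"))

row_chunks = split_rows(df, 3)        # three chunks covering all 10 rows
column_chunks = split_columns(df, 2)  # two chunks of two columns each
blocks = split_blocks(df, 2, 2)       # a 2 x 2 grid of blocks
```

Concatenating the row chunks in order reproduces the original table, which is a quick sanity check for any row-based split.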

File Formats

Supported file formats:

  • CSV

  • JSON

  • Parquet

  • NumPy arrays (.npy)
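
As a rough illustration of what handling these four formats involves, the sketch below loads a file into a DataFrame based on its extension, using standard pandas and NumPy readers. The function load_table is a hypothetical name for this example and is not part of the package; how the package itself detects and reads formats may differ.

```python
from pathlib import Path

import numpy as np
import pandas as pd


def load_table(path: str) -> pd.DataFrame:
    """Hypothetical helper: load a supported file into a DataFrame by extension."""
    suffix = Path(path).suffix.lower()
    if suffix == ".csv":
        return pd.read_csv(path)
    if suffix == ".json":
        return pd.read_json(path)
    if suffix == ".parquet":
        # Needs an optional engine such as pyarrow or fastparquet installed.
        return pd.read_parquet(path)
    if suffix == ".npy":
        # Wrap the array so all formats yield the same type (1D or 2D arrays).
        return pd.DataFrame(np.load(path))
    raise ValueError(f"Unsupported file format: {suffix!r}")
```

Dispatching on the extension keeps the example short; a more careful version might inspect the file contents instead of trusting the name.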