Usage
Basic Usage
from chunking_pandas.core import ChunkingExperiment, ChunkingStrategy, FileFormat
# Create an experiment
experiment = ChunkingExperiment(
"input.csv",
"output.csv",
n_chunks=3,
chunking_strategy="rows"
)
Web Interface
To run the web interface programatically:
from chunking_pandas import launch_interface
launch_interface()
Or to run it as a console script:
chunking-interface
Chunking Strategies
The package supports several chunking strategies:
rows: Split data by rows
columns: Split data by columns
tokens: Split data by approximate token count
blocks: Split data into block matrices (for 2D data)
parallel rows: Split data by parallel strategies
parallel columns: Split data by parallel strategies
none: Keep data as single chunk
File Formats
Supported file formats:
CSV
JSON
Parquet
NumPy arrays (.npy)