Assembling

fuzzy_join(): Joining tables on non-normalized categories with approximate matching. Example

Joiner, AggJoiner: transformers for joining multiple tables together. Example

Encoding

TableVectorizer: turn a pandas dataframe into a numerical array for machine learning. Example

GapEncoder: OneHotEncoder but robust to typos or non-normalized categories. Example

Cleaning

deduplicate(): merge categories of similar morphology (spelling). Example