Skip to content

ths_defrag

Console script for compacting THS parquet datasets

ths_ds_defrag

Compact and repartition the dataset.

Can be used on both realisation and aggregate datasets.

Arguments:

SOURCE: path to the source (folder OR S3 URI).

TARGET: path to the target (folder OR S3 URI).

Usage:

ths_ds_defrag [OPTIONS] SOURCE TARGET

Options:

  -p, --parts TEXT  comma-separated list of partition keys for the target DS
  -v, --verbose
  --help            Show this message and exit.