Releases · X-DataInitiative/SCALPEL-Flattening

The main feature is the ability to join the tables month by month, which avoids memory problems.
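To illustrate why joining month by month helps with memory, here is a minimal plain-Python sketch, not the project's Spark implementation. The function name `join_by_month`, the row-as-dict representation and the choice of an inner join are all assumptions made for this example. Each monthly slice is joined and handed back on its own, so a caller can write it out before the next slice is built.

```python
from collections import defaultdict
from datetime import date


def join_by_month(central, other, key, date_col):
    """Yield ((year, month), joined_rows) one month at a time.

    Hypothetical helper: `central` and `other` are lists of dicts,
    `key` is the join column and `date_col` holds datetime.date values.
    """
    # Bucket the central table by the (year, month) of its date column.
    buckets = defaultdict(list)
    for row in central:
        d = row[date_col]
        buckets[(d.year, d.month)].append(row)

    # Index the other table by join key once, so each slice is a lookup.
    other_index = defaultdict(list)
    for row in other:
        other_index[row[key]].append(row)

    # Join one monthly slice at a time (inner join, by assumption).
    for period in sorted(buckets):
        joined = []
        for left in buckets[period]:
            for right in other_index.get(left[key], []):
                joined.append({**right, **left})
        yield period, joined
```

A caller would iterate over the generator and persist each monthly result before requesting the next one, so only one slice of the joined output is held at a time.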
The main changes:
• When converting tables from csv to parquet, add the possibility to partition by a column
• Add joinByYearAndMonth method in order to partition by month
• Add some config parameters:
-- partition_column: column used to partition the single table (optional)
-- monthly_partition: yes or no, whether to join by month
• Change sameAs definition so that two dataframes that have different column ordering are considered the same
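The idea behind a column-order-insensitive comparison can be sketched in plain Python as follows. This is an illustration only, not the project's `sameAs` code: the function name `same_as`, the representation of a table as a list of column names plus a list of row tuples, and the decision to also ignore row order are assumptions made for this example.

```python
from collections import Counter


def same_as(cols_a, rows_a, cols_b, rows_b):
    """Return True if two tables hold the same data up to column order.

    Hypothetical helper: column names are assumed unique and cell
    values hashable. Row order is ignored as well (an assumption).
    """
    # The two tables must expose the same column names.
    if sorted(cols_a) != sorted(cols_b):
        return False

    # Use a canonical column order to realign every row.
    order = sorted(cols_a)

    def normalise(cols, rows):
        idx = [cols.index(c) for c in order]
        # A Counter keeps duplicate rows while ignoring row order.
        return Counter(tuple(row[i] for i in idx) for row in rows)

    return normalise(cols_a, rows_a) == normalise(cols_b, rows_b)
```

For instance, a table with columns `["a", "b"]` and the same table stored with columns `["b", "a"]` compare as equal once every row is realigned to the canonical order.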

01 Jun 13:25

danielpes

fall-validation

7ff731c

Fall data flattening validation

Ran at the CNAM on 29/05/2017

Releases: X-DataInitiative/SCALPEL-Flattening

First public release

performance improvement

integration-pureconfig-release 1.1

Flattening by month

Fall data flattening validation