Aggregate Nodes

Aggregate nodes help you summarize and analyze your data by grouping and calculating statistics. These nodes are essential for creating summaries and transforming data structure.

Node Details

Group By

The Group By node aggregates data based on selected columns, allowing calculations such as sums, averages, counts, and more.

Key Features

Group by one or more columns
Apply aggregation functions to other columns
Rename output columns

Usage

Select one or more columns to group by.
Choose aggregation functions for other columns.
Set custom output column names if needed.

Configuration Options

Parameter	Description
Group By Columns	Columns used to define groups.
Aggregations	Functions like `sum`, `count`, `avg`, `min`, `max`.
Output Column Name	Custom name for the aggregated result (optional).

This node is essential for summarizing datasets and preparing structured outputs.

Pivot Data

The Pivot Data node converts data from a long format to a wide format by creating new columns based on unique values in a pivot column.

Key Features

Transform long format into wide format
Select multiple index columns
Aggregate values during pivoting

Usage

Select index columns to retain in the final output.
Choose a pivot column whose unique values will become new columns.
Select a value column containing the data to fill the new columns.
Apply aggregation functions (e.g., sum, count, avg) if needed.

Configuration Options

Parameter	Description
Index Columns	Columns that define the groups in the final table.
Pivot Column	Unique values from this column become new column names.
Value Column	The column containing values to be placed in the new columns.
Aggregations	Functions applied when multiple values exist per pivot column entry.

This node is useful for restructuring datasets into a summary-friendly format.

Unpivot Data

The Unpivot Data node transforms data from wide format to long format, making it easier for analysis and reporting.

Key Features

Convert multiple columns into key-value pairs
Select index columns to retain
Use dynamic data type selection

Usage

Select index columns to keep unchanged.
Choose value columns to transform into key-value pairs.
(Optional) Enable dynamic data type selection to filter columns automatically.

Configuration Options

Parameter	Description
Index Columns	Columns that remain unchanged in the final structure.
Value Columns	Columns that will be unpivoted into key-value pairs.
Data Type Selector	Automatically select columns based on data type (e.g., `string`).
Selection Mode	Choose between `column` or `data_type` for unpivot selection.

This node helps in restructuring datasets, especially when working with reporting or analytical tools.

Count Records

The Count Records node calculates the total number of rows in the dataset.

Key Features

Simple row count operation
No configuration required
Adds a new column number_of_records

Usage

Add the Count Records node to your workflow.
It will automatically count the total number of rows.

Configuration Options

This node has no additional settings—it simply returns the record count.

This transformation is useful for quick dataset validation and workflow monitoring.