Description
This component groups features that share the same value in a chosen column and calculates summary statistics (mean, sum, min, max, median, standard deviation, unique count, mode) for numeric columns within each group. The result is one feature per unique group value, with a new attribute for each statistic.
Inputs
Input dataset [Dataset]Group by column [Column]— the column used to form groupsColumns to analyze [Column]— (optional) numeric columns to calculate statistics for; leave empty to include all numeric columnsStatistics [Selection]— one or more of: mean, sum, min, max, median, std, unique_count, modeInclude count [Boolean]— whether to add a column showing the number of features in each group
Outputs
Result dataset [Dataset]
Use Cases
- Summarising sales figures by region or product category
- Calculating average sensor readings grouped by device ID
- Counting the number of incidents per municipality