Description
This component groups features that share the same value in a chosen column and calculates summary statistics for numeric and text columns within each group. The result is one feature per unique group value, with a new attribute for each statistic.
Inputs
Input dataset [Dataset]Group by column [Column]— the column used to form groupsColumns to analyze [Column]— numeric or text columns to calculate statistics for; leave empty to include all columnsNumeric statistics [Selection]— one or more of: mean, sum, min, max, median, std, unique_count, mode; applied only to numeric columnsText statistics [Selection]— one or more of: list_unique, unique_count, mode; applied only to text columnsText sort order [Selection]— controls how values inside_list_uniqueoutput columns are ordered: source (original record order), alphabetical_asc, alphabetical_desc, or frequency_descInclude count [Boolean]— whether to add a column showing the number of features in each group
Outputs
Result dataset [Dataset]
Use Cases
- Summarising sales figures by region or product category
- Calculating average sensor readings grouped by device ID
- Counting the number of incidents per municipality
- Collecting all distinct tags or categories assigned to records within each group