SSIS- Aggregate Transform
This is a Fast Forward tutorial on Aggregate Transformation in Microsoft SQL Server Integration Services.
Aggregate Transformation
The Aggregate transformation is used to perform aggregate operations/functions on groups in a dataset. The aggregate functions available are- Count, Count Distinct, Sum, Average, Minimum and Maximum. The Aggregate transformation has one input and one or more outputs. It does not support an error output.
Implementation
Here in this example we want to get the sum of the salary for each departments based on the employee data. We want to perform database equivalent of SUM(SAL) GROUP BY DEPTNO operation.
data:image/s3,"s3://crabby-images/5d8cf/5d8cf73f0a5c1f584a1b9466d7eeaa25837a6ae2" alt="Control Flow"
Here, we have EMP table as OLEDB Source. Next we use an Aggregate transform followed by Excel Destination.
data:image/s3,"s3://crabby-images/c2c29/c2c296a30c9fadc115164f58a065a5952e73994b" alt="Data Flow"
Double-click the Aggregate transform to open the editor. Next in the lower pane we select the Input Column, set Output Alias to columns, select the Operation i.e. Group By clause or any aggregate functions as below:
data:image/s3,"s3://crabby-images/2f260/2f26065865f61582d57a3d75da28233ce27bae89" alt="Aggregate Transformation Editor- Aggregations"
Below Aggregate Transformation Editor- Advanced
data:image/s3,"s3://crabby-images/03505/0350556571ac7a0f3f44c091583fc86ba2d6b6cd" alt="Aggregate Transformation Editor- Advanced"
Let's explore the Advanced Editor properties.
data:image/s3,"s3://crabby-images/bc9f5/bc9f53503b2aa4b80455b80eebec6dcb2f0a393f" alt="Aggregate Transform Advanced Editor- Component Properties"
Below Aggregate Transform Advanced Editor- Input Columns
data:image/s3,"s3://crabby-images/4acaf/4acaf8a51cb3cc5481d1d37ba9dc45b5ede8a047" alt="Aggregate Transform Advanced Editor- Input Columns"
Below Aggregate Transform Advanced Editor- Input & Output Properties
data:image/s3,"s3://crabby-images/45b82/45b828b062fa851402811df1963fb72d07fd0ce1" alt="Aggregate Transform Advanced Editor- Input & Output Properties"
Next we take a look into the Excel Destination object. Double-click to open the editor. Select the connection object from OLEDB Connection Manager browser. Select Data access mode to table or view and set the Name of the Excel sheet.
data:image/s3,"s3://crabby-images/47e84/47e84b113dd7ba9e22507d31f0e518c04f1059c3" alt="Excel Destination Editor- Connection Manager"
Next we go to the Mapping tab and map the Input to Destination columns.
data:image/s3,"s3://crabby-images/7d8ea/7d8ea7adcd8d36ea81084ce40b751964a1f47d36" alt="Excel Destination Editor- Mappings"
Below Excel Destination Editor- Error Output
data:image/s3,"s3://crabby-images/81efc/81efce463c9363d6c7acdc39717c5d8734dce925" alt="Excel Destination Editor- Error Output"
Let's take a look into the Advanced Editor of Excel Destination.
data:image/s3,"s3://crabby-images/96de8/96de8dd9263285de2f0d9729c41f3fdd03a84cec" alt="Excel Destination Advanced Editor- Connection Managers"
Below Excel Destination Advanced Editor- Component Properties
data:image/s3,"s3://crabby-images/7856c/7856c02d911153ac4eee927d11a64f2a14d79708" alt="Excel Destination Advanced Editor- Component Properties"
Below Excel Destination Advanced Editor- Column Mappings
data:image/s3,"s3://crabby-images/f0c47/f0c47d908871b974e4ea0dc720e2c0460966da8f" alt="Excel Destination Advanced Editor- Column Mappings"
Below Excel Destination Advanced Editor- Input & Output Properties
data:image/s3,"s3://crabby-images/352fa/352fa31a4675263dbbb6d9acf92c9cd0864cb345" alt="Excel Destination Advanced Editor- Input & Output Properties"
Aggregate transform aggregates and groups values in a dataset. This transformation does not pass through any columns, but creates new columns in the data flow for the data it publishes. Only the input columns to which aggregate functions apply or the input columns the transformation uses for grouping are copied to the transformation output.
NULL Handling
The Aggregate transformation handles null values in the same way as any ANSI compliant database.
- In a GROUP BY clause, nulls are treated like other column values. If the grouping column contains more than one null value, the null values are put into a single group.
- In the COUNT (column name) and COUNT (DISTINCT column name) functions, nulls are ignored and the result excludes rows that contain null values in the column.
- In the COUNT (*) function, all rows are counted, including rows with null values.
We will discuss more about the Performance considerations while working with Aggregate transformation in another article. We will discuss on the properties like- Comparison Flags, Count Distinct Scale, Count Distinct Keys, Keys scale, Number of keys, Count distinct scale, Count distinct keys, Auto extend factor etc. Stay tuned...