Row Sampling (Selected Output) Transformationin SSIS


In this article we are going to see how to use Row Sampling transformations in SSIS Packaging. Row sampling is used to randomly select some rows and move them as output to the required process as and when required to divide the rows. An example of using this transformation is say for example if we want to randomly select some 10 users of a community for a random prize then we can use this transformation. In this process we are going to see an example of how to use this process for the selected output values. Let's jump start to see this sample of how to set the properties of the control.
To follow my series of articles on SSIS, go through my profile.


Follow steps 1 to 3 on my first article to open the BIDS project and select the right project to work on an integration services project. Once the project is created, we will see how to use the Percentage sampling to see the flow. Now once the project is opened drag and drop a source and a Percentage sampling task as shown in the screen below:


Now we can see the red mark on the control which indicates that the controls are yet to be configured. Now let us configure the source (refer to my previous articles on how to configure OLEDB source). Now your output will look like before:


Now we need to configure the Row Sampling task; double click the task to open the screen as below:


Here we can specify the number of random rows that can be selected and the name for the Sample output selected and unselected. Since in our example we are going to see the Selected Output we will see how to use it. Now after specifying the number of rows just click on the OK button. Now drag and drop the Flat File Destination as shown in the screen below:


Now drag the green line from the Row Sampling to the Flat File destination task. It will open a window as shown in the screen below:


Here we have selected the Selected output as the Output to get the data as we can see in the screen above. Now click on OK to do the further configuration of the destination as shown in the screen below:


Once the configuration is done we can see the screen as below:


Now our package is ready to execute. Press F5 and execute the project; we will see the screen as below which indicates that the package has executed successfully.


We can see that only 10 rows are randomly selected and passed to the destination. We can see the output by opening the file which we gave in the destination configuration section. You can see the result as shown in the screen below:



So in this article we have seen how to use Row Sampling (Selected Output) to execute dataset and split based on the number of rows and uses it across the requirement.