I had to modify an existing SSIS package and found that the author had an Excel data source with duplicate rows in it. The destination needed unique records. The author’s solution was to save the data to a staging table and then do a select distinct from there to the ultimate destination. This is an unnecessary step. The Sort Transformation provides a setting that will eliminate duplicate rows. See the Remove Duplicate Rows checkbox in lower left hand corner in the picture below.
As can be seen below, the Sort Transformation has removed duplicate rows fro the source Excel spreadsheet.