Difference between revisions of "SSIS:Profiler:Analysis Options"

From Melissa Data Wiki
Jump to navigation Jump to search
(Created page with "← SSIS Reference {| class="mw-collapsible" cellspacing="2" style="background-color:#f9f9f9; border:1px solid #aaaaaa; font-size:9pt; color:#0645ad; pa...")
 
Line 38: Line 38:
{{CustomTOC}}
{{CustomTOC}}


These options analyze a table's data to provide a large number of statistics at varying levels of detail. Analysis Options can have a significant impact on the processing speed. For faster processing, turn off any unnedded options.
The Analysis Options tab allows for enabling/disabling certain profiling calculations. Disabling unused Analysis Options will become beneficial due to the increase in processing time.


[[file:SSIS_PF_AnalysisOptions.png|link=]]
[[file:SSIS_PF_AnalysisOptions.png|link=]]
Line 44: Line 44:
==Analysis Options==
==Analysis Options==
;Sort Analysis
;Sort Analysis
:This is an analysis of any prevailing sortation for each profiled column.
:This is an analysis of any prevailing sortation for each profiled column. This enables/disables the sortation analysis, which can increase profiling time. This time penalty grows geometrically as more records are added. If you are not interested in this statistic, disable it to decrease your profiling time.


;MatchUp Analysis
;MatchUp Analysis
:This is an analysis of duplicate record detection.
:This is an analysis of duplicate record detection. This enables/disables duplicate record detection. Duplicate analysis increases the profiling time by under 5% and ProfileData profiling time by about 30%.


;RightFielder Analysis
;RightFielder Analysis
:This is an analysis of profiled columns' inferred data type (e.g., Full Name, Address, etc.).
:This is an analysis of profiled columns' inferred data type (e.g., Full Name, Address, etc.). This enables/disables inferred data type analysis. This analysis is responsible for the Inconsistent Data and Inferred Data Type statistics. This increases the profiling time by under 10%.  


;Data Aggregation
;Data Aggregation
:This is an analysis of aggregate data determination (e.g., averages, median, quartiles, etc.).
:This is an analysis of aggregate data determination (e.g., averages, median, quartiles, etc.). This enables/disables all forms of aggregation and value gathering. Any statistic that cannot be determined incrementally (for example, median, population standard deviation, etc.) is determined via aggregation. This analysis is also responsible for all value tables (Frequency, Pattern, SoundEx, etc.). All iterators and data aggregation statistics are dependent on this analysis. This increases profiling time by over 90%.


;Count Generation
;Count Generation
Line 60: Line 60:


==Setup Options==
==Setup Options==
These options are purely for documentation. They have no impact on profiling results.
The Setup Options are not required. They are used purely for documentation purposes and will have no impact on profiling results.
 
;Table Name
:This function sets the user name for a particular run.
 
;User Name
:This function sets the user name for a particular run.
 
;Job Name
:This function sets the job name for a particular run.
 
;Job Description
:This function sets the job description for a particular run.




[[Category:SSIS]]
[[Category:SSIS]]
[[Category:SSIS Profiler]]
[[Category:SSIS Profiler]]

Revision as of 20:10, 9 February 2015

← SSIS Reference

Profiler Navigation
Overview
Tutorial
Advanced Configuration
Profiler Tabs
Input Field
Analysis Options
Output
Result Codes
Returned Result Codes
Result Codes



The Analysis Options tab allows for enabling/disabling certain profiling calculations. Disabling unused Analysis Options will become beneficial due to the increase in processing time.

SSIS PF AnalysisOptions.png

Analysis Options

Sort Analysis
This is an analysis of any prevailing sortation for each profiled column. This enables/disables the sortation analysis, which can increase profiling time. This time penalty grows geometrically as more records are added. If you are not interested in this statistic, disable it to decrease your profiling time.
MatchUp Analysis
This is an analysis of duplicate record detection. This enables/disables duplicate record detection. Duplicate analysis increases the profiling time by under 5% and ProfileData profiling time by about 30%.
RightFielder Analysis
This is an analysis of profiled columns' inferred data type (e.g., Full Name, Address, etc.). This enables/disables inferred data type analysis. This analysis is responsible for the Inconsistent Data and Inferred Data Type statistics. This increases the profiling time by under 10%.
Data Aggregation
This is an analysis of aggregate data determination (e.g., averages, median, quartiles, etc.). This enables/disables all forms of aggregation and value gathering. Any statistic that cannot be determined incrementally (for example, median, population standard deviation, etc.) is determined via aggregation. This analysis is also responsible for all value tables (Frequency, Pattern, SoundEx, etc.). All iterators and data aggregation statistics are dependent on this analysis. This increases profiling time by over 90%.
Count Generation
This is an analysis of basic count calculation.


Setup Options

The Setup Options are not required. They are used purely for documentation purposes and will have no impact on profiling results.

Table Name
This function sets the user name for a particular run.
User Name
This function sets the user name for a particular run.
Job Name
This function sets the job name for a particular run.
Job Description
This function sets the job description for a particular run.