SSIS:Profiler:Analysis Options: Difference between revisions

From Melissa Data Wiki
Jump to navigation Jump to search
Created page with "← SSIS Reference {| class="mw-collapsible" cellspacing="2" style="background-color:#f9f9f9; border:1px solid #aaaaaa; font-size:9pt; color:#0645ad; pa..."
 
No edit summary
 
(4 intermediate revisions by the same user not shown)
Line 1: Line 1:
[[SSIS:Reference|← SSIS Reference]]
{{SSISProfilerNav
 
|ProfilerTabsCollapse=
{| class="mw-collapsible" cellspacing="2" style="background-color:#f9f9f9; border:1px solid #aaaaaa; font-size:9pt; color:#0645ad; padding:8px 8px;"
}}
! style="text-align:left; color:black; border-bottom:1px solid #aaaaaa;"|'''Profiler Navigation'''
|-
| style="padding-right:220px;"|
|-
|[[SSIS:Profiler|Overview]]
|-
|[[SSIS:Profiler Tutorial|Tutorial]]
|-
|[[SSIS:Profiler:Advanced Configuration|Advanced Configuration]]
|-
|
{| class="mw-collapsible" style="width:100%;"
|- align="left"
! style="color:black;"|Profiler Tabs
|-
|[[SSIS:Profiler:Input Field|Input Field]]
|-
|[[SSIS:Profiler:Analysis Options|Analysis Options]]
|-
|[[SSIS:Profiler:Output|Output]]
|}
|-
|
{| class="mw-collapsible mw-collapsed" style="width:100%;"
|- align="left"
! style="color:black;"|[[SSIS:Profiler:Result Codes|Result Codes]]
|-
|[[Returned Result Codes:Enterprise Solutions#Data Quality Components for SSIS|Returned Result Codes]]
|-
|[[Result Codes]]
|}
|}




{{CustomTOC}}
{{CustomTOC}}


These options analyze a table's data to provide a large number of statistics at varying levels of detail. Analysis Options can have a significant impact on the processing speed. For faster processing, turn off any unnedded options.
The Analysis Options tab allows for enabling/disabling certain profiling calculations. Disabling unused Analysis Options will become beneficial due to the increase in processing time.


[[file:SSIS_PF_AnalysisOptions.png|link=]]
[[file:SSIS_PF_AnalysisOptions.png|link=]]
Line 44: Line 12:
==Analysis Options==
==Analysis Options==
;Sort Analysis
;Sort Analysis
:This is an analysis of any prevailing sortation for each profiled column.
:This is an analysis of any prevailing sortation for each profiled column. This enables/disables the sortation analysis, which can increase profiling time. This time penalty grows geometrically as more records are added. If you are not interested in this statistic, disable it to decrease your profiling time.


;MatchUp Analysis
;MatchUp Analysis
:This is an analysis of duplicate record detection.
:This is an analysis of duplicate record detection. This enables/disables duplicate record detection. Duplicate analysis increases the profiling time by under 5% and ProfileData profiling time by about 30%.


;RightFielder Analysis
;RightFielder Analysis
:This is an analysis of profiled columns' inferred data type (e.g., Full Name, Address, etc.).
:This is an analysis of profiled columns' inferred data type (e.g., Full Name, Address, etc.). This enables/disables inferred data type analysis. This analysis is responsible for the Inconsistent Data and Inferred Data Type statistics. This increases the profiling time by under 10%.  


;Data Aggregation
;Data Aggregation
:This is an analysis of aggregate data determination (e.g., averages, median, quartiles, etc.).
:This is an analysis of aggregate data determination (e.g., averages, median, quartiles, etc.). This enables/disables all forms of aggregation and value gathering. Any statistic that cannot be determined incrementally (for example, median, population standard deviation, etc.) is determined via aggregation. This analysis is also responsible for all value tables (Frequency, Pattern, SoundEx, etc.). All iterators and data aggregation statistics are dependent on this analysis. This increases profiling time by over 90%.
 
 
==Setup Options==
The Setup Options are not required. They are used purely for documentation purposes and will have no impact on profiling results.


;Count Generation
;Table Name
:This is an analysis of basic count calculation.
:This function sets the user name for a particular run.


;User Name
:This function sets the user name for a particular run.


==Setup Options==
;Job Name
These options are purely for documentation. They have no impact on profiling results.
:This function sets the job name for a particular run.
 
;Job Description
:This function sets the job description for a particular run.




[[Category:SSIS]]
[[Category:SSIS]]
[[Category:SSIS Profiler]]
[[Category:SSIS Profiler]]

Latest revision as of 00:32, 14 November 2015

← SSIS:Data Quality Components

Profiler Navigation
Overview
Tutorial
Advanced Configuration
Profiler Tabs
Input Field
Analysis Options
Output
Output Columns
Result Codes
Result Codes



The Analysis Options tab allows for enabling/disabling certain profiling calculations. Disabling unused Analysis Options will become beneficial due to the increase in processing time.

Analysis Options

Sort Analysis
This is an analysis of any prevailing sortation for each profiled column. This enables/disables the sortation analysis, which can increase profiling time. This time penalty grows geometrically as more records are added. If you are not interested in this statistic, disable it to decrease your profiling time.
MatchUp Analysis
This is an analysis of duplicate record detection. This enables/disables duplicate record detection. Duplicate analysis increases the profiling time by under 5% and ProfileData profiling time by about 30%.
RightFielder Analysis
This is an analysis of profiled columns' inferred data type (e.g., Full Name, Address, etc.). This enables/disables inferred data type analysis. This analysis is responsible for the Inconsistent Data and Inferred Data Type statistics. This increases the profiling time by under 10%.
Data Aggregation
This is an analysis of aggregate data determination (e.g., averages, median, quartiles, etc.). This enables/disables all forms of aggregation and value gathering. Any statistic that cannot be determined incrementally (for example, median, population standard deviation, etc.) is determined via aggregation. This analysis is also responsible for all value tables (Frequency, Pattern, SoundEx, etc.). All iterators and data aggregation statistics are dependent on this analysis. This increases profiling time by over 90%.


Setup Options

The Setup Options are not required. They are used purely for documentation purposes and will have no impact on profiling results.

Table Name
This function sets the user name for a particular run.
User Name
This function sets the user name for a particular run.
Job Name
This function sets the job name for a particular run.
Job Description
This function sets the job description for a particular run.