Difference between revisions of "SSIS:Fuzzy Match:Matches"

From Melissa Data Wiki
Jump to navigation Jump to search
 
(One intermediate revision by the same user not shown)
Line 1: Line 1:
[[SSIS:Reference|← SSIS Reference]]
{{SSISFuzzyMatchNav
 
|FuzzyMatchTabsCollapse=
{| class="mw-collapsible" cellspacing="2" style="background-color:#f9f9f9; border:1px solid #aaaaaa; font-size:9pt; color:#0645ad; padding:8px 8px;"
}}
! style="text-align:left; color:black; border-bottom:1px solid #aaaaaa;"|'''Fuzzy Match Navigation'''
|-
| style="padding-right:220px;"|
|-
|[[SSIS:Fuzzy Match|Overview]]
|-
|[[SSIS:Fuzzy Match Tutorial|Tutorial]]
|-
|
{| class="mw-collapsible mw-collapsed" style="width:100%;"
|- align="left"
! style="color:black;"|[[SSIS:Fuzzy Match:Advanced Configuration|Advanced Configuration]]
|-
|[[SSIS:Fuzzy Match:Advanced Configuration#On-Premise|On-Premise]]
|}
|-
|
{| class="mw-collapsible" style="width:100%;"
|- align="left"
! style="color:black;"|Fuzzy Match Tabs
|-
|[[SSIS:Fuzzy Match:Matches|Matches]]
|-
|[[SSIS:Fuzzy Match:Advanced Options|Advanced Options]]
|-
|[[SSIS:Fuzzy Match:Source Pass-Through Columns|Source Pass-Through Columns]]
|-
|[[SSIS:Fuzzy Match:Compare Pass-Through Columns|Compare Pass-Through Columns]]
|}
|-
|[[SSIS:Fuzzy Match:Matching Algorithms|Matching Algorithms]]
|-
|
{| class="mw-collapsible mw-collapsed" style="width:100%;"
|- align="left"
! style="color:black;"|[[SSIS:Fuzzy Match:Result Codes|Result Codes]]
|-
|[[Returned Result Codes:Enterprise Solutions#Data Quality Components for SSIS|Returned Result Codes]]
|-
|[[Result Codes]]
|}
|}


{{CustomTOC}}
{{CustomTOC}}
Line 59: Line 17:
;Match Type
;Match Type
:Select the fuzzy matching algorithm to use when comparing the two fields.
:Select the fuzzy matching algorithm to use when comparing the two fields.
:For more information on the different algorithms, see [[Matching Algorithms]].
:For more information on the different algorithms, see [[SSIS:Fuzzy Match:Matching Algorithms|Matching Algorithms]].


;Upper Threshold
;Upper Threshold
Line 89: Line 47:
:*Overlap Coefficient
:*Overlap Coefficient


:For more information on the different algorithms, see [[Matching Algorithms]].
:For more information on the different algorithms, see [[SSIS:Fuzzy Match:Matching Algorithms|Matching Algorithms]].




[[Category:SSIS]]
[[Category:SSIS]]
[[Category:SSIS Fuzzy Match]]
[[Category:SSIS Fuzzy Match]]

Latest revision as of 00:08, 14 November 2015

← SSIS:Data Quality Components

Fuzzy Match Navigation
Overview
Tutorial
Advanced Configuration
On-Premise
Fuzzy Match Tabs
Matches
Advanced Options
Source Pass-Through Columns
Compare Pass-Through Columns
Matching Algorithms
Result Codes
Result Codes



The Matches tab of the Fuzzy Match Component sets up the relationships between fields in the Source and Compare tables, and determines which algorithms will be used to determine the level of matches.

SSIS FM Matches.png

Source
Select a field from the Source input table that will be compared to a field in the Compare input table.
Compare
Select the matching field in the Compare table.
Match Type
Select the fuzzy matching algorithm to use when comparing the two fields.
For more information on the different algorithms, see Matching Algorithms.
Upper Threshold
Anything above this percentage will be considered an exact match.
Anything below this threshold but above the Lower is considered a possible match.
Lower Threshold
Anything below this level will be considered a non-match.
Anything above this threshold but below the Upper is considered a possible match.
WS
White Space. Check this box to ignore spaces in the data as well as letter casing.
Search
Enter a regular expression to search for this field in every record.
Replace
Enter the text to replace data that matches the regular expression entered in Search. if Search is filled and this field is left blank, the Fuzzy Match Component will delete any matching text.


Remove Current Row
Click this button to remove the currently selected comparison from the current setup.
N-Gram Size
This setting determines the size of the substring used by the following algorithms:
  • N-Gram
  • Dice's Coefficient
  • Jaccard Similarity
  • Overlap Coefficient
For more information on the different algorithms, see Matching Algorithms.