SSIS:Fuzzy Match:Matches: Difference between revisions
Jump to navigation
Jump to search
Created page with "← SSIS Reference {| class="mw-collapsible" cellspacing="2" style="background-color:#f9f9f9; border:1px solid #aaaaaa; font-size:9pt; color:#0645ad; pa..." |
No edit summary |
||
Line 31: | Line 31: | ||
|[[SSIS:Fuzzy Match:Compare Pass-Through Columns|Compare Pass-Through Columns]] | |[[SSIS:Fuzzy Match:Compare Pass-Through Columns|Compare Pass-Through Columns]] | ||
|} | |} | ||
|- | |||
|[[Matching Algorithms]] | |||
|- | |- | ||
| | | |
Revision as of 23:58, 12 November 2014
Fuzzy Match Navigation | |||||
---|---|---|---|---|---|
Overview | |||||
Tutorial | |||||
| |||||
| |||||
Matching Algorithms | |||||
|
The Matches tab of the Fuzzy Match Component sets up the relationships between fields in the Source and Compare tables, and determines which algorithms will be used to determine the level of matches.
- Source
- Select a field from the Source input table that will be compared to a field in the Compare input table.
- Compare
- Select the matching field in the Compare table.
- Match Type
- Select the fuzzy matching algorithm to use when comparing the two fields.
- For more information on the different algorithms, see Matching Algorithms.
- Upper Threshold
- Anything above this percentage will be considered an exact match.
- Anything below this threshold but above the Lower is considered a possible match.
- Lower Threshold
- Anything below this level will be considered a non-match.
- Anything above this threshold but below the Upper is considered a possible match.
- WS
- White Space. Check this box to ignore spaces in the data as well as letter casing.
- Search
- Enter a regular expression to search for this field in every record.
- Replace
- Enter the text to replace data that matches the regular expression entered in Search. if Search is filled and this field is left blank, the Fuzzy Match Component will delete any matching text.
- Remove Current Row
- Click this button to remove the currently selected comparison from the current setup.
- N-Gram Size
- This setting determines the size of the substring used by the following algorithms:
- N-Gram
- Dice's Coefficient
- Jaccard Similarity
- Overlap Coefficient
- For more information on the different algorithms, see Matching Algorithms.