SSIS:Fuzzy Match:Matches

From Melissa Data Wiki
Jump to navigation Jump to search

← SSIS:Data Quality Components

Fuzzy Match Navigation
Overview
Tutorial
Advanced Configuration
On-Premise
Fuzzy Match Tabs
Matches
Advanced Options
Source Pass-Through Columns
Compare Pass-Through Columns
Matching Algorithms
Result Codes
Result Codes



The Matches tab of the Fuzzy Match Component sets up the relationships between fields in the Source and Compare tables, and determines which algorithms will be used to determine the level of matches.

SSIS FM Matches.png

Source
Select a field from the Source input table that will be compared to a field in the Compare input table.
Compare
Select the matching field in the Compare table.
Match Type
Select the fuzzy matching algorithm to use when comparing the two fields.
For more information on the different algorithms, see Matching Algorithms.
Upper Threshold
Anything above this percentage will be considered an exact match.
Anything below this threshold but above the Lower is considered a possible match.
Lower Threshold
Anything below this level will be considered a non-match.
Anything above this threshold but below the Upper is considered a possible match.
WS
White Space. Check this box to ignore spaces in the data as well as letter casing.
Search
Enter a regular expression to search for this field in every record.
Replace
Enter the text to replace data that matches the regular expression entered in Search. if Search is filled and this field is left blank, the Fuzzy Match Component will delete any matching text.


Remove Current Row
Click this button to remove the currently selected comparison from the current setup.
N-Gram Size
This setting determines the size of the substring used by the following algorithms:
  • N-Gram
  • Dice's Coefficient
  • Jaccard Similarity
  • Overlap Coefficient
For more information on the different algorithms, see Matching Algorithms.