Pentaho:MatchUp:Options

From Melissa Data Wiki
Revision as of 00:57, 13 June 2015 by Admin (talk | contribs) (Created page with "{{PentahoMatchUpNav |MatchUpCollapse= }} {{CustomTOC}} The Options tab allows you to specify which of the available output results you want returned. [[File:PENT_MatchUp_O...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

← Data Quality Components for Pentaho

MatchUp Navigation
Overview
Tutorial
Advanced Configuration
On-Premise
MatchUp Tabs
Matchcode
Field Mapping
Options
Source Pass-Through Columns
Lookup Pass-Through Columns
Output Filter
Matchcode Editor
Matchcode List
Component List
Component Properties
Algorithms
Expression Elements
Matchcodes
Result Codes



The Options tab allows you to specify which of the available output results you want returned.



Output Columns

In addition to the source data, you will often want to output processing information about the disposition of a record. This allows you to analyze the results in a number of ways.

Result Codes
This field displays the results of the comparison, whether the record is unique or is a duplicate, was suppressed or intersected, which matchcode combination resulted in a match, etc. See MatchUp Result Codes for a list of possible Result Codes that the component can return.
Dupe Group
Each group of matching records is assigned a sequential unique group number. This field displays the group number that the record, whether unique or a member of a duplicate grouping, is in.
Dupe Count
This field displays the number of matching records in each Dupe Group.
Matchcode Key
Based on the matchcode and matchcodes component used to process the source table(s), every record has a matchkey built. It is this key, a representation of the record, that is used in deduping. This field will be populated with the key and is useful in analyzing the output results.


Lookup Options

*This section is GREYED-OUT When there is only a single upstream source connected to the MatchUp Component.

When a second upstream data source is connected to the Lookup pin, that data source will be used as a filter.

List Suppress
Source pin records that match any record from the Lookup Pin will not be returned with an Output Result Code. They will be marked as Suppressed.
List Intersect
Only Source pin records that match any record from the Lookup Pin will be returned with an Output Result Code (Unique or Has Duplicate result code). The second to nth Source records that match the Lookup record will be marked with a Duplicate Result Code. Source pin records that do not match any Lookup pin records will be returned with a Non-Intersected Result Code.
No Purge
Source pin records that match other Source pin records will not be matched. In other words, a suppressed group or an intersected group will be returned as suppressed or intersected, but each record will have their own Dupe Group number.


Golden Record

The Golden Record Selection Options allow for intelligent selection of the Golden or Master Record using different logic and algorithms. These algorithms include selecting the Golden Record according to: Latest Date, Most Complete, Data Quality Score, and Custom Built Expressions.