Difference between revisions of "Tutorial:Data quality project using Address Check"

From Melissa Data Wiki
Jump to navigation Jump to search
Line 26: Line 26:
image.
image.


#Select the '''Create a domain''' icon.
<ol>
<li>Select the '''Create a domain''' icon.</li>


image.small
image.small


#Enter a '''Domain Name''' and '''Description''' for the new domain.
<li>Enter a '''Domain Name''' and '''Description''' for the new domain.</li>
#Click '''OK'''.
<li>Click '''OK'''.</li>
#Make as many new domains as you need.
<li>Make as many new domains as you need.</li>
   
   
In order to group the domains you create a composite domain:
In order to group the domains you create a composite domain:
   
   
#Click the '''Create a composite domain''' icon.
<li>Click the '''Create a composite domain''' icon.</li>


image.small
image.small


#Enter a '''Composite Domain Name''' and '''Description'''.
<li>Enter a '''Composite Domain Name''' and '''Description'''.</li>
#Select which domains you want to add to the composite domain by selecting the domain in the '''Domains List''', then clicking the arrow to move the domain to '''Domains in Composite Domain'''.
<li>Select which domains you want to add to the composite domain by selecting the domain in the '''Domains List''', then clicking the arrow to move the domain to '''Domains in Composite Domain'''.</li>
#Click '''OK'''.
<li>Click '''OK'''.</li>
#Make as many new composite domains as you need.
<li>Make as many new composite domains as you need.</li>
#Under each composite domain, select the Reference Data tab.
<li>Under each composite domain, select the Reference Data tab.</li>
#Click '''Browse''' and check the check box next to '''Melissa Data - Address Check'''.
<li>Click '''Browse''' and check the check box next to '''Melissa Data - Address Check'''.</li>
#Under the '''Melissa Data - Address Check''' section, map the '''RDS Schema''' to the '''Domains''' you created and placed under this Composite Domain.
<li>Under the '''Melissa Data - Address Check''' section, map the '''RDS Schema''' to the '''Domains''' you created and placed under this Composite Domain.</li>
::*For example: Map AddressLine (M) to Address.
::*For example: Map AddressLine (M) to Address.
#Click '''Add Schema Entry''' to map additional schema to domains.
<li>Click '''Add Schema Entry''' to map additional schema to domains.</li>
#When you are done mapping the schema, click '''OK'''.
<li>When you are done mapping the schema, click '''OK'''.</li>
</ol>


===Providers settings===
===Providers settings===

Revision as of 17:07, 13 July 2012

Data quality project using Address Check

Getting Started

Data Quality Services uses a knowledge base to compare against a data set you provide to run a data quality project. So you must first set up a knowledge base and then run a data quality project.

image.

Knowledge base management

The knowledge base is what you will compare your data to, so it is important to note that your data cleansing will only be as good as your knowledge base.

Setting up a new knowledge base

image.

  1. Select New Knowledge Base.
  2. Enter a Name and Description for the New Knowledge Base.
  • Ensure that Domain Management is selected.
  1. Click Next.
  • Wait a few moments for the new knowledge base to be created.

Domain management

A domain is a named data set, and a composite domain is a grouping of domains. For instance, as a domain you can have: Address, City, State, and Zip. Then for a composite domain you group these four domains under Address Record.

Setting up new domains

image.

  1. Select the Create a domain icon.
  2. image.small
  3. Enter a Domain Name and Description for the new domain.
  4. Click OK.
  5. Make as many new domains as you need.
  6. In order to group the domains you create a composite domain:
  7. Click the Create a composite domain icon.
  8. image.small
  9. Enter a Composite Domain Name and Description.
  10. Select which domains you want to add to the composite domain by selecting the domain in the Domains List, then clicking the arrow to move the domain to Domains in Composite Domain.
  11. Click OK.
  12. Make as many new composite domains as you need.
  13. Under each composite domain, select the Reference Data tab.
  14. Click Browse and check the check box next to Melissa Data - Address Check.
  15. Under the Melissa Data - Address Check section, map the RDS Schema to the Domains you created and placed under this Composite Domain.
    • For example: Map AddressLine (M) to Address.
  16. Click Add Schema Entry to map additional schema to domains.
  17. When you are done mapping the schema, click OK.

Providers settings

With a composite domain selected, you can modify the settings of the Melissa Data - Address Check service. This includes the Auto Correction Threshold, Suggested Candidates, and Min Confidence.

Auto Correction Threshold

This decimal value sets the confidence threshold for records to be automatically corrected by Melissa Data.

Suggested Candidates

This determines the number of possible suggestions you will receive for an invalid record that can be corrected.

Min Confidence

This decimal value sets the minimum confidence required for a record to be considered valid. All other records will be set as invalid and not correctable

image.

  1. Once you are satisfied with your Domain Management, click Finish.
  • You will then be prompted to publish the Knowledge Base with the latest changes.
  1. Click Publish.
  2. Click OK.