Difference between revisions of "Profiler Object:Introduction"

From Melissa Data Wiki
Demo Mode

The Profiler Object API has a Demo mode that requires no license. Demo mode has the following limitations:

  • Up to 50,000 records can be processed per run.
  • All result codes are suppressed.
  • The AppendMode is forced to Overwrite.





Revision as of 23:05, 30 January 2015




Overview

Melissa Data’s Data Profiler is an object that can be used to analyze a table’s data within custom applications or directly within leading database management systems. The analysis provides a large number of statistics at varying levels of detail. Using these statistics, users can make informed decisions about which strategies they need to employ to handle the data.

Supported Data Profiling Techniques

Discovery

Discovery is the analysis of new data before it is inserted into a data warehouse. This analysis is used to ensure that the data is correctly fielded, consistently formatted, standardized, and so on. Because it can be very difficult to fix problems once data has been merged into a data warehouse, it is critical that issues are detected and corrected prior to the merge.
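
As an illustration of the kind of statistics a discovery pass might collect, the following is a minimal Python sketch that summarizes one column of incoming string data before it is loaded. It is not the Profiler Object API; the function name profile_column and the "shape" notation (digits shown as 9, letters as A) are assumptions made for this example.

```python
from collections import Counter


def profile_column(values):
    """Summarize one column of incoming string data (hypothetical helper)."""
    # Treat None and whitespace-only strings as empty.
    non_empty = [v for v in values if v is not None and v.strip() != ""]
    lengths = [len(v) for v in non_empty]
    # Reduce each value to a shape: digits become 9, letters become A,
    # everything else is kept as-is. Inconsistent shapes hint at
    # inconsistent formatting.
    shapes = Counter(
        "".join("9" if c.isdigit() else "A" if c.isalpha() else c for c in v)
        for v in non_empty
    )
    return {
        "count": len(values),
        "empty": len(values) - len(non_empty),
        "distinct": len(set(non_empty)),
        "min_length": min(lengths, default=0),
        "max_length": max(lengths, default=0),
        "top_shapes": shapes.most_common(3),
    }
```

A column whose values mostly share one shape but include a few outliers is a candidate for cleanup before the merge.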

Monitoring

Monitoring is the continual analysis of warehoused data in an effort to ensure a consistent quality of data. In systems where records are actively inserted, updated and deleted, it is nearly impossible to maintain a comprehensive set of business rules that foresee every situation. In addition, in systems that support multiple methods of access (e.g., web, desktop, tablet/phone), it can be difficult to ensure that all program code adequately enforces all business rules.

Columns and Data Types

The Profiler Object is designed to work with a variety of column types, and analyzes data to ensure that it adheres to the limitations imposed by the user-specified type.

  • Numeric: Integers (8, 16, 32 or 64-bit), Floats (single or double), Decimal and Currency.
  • String: Unicode and Multi-byte, either fixed- or variable-length.
  • Date and/or Time, of varying resolutions.
  • Boolean
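
To make "adheres to the limitations imposed by the user-specified type" concrete, here is a minimal Python sketch of a range check for the integer widths listed above. It is an illustration only, not the Profiler Object API: the helper names are hypothetical, and it assumes signed two's-complement integers.

```python
# Illustrative only: these names are not part of the Profiler Object API.
# Signed ranges for each supported width (assumes two's complement).
INT_RANGES = {
    bits: (-(2 ** (bits - 1)), 2 ** (bits - 1) - 1)
    for bits in (8, 16, 32, 64)
}


def fits_signed_int(text, bits):
    """Return True if text parses as an integer within the signed range for bits."""
    try:
        value = int(text.strip())
    except ValueError:
        return False
    low, high = INT_RANGES[bits]
    return low <= value <= high


def count_type_violations(values, bits):
    """Count values that do not fit the declared integer width."""
    return sum(1 for v in values if not fits_signed_int(v, bits))
```

For example, a column declared as an 8-bit integer that contains the value 300 would be counted as a violation, as would a non-numeric value.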

Data Analysis Summary

Deep data analysis is performed on several levels:

  • General Formatting analysis is used to determine if the input data ‘looks’ like what is expected.
  • Content analysis relies on reference data to determine if the input data contains information consistent with what is expected.
  • Field analysis determines if the input data is consistently fielded, using the data contained in the entire record to analyze the context of the data.
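
The three levels above can be sketched for a single US ZIP code column. This is a simplified Python illustration under stated assumptions, not how the Profiler Object implements its analysis: the tiny ZIP_TO_STATE table stands in for real reference data, and all function names are hypothetical.

```python
import re

# Hypothetical reference data: a tiny ZIP-to-state table standing in for
# the much larger reference data a real profiler would consult.
ZIP_TO_STATE = {"92688": "CA", "10001": "NY"}

ZIP_PATTERN = re.compile(r"\d{5}(-\d{4})?")


def format_ok(value):
    """General formatting: does the value look like a ZIP or ZIP+4 code?"""
    return ZIP_PATTERN.fullmatch(value) is not None


def content_ok(value):
    """Content: is the 5-digit ZIP present in the reference data?"""
    return value[:5] in ZIP_TO_STATE


def field_ok(record):
    """Field: within the whole record, is the "zip" field the only one
    holding a ZIP-looking value? (A crude stand-in for context analysis.)"""
    zip_like = [name for name, value in record.items() if format_ok(value)]
    return zip_like == ["zip"]


def analyze(record):
    """Run all three checks on one record (a dict of field name to string)."""
    zip_code = record["zip"]
    return {
        "format": format_ok(zip_code),
        "content": content_ok(zip_code),
        "field": field_ok(record),
    }
```

A value such as 99999 passes the formatting check but fails the content check here because it is absent from the reference table, while a ZIP code typed into the city field fails the field check even though the value itself is well formed.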