Profiler Object:Column-Based Numeric Statistics

From Melissa Data Wiki
Jump to navigation Jump to search

← Profiler Object Reference

Profiler Object Interface Navigation
Initialization
Object Information
Enumeration Listing and Parsing
Column Specification
Initiate Profiling
Data Input
Profiling
Table-Based Statistics
Column-Based Statistics
Column-Based String Statistics
Column-Based Numeric Statistics
Column-Based Date/Time Statistics
Column-Based Name Statistics
Column-Based State/Province Statistics
Column-Based Zip/Postal Code Statistics
Column-Based Country Statistics
Column-Based Email Statistics
Column-Based Phone Statistics
Frequency Iterators
Column-Based Value Frequency Table Iteration
Column-Based Value Length Frequency Table Iteration
Column-Based Value Pattern Table Iteration
Column-Based Value Date/Time Table Iteration
Column-Based Value SoundEx Table Iteration
Column-Based Word Table Iteration
Column-Based Word Length Table Iteration
Result Codes
Profiler Object Result Codes
Result Codes


These statistics should only be retrieved after calling the ProfileData function. These values are all column-specific. Numeric statistics are computed for all column types including non numeric-based ones (such as strings, date/times, etc.)

Quartiles and Medians

A median value is the 'middle' value. That is, if one were to order a list of 100 values from lowest to highest (including duplicate values), the median value is the value of the 50th item. There are two quartile values: first and third. In our example, the first quartile would be the 25th value, and the third quartile would be the 75th value. In cases where the total number of records is not evenly divisible, then a pair of quartile values is computed: the integral quartile, which is the value closest to the quartile or median point, and the interpolated quartile, which is an interpolated quartile value.

GetColumnNumericMaxValue

This function returns the column’s largest value as sorted numerically.

This function takes one parameter.

Parameters

Name Data Type Description
ColumnName String Column Name to get the columns largest value.


Syntax profiler->GetColumnNumericMaxValue(columnNameStr);
C double = mdProfilerGetColumnNumericMaxValue(profiler, columnNameStr);
.Net double = profiler.GetColumNumericMaxValue(columnNameStr);


GetColumnNumericMinValue

This function returns the column’s smallest numeric value.

This function takes one parameter.

Parameters

Name Data Type Description
ColumnName String Column Name to get the columns smallest value.


Syntax profiler->GetColumnNumericMinValue(columnNameStr);
C double = mdProfilerGetColumnNumericMinValue(profiler, columnNameStr);
.Net double = profiler.GetColumNumericMinValue(columnNameStr);


GetColumnNumericAvgValue

This function returns the column’s numeric average value.

This function takes one parameter.

Parameters

Name Data Type Description
ColumnName String Column Name to get the columns average value.


Syntax profiler->GetColumnNumericAvgValue(columnNameStr);
C double = mdProfilerGetColumnNumericAvgValue(profiler, columnNameStr);
.Net double = profiler.GetColumNumericAvgValue(columnNameStr);


GetColumnNumericQ1Value

This function returns the column’s first interpolated quartile value.

This function takes one parameter.

Parameters

Name Data Type Description
ColumnName String Column Name to get the columns first interpolated quartile value.


Syntax profiler->GetColumnNumericQ1Value(columnNameStr);
C double = mdProfilerGetColumnNumericQ1Value(profiler, columnNameStr);
.Net double = profiler.GetColumNumericQ1Value(ColumnNameStr);


GetColumnNumericQ1IntValue

This function returns the column’s first integral quartile value. This is a value from your input data that is closest to the quartile value (non-interpolated).

This function takes one parameter.

Parameters

Name Data Type Description
ColumnName String Column Name to get the columns first integral quartile value.


Syntax profiler->GetColumnNumericQ1IntValue(columnNameStr);
C double = mdProfilerGetColumnNumericQ1IntValue(profiler, columnNameStr);
.Net double = profiler.GetColumNumericQ1IntValue(columnNameStr);


GetColumnNumericMedValue

This function returns the column’s interpolated median value.

This function takes one parameter.

Parameters

Name Data Type Description
ColumnName String Column Name to get the columns interpolated median value.


Syntax profiler->GetColumnNumericMedValue(columnNameStr);
C double = mdProfilerGetColumnNumericMedValue(profiler, columnNameStr);
.Net double = profiler.GetColumNumericMedValue(columnNameStr);


GetColumnNumericMedIntValue

This function returns the column’s Integral median quartile value. This is a value from your input data that is closest to the quartile value (non-interpolated).

This function takes one parameter.

Parameters

Name Data Type Description
ColumnName String Column Name to get the columns integral median quartile value.


Syntax profiler->GetColumnNumericMedIntValue(columnNameStr);
C double = mdProfilerGetColumnNumericMedIntValue(profiler, columnNameStr);
.Net double = profiler.GetColumNumericMedIntValue(columnNameStr);


GetColumnNumericQ3Value

This function returns the column’s third interpolated quartile value.

This function takes one parameter.

Parameters

Name Data Type Description
ColumnName String Column Name to get the columns third interpolated quartile value.


Syntax profiler->GetColumnNumericQ3Value(columnNameStr);
C double = mdProfilerGetColumnNumericQ3Value(profiler, columnNameStr);
.Net double = profiler.GetColumNumericQ3Value(columnNameStr);


GetColumnNumericQ3IntValue

This function returns the column’s third integral quartile value. This is a value from your input data that is closest to the quartile value (non-interpolated).

This function takes one parameter.

Parameters

Name Data Type Description
ColumnName String Column Name to get the columns third integral quartile value.


Syntax profiler->GetColumnNumericQ3IntValue(columnNameStr);
C double = mdProfilerGetColumnNumericQ3IntValue(profiler, columnNameStr);
.Net double = profiler.GetColumNumericQ3IntValue(columnNameStr);


GetColumnNumericStdDevValue

This function returns the column’s population standard deviation value.

This function takes one parameter.

Parameters

Name Data Type Description
ColumnName String Column Name to get the columns population standard deviation value.


Syntax profiler->GetColumnNumericStdDevValue(columnNameStr);
C double = mdProfilerGetColumnNumericStdDevValue(profiler, columnNameStr);
.Net double = profiler.GetColumNumericStdDevValue(columnNameStr);