RightFielder Object:Config Example: Difference between revisions

Revision as of 18:22, 24 February 2014

mdRightFielder.cfg Case Studies

The mdRightFielder.cfg file is a plain text file that users can use to override the default entries contained in the mdRightFielder.dat data file.

For complete instructions on which tables and types can be overridden, along with syntax and examples, open mdRightFielder.cfg in a text editor and follow the instructions it contains.

There are three types of modifications that can be made in mdRightFielder.cfg:

Lookup Table Overrides

Words and phrases can be added to or removed from the Object’s dictionaries. This expands (or limits) Right Fielder’s vocabulary.

There are three lookup tables in Right Fielder Object:

LeftToken – used to recognize words and phrases that usually appear at the start of the data (names, companies, titles)
MiddleToken – used to recognize words and phrases that usually appear in the middle of the data (addresses, apartments, PO Boxes, etc.)
RightToken – used to recognize words and phrases that usually appear at the end of the data (city, state, country)

Say, for example, you are processing a list of car dealerships and the company recognition isn’t working as well as you would like. In evaluating the results, it appears that if Right Fielder knew a bit more about car manufacturers, processing might be a lot more accurate. In this case, you would modify the LeftToken table in this way:

[LeftToken]
FORD,C
TOYOTA,C
CHEVY,C
NISSAN,C
KIA,C
HONDA,C
LINCOLN,C
ALFA ROMEO,C
MOTORS,C
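When the list of terms is long, the override section can be generated rather than typed by hand. The following is a minimal Python sketch, assuming each entry goes on its own line as WORD,TOKEN under the table header, and assuming entries should be upper case as in the example above. The helper name build_lookup_section is hypothetical and is not part of Right Fielder.

```python
def build_lookup_section(table, words, token):
    """Return the text of a lookup-table override section.

    Assumes one WORD,TOKEN entry per line under a [Table] header,
    as in the example above. This helper is hypothetical.
    """
    lines = [f"[{table}]"]
    seen = set()
    for word in words:
        entry = word.strip().upper()      # upper case is an assumption
        if entry and entry not in seen:   # skip blanks and duplicates
            seen.add(entry)
            lines.append(f"{entry},{token}")
    return "\n".join(lines) + "\n"


makers = ["Ford", "Toyota", "Chevy", "ford ", "Alfa Romeo"]
section = build_lookup_section("LeftToken", makers, "C")
print(section)
```

The resulting text can be pasted into mdRightFielder.cfg. Check it against the instructions inside that file before relying on it.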

Regular Expression Overrides

This is the addition of regular expressions used to recognize specific character patterns (for example, phone numbers and e-mail addresses).
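As an illustration of the kind of pattern involved, here is a small Python sketch using the standard re module. The two expressions are simplified assumptions for North American phone numbers and e-mail addresses. They are not the patterns Right Fielder ships with, and the syntax for entering a regular expression in mdRightFielder.cfg is documented in that file, not here.

```python
import re

# Simplified, illustrative patterns (assumptions, not Right Fielder's own).
PHONE = re.compile(r"\(?\b\d{3}\)?[-. ]?\d{3}[-. ]?\d{4}\b")
EMAIL = re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b")


def find_patterns(text):
    """Return every phone-like and e-mail-like substring found in text."""
    return {"phone": PHONE.findall(text), "email": EMAIL.findall(text)}


sample = "Call 949-555-0123 or write to sales@example.com"
print(find_patterns(sample))
```

Testing a candidate expression against sample records in this way, before adding it to the configuration, helps catch patterns that match too much or too little.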

Pattern Table Overrides

This is the addition or removal of patterns of words and phrases. Words and phrases are first identified via the lookup tables and assigned tokens (specified in the lookup table itself). Sequences of tokens (patterns) are then matched against entries in the pattern table and transformed into output data.
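The two-stage flow described here (lookup, then pattern match) can be sketched in Python as follows. This is a toy model only. The token code C comes from the LeftToken example, while the code W for unknown words, the pattern entries, and the output field name Company are assumptions invented for illustration. They do not reflect Right Fielder's actual tables or algorithm, and multi-word phrases such as ALFA ROMEO are not handled.

```python
# Toy lookup table: word -> token code ("C" as in the LeftToken example).
LOOKUP = {"FORD": "C", "TOYOTA": "C", "MOTORS": "C"}

# Toy pattern table: token sequence -> output field (hypothetical entries).
PATTERNS = {("C",): "Company", ("C", "C"): "Company"}


def tokenize(text):
    """Assign each word a token code; unknown words get the assumed code 'W'."""
    words = text.upper().split()
    return words, tuple(LOOKUP.get(w, "W") for w in words)


def classify(text):
    """Return (field, text) if the token sequence matches a pattern, else None."""
    words, tokens = tokenize(text)
    field = PATTERNS.get(tokens)
    return (field, " ".join(words)) if field else None


print(classify("Ford Motors"))
print(classify("Hello World"))
```

In this model, adding a word to the lookup table changes which token sequence an input produces, and adding a pattern changes which sequences are recognized. That is why the two kinds of override are configured separately.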


