Translate Table

A Translate Table is a file that provides the information necessary to properly match similar words based upon synonym pairings and root extraction or stemming rules. Any number of Translate Tables may be generated. Although no Keyword Index is required to have a Translate Table associated with it, having a Translate Table associated with a Keyword Index is often desired. Each Keyword Index may have its own Translate Table or multiple Keyword Indexes may share a Translate Table.

The Translate Table is defined by Load Information Fields and several word lists:

Load Information Fields

Field

Description

Resource ID

Value assigned to the Translate Table to identify it from other resources used by the installation. The resource ID must be different from that of any other resource defined for the installation.

Language Processing

Flag indicating whether the language specific routine defined in the Language field should be used.  Set the flag to ON to use a language specific routine when stemming.  Set the flag to OFF to do only the stemming specified in the Suffix List.

Language

Defines the language that will be used to finalize the stemming process.  Currently ENGLISH is the only language supported.

Trim Plurals Flag

Flag indicating whether recognized plurals are to be trimmed using English language standards. The choices are: OFF, indicating trimming of plurals should not be done; and ON, indicating trimming of plurals should be done. Besides simple trimming of “s” and “es”, “ies” is replaced by “y” and “lves” is replaced by “lf”. However, Latin plurals ending in “ae” or “i” are not handled, nor are special spellings such as “mice” from “mouse” or “oxen” from “ox”.

Compress Doubles Flag

Flag indicating that all double consonants are to be compressed down to a single character. This can be very useful in combatting simple spelling errors, both within documents and within search requests.

Case Sensitive

Flag indicating whether keywords should be treated equivalent independent of their case.  Set the flag to ON to use an internal case sensitive character set.  Set the flag to OFF to use a case insensitive character set.

Stemming Threshold

The number of characters a word must contain before trimming of plurals or stemming will take place.

Translate Table Lists

SEARCH & LOAD

SEARCH & LOAD lists are applied to both searching and loading.

List Name

Description

Suffix List

List of passes containing suffix patterns used in the stemming process.

Stop List

An alphabetized list of stop words.

Exception List

A list of words that are not to be stemmed.

Start List

Only words found within the Start List are indexed.

Synonym List

Used as a thesaurus or to handle special plurals.

LOAD ONLY

LOAD ONLY lists are applied only during loading.

List Name

Description

Stop List

An alphabetized list of stop words.

Exception List

A list of words that are not to be stemmed.

Start List

Only words found within the Start List are indexed.

Synonym List

Used as a thesaurus or to handle special plurals.

SEARCH ONLY

SEARCH ONLY lists are applied only during searching.

List Name

Description

Stop List

An alphabetized list of stop words.

Exception List

A list of words that are not to be stemmed.

Start List

Only words found within the Start List are indexed.

Synonym List

Used as a thesaurus or to handle special plurals.