Editing the Lists

Five word lists are maintained within the Translate Table editor. To switch between lists, click the list name in the LISTS task bar along the left side of the Translate Table editor.

Besides typing words directly into a particular list, data can be manipulated in the following ways:

Method

Cut, Copy and Paste.

Inserting and Removing Rows.

Undo and Redo.

Select All.

A translate table consists of three groups of lists: SEARCH & LOAD, LOAD ONLY and SEARCH ONLY.

SEARCH & LOAD

The SEARCH & LOAD group consists of five lists: Suffix List, Stop List, Exception List, Start List and Synonym List.

Suffix List

The suffix list is used to define a list of passes containing suffix patterns used in the stemming process.

The word list consists of three columns:

Column       Description
Threshold    Must be numeric. Number of characters required to be in a word for the current pattern-replacement processing to take place (overrides Process Threshold if greater).
Pattern      ASCII pattern to be matched as a suffix for replacement.
Replacement  ASCII string to be used as a suffix to replace the matched pattern.
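The three columns can be read as a suffix-rewriting rule. The following is a minimal sketch of how one such pass might behave, not the editor's actual implementation. The names SuffixRule and apply_suffix_rules are hypothetical, and the sketch assumes that the first matching row wins and that Threshold means "at least this many characters".

```python
from typing import NamedTuple


class SuffixRule(NamedTuple):
    """One row of a suffix list (hypothetical representation)."""
    threshold: int     # minimum word length for this row to apply
    pattern: str       # suffix to match
    replacement: str   # suffix substituted for the matched pattern


def apply_suffix_rules(word: str, rules: list[SuffixRule],
                       process_threshold: int = 0) -> str:
    """Apply the first matching rule, or return the word unchanged."""
    for rule in rules:
        # Assumption: the row threshold overrides the process-wide
        # threshold only when it is the greater of the two.
        required = max(rule.threshold, process_threshold)
        if len(word) >= required and word.endswith(rule.pattern):
            stem = word[: len(word) - len(rule.pattern)]
            return stem + rule.replacement
    return word


# Illustrative rows only; these are not taken from a shipped table.
rules = [
    SuffixRule(5, "ies", "y"),
    SuffixRule(4, "s", ""),
]

print(apply_suffix_rules("ponies", rules))  # pony
print(apply_suffix_rules("bus", rules))     # bus (too short for either row)
```

Ordering the rows from longest to shortest pattern keeps a general rule such as "s" from pre-empting a more specific one such as "ies".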

Stop List

The stop list is used to define an alphabetized list of stop words.
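As an illustration of the effect of a stop list, the sketch below drops stop words from a sequence of words before indexing. The function name remove_stop_words is hypothetical, and case-insensitive matching is an assumption rather than documented behavior.

```python
def remove_stop_words(words, stop_list):
    """Return the words that do not appear in the stop list."""
    # Assumption: matching ignores case.
    stop = {w.lower() for w in stop_list}
    return [w for w in words if w.lower() not in stop]


# An alphabetized stop list, as the editor maintains it.
stop_list = ["a", "and", "of", "the"]

print(remove_stop_words(["The", "cat", "and", "the", "hat"], stop_list))
# ['cat', 'hat']
```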

Exception List

The exception list is used to define a list of words that are not to be stemmed.
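The role of the exception list can be sketched as a guard placed in front of the stemmer. Both stem_unless_exception and the toy stemmer strip_plural_s are hypothetical names used only for illustration; the real stemming process is driven by the suffix list.

```python
def strip_plural_s(word):
    """Toy stemmer (illustration only): drop a trailing 's' from longer words."""
    return word[:-1] if len(word) > 3 and word.endswith("s") else word


def stem_unless_exception(word, exceptions, stem=strip_plural_s):
    """Return the word untouched if it is an exception, else its stem."""
    # Assumption: exception matching ignores case.
    if word.lower() in exceptions:
        return word
    return stem(word)


exceptions = {"lens", "news"}

print(stem_unless_exception("cats", exceptions))  # cat
print(stem_unless_exception("news", exceptions))  # news
```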

Start List

The start list is used to restrict indexing to specific words.

The word list consists of two columns:

Column       Description
Pattern      Words to be found in the data
Replacement  Words to be indexed in place of their corresponding Pattern
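One way to picture a start list is as a mapping from Pattern to Replacement in which only listed words reach the index. The sketch below assumes exact matching and assumes that words absent from the Pattern column are dropped; apply_start_list is a hypothetical name.

```python
def apply_start_list(words, start_list):
    """Keep only words listed as a Pattern, indexing their Replacement."""
    return [start_list[w] for w in words if w in start_list]


# Pattern -> Replacement (illustrative rows only)
start_list = {"colour": "color", "index": "index"}

print(apply_start_list(["the", "colour", "index", "table"], start_list))
# ['color', 'index']
```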

Synonym List

The synonym list is used to define a thesaurus or to handle special plurals such as “mice”, which is the plural of “mouse”.

The word list consists of two columns:

Column       Description
Pattern      Words to be found in the data
Replacement  Words to be indexed in place of their corresponding Pattern
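A synonym list can be sketched as the same kind of Pattern-to-Replacement mapping, with the difference that unlisted words pass through unchanged. This is an assumption about the behavior, and apply_synonyms is a hypothetical name.

```python
def apply_synonyms(words, synonyms):
    """Replace each word that has a synonym entry; keep all others as-is."""
    return [synonyms.get(w, w) for w in words]


# Pattern -> Replacement (illustrative rows only)
synonyms = {"mice": "mouse", "automobile": "car"}

print(apply_synonyms(["mice", "ate", "automobile"], synonyms))
# ['mouse', 'ate', 'car']
```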


LOAD ONLY

The LOAD ONLY group consists of four lists: Stop List, Exception List, Start List and Synonym List.

Stop List

The stop list is used to define an alphabetized list of stop words.

Exception List

The exception list is used to define a list of words that are not to be stemmed.

Start List

The start list is used to restrict indexing to specific words.

The word list consists of two columns:

Column       Description
Pattern      Words to be found in the data
Replacement  Words to be indexed in place of their corresponding Pattern

Synonym List

The synonym list is used to define a thesaurus or to handle special plurals such as “mice”, which is the plural of “mouse”.

The word list consists of two columns:

Column       Description
Pattern      Words to be found in the data
Replacement  Words to be indexed in place of their corresponding Pattern

SEARCH ONLY

The SEARCH ONLY group consists of four lists: Stop List, Exception List, Start List and Synonym List.

Stop List

The stop list is used to define an alphabetized list of stop words.

Exception List

The exception list is used to define a list of words that are not to be stemmed.

Start List

The start list is used to restrict indexing to specific words.

The word list consists of two columns:

Column       Description
Pattern      Words to be found in the data
Replacement  Words to be indexed in place of their corresponding Pattern

Synonym List

The synonym list is used to define a thesaurus or to handle special plurals such as “mice”, which is the plural of “mouse”.

The word list consists of two columns:

Column       Description
Pattern      Words to be found in the data
Replacement  Words to be indexed in place of their corresponding Pattern
