Although DQM provides over 3,000 word replacement pairs in American English, you can modify the provided lists or create lists of word replacement pairs with words that users often enter with errors or as shortcuts.
The fuzzy key generation program uses only the following 3 word lists for fuzzy search
ADDRESS_DICTIONARY - for address
ORGANIZATION_NAME_DICTIONARY - for organization
PERSON_NAME_DICTIONARY - for person
You cannot create your own replacement list, but must update any of the applicable lists listed above, for fuzzy search.
Attention: A new word list is not used until you create custom transformations that use the list. See: Creating Custom Transformations.
When you create or copy a word list, you must specify the word identification method. You cannot change the method when you update a list.
Note: The Nondelimited method is usually used for relevant non-English languages, such as Japanese, that are based on characters, not words separated by spaces.
Example
John is the original word, Jonathan is the replacement word, and the attribute value is John Johnson. If the word replacement with the Delimited method is applied, then the attribute value becomes Jonathan Johnson, because only John surrounded by spaces is replaced. If with Nondelimited, then the value becomes Jonathan Jonathanson, because John is replaced no matter where it appears.
This table describes some terms in the pages used for this procedure.
Selected Terminology
| Term | Description |
|---|---|
| Condition | Criterion that must be met for the word replacement to occur. Conditions are particularly useful for country-specific word replacements. For example, in the UK, LTD or Limited is a common organization name suffix. You can specify to replace either word with a blank space only if it appears at the end of a string. |
Enter a unique word list name, and optionally define the source of the list, for example to identify a list that you created or obtained from a third party. When you update an existing list, you can change the name and source, but not the language.
Define word replacement pairs.
No matter which word identification method is selected, do not enter original words with spaces.
For original and replacement words, you can enter not only whole words, but also abbreviations, word fragments, and numeric characters. For example, you can create a word replacement pair by entering 1 as the original word and one as the replacement word. If a user enters 1 to perform a search, then one is used to search your party information.
Replacement words do not have to be unique and can be left blank. You cannot, however, use the same word as both an original and replacement word in the same word list. For example, you cannot have Street to be replaced by St. in a word pair, and also St. to be replaced by Saint in another word pair.
You can create several word replacement pairs that have different unique original words with the same replacement word. This table shows an example:
| Original Word | Replacement Word |
|---|---|
| Bob | Robert |
| Rob | Robert |
| Robbie | Robert |
| Roberto | Robert |
| Bobby | Robert |
For any word pair, optionally enter a condition.
You can use the same original word multiple times in a list only if the replacement words and conditions are different. For example, you can enter St. twice as an original word to be replaced by the replacement words Street and Saint, with a condition for each case.
Attention: If you use original words multiple times, the conditions are applied in the order defined, and a word is replaced according to the first condition that is met. For example, if the St. and Street word pair is defined first, and that condition is met, then the word replacement occurs. The condition for the St. and Saint word pair is skipped.
You must enter a value after the condition if the field is not disabled. If multiple values are possible, for example for the seeded If Country Equals condition, separate each value by a comma.
After you add or modify word replacement pairs, run the DQM Staging program to update the staged schema to include the new or revised word replacement pairs. In the Original Word column, Staging Required indicates the word pairs that still need to be staged.
For any record that you add to or update in the TCA Registry, the word replacement pairs become immediately effective after the DQM Staging program finishes. See: DQM Staging Program.