If you want to see the actual difference threshold (Levenshtein Distance) between each group owner and group member in a fuzzy duplicates output table, and the difference percentage that applies to each owner-member pair, you can add computed fields to display these values. Once you have added the computed fields, you can create a nested sort order to rank the output results by their degree of fuzziness.
You need to create three computed fields, and the fields must be created in this order:
Group owner computed field
Levenshtein Distance computed field
Difference percentage computed field
To add difference threshold and difference percentage fields:
LEVDIST(ALLTRIM(Group_Owner),ALLTRIM(fuzzy_dup_test_field),F)
Replace fuzzy_dup_test_field with the actual name of the fuzzy duplicates test field.
100*DEC(Lev_Dist,2)/MINIMUM(LENGTH(ALLTRIM(Group_Owner)),
LENGTH(ALLTRIM(fuzzy_dup_test_field)))
Replace fuzzy_dup_test_field with the actual name of the fuzzy duplicates test field.
The difference threshold (Levenshtein Distance) between each group owner and group member, and the difference percentage that applies to each owner-member pair, is now displayed.
For information about how to add fields to a view, see Adding columns to a view.
The ACLScript syntax for the extract operation appears below.
EXTRACT FIELDS Lev_Dist Diff_Pct GROUP_NUM Group_Owner ORIG_REC_NUM fuzzy_dup_test_field IF ISBLANK(GROUP_FL) TO "Ranked_Fuzzy_Dupes_1" OPEN
Replace fuzzy_dup_test_field with the actual name of the fuzzy duplicates test field.
The ACLScript syntax for the sort operation appears below.
SORT ON Lev_Dist Diff_Pct TO "Ranked_Fuzzy_Dupes_2" OPEN
The fuzziness of the output results increases as you go down the table. The Group Number field is the original record number of the group owner in each fuzzy duplicate pair, and the Original Record Number field is the original record number of the group member in each pair.
For information about how to create a nested sort, see Sorting records.