Skip to main content

Table 3 Permutations for Experimental Evaluation

From: A proficient cost reduction framework for de-duplication of records in data integration

Indexing technique

Methodology

Encoding function for indexing key

Field comparison functions

1. Blocking

• Single Key Blocking (SKB)

• Composite Key Blocking (CKB)

• Multipass Blocking (MPB)

• Soundex (SDX)

• Substring-4 (SB4)

• Substring-3 (SB3)

• Soundex

• Edit-Distance

• Q-gram

2. Windowing with window sizes 3, 6, 9, …, 30

• Single Key Windowing (SKW)

• Composite Key Windowing (CKW)

• Multipass Windowing (MPW)