From: A proficient cost reduction framework for de-duplication of records in data integration
Experiment category | Exp. code | Blocking key | Encoding function for blocking key |
---|---|---|---|
Single Key Blocking (SKB) | DX-SKB | given_name | 1. Soundex (SDX); 2. Substring4 (SB4); 3. Substring3 (SB3) |
Composite Key Blocking (CKB) | DX-CKB | given_name + surname | |
Multipass Blocking (MPB) | DX-MPB | given_name (Pass1), surname (Pass2) | |
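
The three blocking strategies and three encoding functions in the table can be illustrated with a minimal Python sketch. It is not the authors' implementation and rests on several assumptions: Substring4 and Substring3 are taken to mean the first four and first three characters of the key, the composite key is formed by encoding each field separately and concatenating the results, and multipass blocking is taken as the union of the candidate pairs from each pass. The function names (`soundex`, `build_blocks`, `candidate_pairs`, `skb`, `ckb`, `mpb`) and the sample records are hypothetical and serve only as illustration.

```python
from collections import defaultdict
from itertools import combinations

# Standard American Soundex letter-to-digit mapping.
_SOUNDEX_CODES = {
    ch: digit
    for digit, letters in {"1": "bfpv", "2": "cgjkqsxz", "3": "dt",
                           "4": "l", "5": "mn", "6": "r"}.items()
    for ch in letters
}

def soundex(name):
    """Return the 4-character Soundex code of a name ('' if no letters)."""
    letters = [c for c in name.lower() if c.isalpha()]
    if not letters:
        return ""
    out = letters[0].upper()
    prev = _SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = _SOUNDEX_CODES.get(ch, "")  # vowels and y map to ""
        if code and code != prev:
            out += code
        prev = code
    return (out + "000")[:4]

# Encoding functions for the blocking key (SB4/SB3 as prefixes: an assumption).
ENCODERS = {
    "SDX": soundex,
    "SB4": lambda s: s[:4].lower(),
    "SB3": lambda s: s[:3].lower(),
}

def build_blocks(records, fields, encode):
    """Group record ids by the concatenated encoded values of `fields`."""
    blocks = defaultdict(list)
    for rec in records:
        key = "".join(encode(rec[f]) for f in fields)
        blocks[key].append(rec["id"])
    return dict(blocks)

def candidate_pairs(blocks):
    """All id pairs that share a block; only these are compared later."""
    pairs = set()
    for ids in blocks.values():
        pairs.update(combinations(sorted(ids), 2))
    return pairs

def skb(records, encode):
    """Single key blocking on given_name."""
    return candidate_pairs(build_blocks(records, ["given_name"], encode))

def ckb(records, encode):
    """Composite key blocking on given_name + surname."""
    return candidate_pairs(build_blocks(records, ["given_name", "surname"], encode))

def mpb(records, encode):
    """Multipass blocking: pass 1 on given_name, pass 2 on surname."""
    pass1 = candidate_pairs(build_blocks(records, ["given_name"], encode))
    pass2 = candidate_pairs(build_blocks(records, ["surname"], encode))
    return pass1 | pass2

# Made-up sample records, for illustration only.
RECORDS = [
    {"id": 1, "given_name": "Robert", "surname": "Smith"},
    {"id": 2, "given_name": "Rupert", "surname": "Smyth"},
    {"id": 3, "given_name": "Robin", "surname": "Smith"},
    {"id": 4, "given_name": "Alice", "surname": "Jones"},
]
```

On these toy records, `skb(RECORDS, ENCODERS["SDX"])` and `ckb(RECORDS, ENCODERS["SDX"])` both yield the single candidate pair `(1, 2)`, while `mpb(RECORDS, ENCODERS["SDX"])` also yields `(1, 3)` and `(2, 3)` through the surname pass. The sketch thus shows the usual trade-off: a more restrictive key produces fewer comparisons, and additional passes recover pairs that a single key would miss.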