From: The effect of data cleaning on record linkage quality
Synthetic data
F-measure
No cleaning
0.883
Minimal cleaning
0.882
High cleaning
0.875
Hospital admissions data
0.993
0.992