
Table 8 Overall performance

From: An efficient record linkage scheme using graphical analysis for identifier error detection

| Process | Operation | Timing |
|---|---|---|
| 1 | Identifier cleaning; forename/surname duplication screening | 3 min |
| 2 | Construction of unique identifiers | 1 min |
| 3 | Initial clustering using identifiers | 7 min |
| 4 | Identity collision detection | 10 min |
| 5 | Identity collision resolution | 2 min |
| 6 | Identity collision reassessment | 2 min |

  1. After initial linkage, a process of collision resolution is applied (see methods). This causes a decrease in the number of clusters containing multiple identifiers, as detailed above.
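
As a quick arithmetic check on Table 8, the per-stage timings can be totalled and expressed as shares of the overall run. The sketch below is illustrative only: it assumes the six stages run one after another, so that their timings add up, which the table does not state explicitly. The names `timings_min` and `share` are hypothetical and not taken from the paper.

```python
# Stage timings in minutes, transcribed from Table 8.
timings_min = {
    "Identifier cleaning; forename/surname duplication screening": 3,
    "Construction of unique identifiers": 1,
    "Initial clustering using identifiers": 7,
    "Identity collision detection": 10,
    "Identity collision resolution": 2,
    "Identity collision reassessment": 2,
}

# Assumption: stages are sequential, so the overall time is the plain sum.
total_min = sum(timings_min.values())

# Stage with the largest single timing.
slowest = max(timings_min, key=timings_min.get)


def share(stage: str) -> float:
    """Fraction of the total run time spent in the given stage."""
    return timings_min[stage] / total_min


print(f"Total: {total_min} min")
for stage, minutes in timings_min.items():
    print(f"{minutes:>3} min  {share(stage):5.0%}  {stage}")
```

Under that sequential assumption the stages sum to 25 min, with identity collision detection (10 min) the largest single contribution.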