Skip to main content

Table 2 Performance summary of the deidentification software

From: Development and evaluation of an open source software tool for deidentification of pathology reports

 

Dept. A

Dept. B

Dept. C

Total

Reports

600

600

600

1800

Reports with any identifier

415

239

600

1254

Unique identifiers

1079

338

2082

3499

Unique identifiers per report

1.8

0.6

3.5

1.9

Unique identifiers removed

1057

320

2062

3439

Unique identifiers remaining, total

22

18

20

60

   Unique HIPAA identifiers remaining

11

1

7

19

% Unique identifiers removed

98.0%

94.7%

99.0%

98.3%

Unique over-scrubs

1126

961

2584

4671

Unique over-scrubs per report

1.9

1.6

4.3

2.6

% unique phrases removed that were identifiers

48.4%

25.0%

44.4%

42.4%