Table 9 Main results for geographic file

From: De-identifying a public use microdata file from the Canadian national discharge abstract database

 

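| | PROV_ALL 0.2 Comb. | PROV_ALL 0.2 Complete | PROV_ALL 0.05 Comb. | PROV_ALL 0.05 Complete | PROV_ALL 0.04 Comb. | PROV_ALL 0.04 Complete | PROV_REGION 0.2 Comb. | PROV_REGION 0.2 Complete | PROV_REGION 0.05 Comb. | PROV_REGION 0.05 Complete | PROV_REGION 0.04 Comb. | PROV_REGION 0.04 Complete |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GENDER_CODE | 0.002 | 0.002 | 0.005 | 0.005 | 0.005 | 0.005 | 0.002 | 0.003 | 0.003 | 0.009 | 0.002 | 0.012 |
| AGE_GROUP | 1.6 | 5.2 | 5.4 | 14.2 | 6.5 | 16.4 | 0.81 | 3.8 | 2.8 | 10.6 | 3.4 | 12.2 |
| PROV_XXX | 0.07 | 3.11 | 0.2 | 7.4 | 0.25 | 8.3 | 0.06 | 0.3 | 0.16 | 0.5 | 0.19 | 0.6 |
| MRDx | 12.6 | 14.4 | 26.8 | 27 | 29.4 | 29.4 | 8.6 | 10.7 | 19.4 | 21.4 | 21.6 | 23.5 |
| CMG_CODE | 12.6 | 14.4 | 26.8 | 27 | 29.4 | 29.4 | 8.6 | 10.7 | 19.4 | 21.4 | 21.6 | 23.5 |
| DIAG_BLOCK | 8.5 | 4.8 | 19.8 | 9.5 | 22.2 | 10.5 | 5.5 | 3.6 | 13.5 | 7.5 | 15.2 | 8.3 |
| DIAG_CHAPTER | 1.5 | 0.11 | 3.8 | 0.3 | 4.3 | 0.4 | 0.87 | 0.07 | 2.5 | 0.3 | 2.8 | 0.31 |
| CCI_CODE | 5.9 | 8.13 | 10.7 | 13.2 | 11.5 | 14.14 | 4.4 | 6.8 | 8.8 | 11.9 | 9.7 | 12.7 |
| SHORT_CCI | 5.9 | 8.13 | 10.7 | 13.2 | 11.5 | 14.14 | 4.4 | 6.8 | 8.8 | 11.9 | 9.7 | 12.7 |
| Total % Cells Suppressed | 5.4 | 6.5 | 11.6 | 12.4 | 12.8 | 13.6 | 3.7 | 4.75 | 8.4 | 9.5 | 9.4 | 10.4 |
| Entropy (%) | 100 | 137 | 246 | 302 | 274 | 334 | 70 | 104 | 181 | 236 | 204 | 262 |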

  1. Missingness (as a percentage of all records) in each quasi-identifier of the first public use microdata file (PUMF), by probability threshold, level of geographic detail, and algorithm: combinations ("Comb.") or complete. The last two rows show the percentage of quasi-identifier cells that are suppressed and the change in entropy from the baseline, as a percentage.
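
The "Total % Cells Suppressed" row can be cross-checked against the per-variable rows. The sketch below assumes the total is the unweighted mean of the nine quasi-identifier missingness values in each column. That assumption is inferred from the published numbers, which agree with it to within rounding; the table itself does not state how the total was computed. The names `missingness`, `reported_total`, and `mean_missingness` are illustrative only.

```python
# Missingness (%) per quasi-identifier, copied from the table.
# Column order: PROV_ALL at thresholds 0.2, 0.05, 0.04, then PROV_REGION
# at 0.2, 0.05, 0.04, each threshold split into Comb. and Complete.
missingness = {
    "GENDER_CODE":  [0.002, 0.002, 0.005, 0.005, 0.005, 0.005,
                     0.002, 0.003, 0.003, 0.009, 0.002, 0.012],
    "AGE_GROUP":    [1.6, 5.2, 5.4, 14.2, 6.5, 16.4,
                     0.81, 3.8, 2.8, 10.6, 3.4, 12.2],
    "PROV_XXX":     [0.07, 3.11, 0.2, 7.4, 0.25, 8.3,
                     0.06, 0.3, 0.16, 0.5, 0.19, 0.6],
    "MRDx":         [12.6, 14.4, 26.8, 27, 29.4, 29.4,
                     8.6, 10.7, 19.4, 21.4, 21.6, 23.5],
    "CMG_CODE":     [12.6, 14.4, 26.8, 27, 29.4, 29.4,
                     8.6, 10.7, 19.4, 21.4, 21.6, 23.5],
    "DIAG_BLOCK":   [8.5, 4.8, 19.8, 9.5, 22.2, 10.5,
                     5.5, 3.6, 13.5, 7.5, 15.2, 8.3],
    "DIAG_CHAPTER": [1.5, 0.11, 3.8, 0.3, 4.3, 0.4,
                     0.87, 0.07, 2.5, 0.3, 2.8, 0.31],
    "CCI_CODE":     [5.9, 8.13, 10.7, 13.2, 11.5, 14.14,
                     4.4, 6.8, 8.8, 11.9, 9.7, 12.7],
    "SHORT_CCI":    [5.9, 8.13, 10.7, 13.2, 11.5, 14.14,
                     4.4, 6.8, 8.8, 11.9, 9.7, 12.7],
}

# "Total % Cells Suppressed" row as reported in the table.
reported_total = [5.4, 6.5, 11.6, 12.4, 12.8, 13.6,
                  3.7, 4.75, 8.4, 9.5, 9.4, 10.4]


def mean_missingness(col):
    """Unweighted mean of the nine quasi-identifier values in one column."""
    values = [row[col] for row in missingness.values()]
    return sum(values) / len(values)


# Print the computed mean next to the reported total for each column.
for col, reported in enumerate(reported_total):
    computed = mean_missingness(col)
    print(f"column {col + 1}: mean {computed:.2f}, reported {reported}")
```

Because every quasi-identifier has one cell per record, an unweighted mean over variables is the same as the share of all quasi-identifier cells that are suppressed, which is why the two readings of the row coincide.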