Table 9 Main results for geographic file

From: De-identifying a public use microdata file from the Canadian national discharge abstract database

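| Variable | PROV_ALL, 0.2, Comb. | PROV_ALL, 0.2, Complete | PROV_ALL, 0.05, Comb. | PROV_ALL, 0.05, Complete | PROV_ALL, 0.04, Comb. | PROV_ALL, 0.04, Complete | PROV_REGION, 0.2, Comb. | PROV_REGION, 0.2, Complete | PROV_REGION, 0.05, Comb. | PROV_REGION, 0.05, Complete | PROV_REGION, 0.04, Comb. | PROV_REGION, 0.04, Complete |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GENDER_CODE | 0.002 | 0.002 | 0.005 | 0.005 | 0.005 | 0.005 | 0.002 | 0.003 | 0.003 | 0.009 | 0.002 | 0.012 |
| AGE_GROUP | 1.6 | 5.2 | 5.4 | 14.2 | 6.5 | 16.4 | 0.81 | 3.8 | 2.8 | 10.6 | 3.4 | 12.2 |
| PROV_XXX | 0.07 | 3.11 | 0.2 | 7.4 | 0.25 | 8.3 | 0.06 | 0.3 | 0.16 | 0.5 | 0.19 | 0.6 |
| MRDx | 12.6 | 14.4 | 26.8 | 27 | 29.4 | 29.4 | 8.6 | 10.7 | 19.4 | 21.4 | 21.6 | 23.5 |
| CMG_CODE | 12.6 | 14.4 | 26.8 | 27 | 29.4 | 29.4 | 8.6 | 10.7 | 19.4 | 21.4 | 21.6 | 23.5 |
| DIAG_BLOCK | 8.5 | 4.8 | 19.8 | 9.5 | 22.2 | 10.5 | 5.5 | 3.6 | 13.5 | 7.5 | 15.2 | 8.3 |
| DIAG_CHAPTER | 1.5 | 0.11 | 3.8 | 0.3 | 4.3 | 0.4 | 0.87 | 0.07 | 2.5 | 0.3 | 2.8 | 0.31 |
| CCI_CODE | 5.9 | 8.13 | 10.7 | 13.2 | 11.5 | 14.14 | 4.4 | 6.8 | 8.8 | 11.9 | 9.7 | 12.7 |
| SHORT_CCI | 5.9 | 8.13 | 10.7 | 13.2 | 11.5 | 14.14 | 4.4 | 6.8 | 8.8 | 11.9 | 9.7 | 12.7 |
| Total % Cells Suppressed | 5.4 | 6.5 | 11.6 | 12.4 | 12.8 | 13.6 | 3.7 | 4.75 | 8.4 | 9.5 | 9.4 | 10.4 |
| Entropy (%) | 100 | 137 | 246 | 302 | 274 | 334 | 70 | 104 | 181 | 236 | 204 | 262 |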
Missingness (as a percentage of all records) for each quasi-identifier in the first PUMF, by probability threshold (0.2, 0.05, 0.04), level of geographic detail (PROV_ALL, PROV_REGION), and algorithm (combinations, "Comb.", or complete). The last two rows show the percentage of quasi-identifier cells that are suppressed and the change in entropy from the baseline, as a percentage.
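As a quick consistency check on the table, the "Total % Cells Suppressed" row appears to equal the unweighted mean of the nine per-variable missingness rows, which is what one would expect if every quasi-identifier contributes one cell per record. This is an observation about the reported values, not a method stated in the source, and `column_means` is a hypothetical helper introduced only for this sketch.

```python
# Per-variable missingness (%) copied from Table 9. Column order:
# PROV_ALL then PROV_REGION; thresholds 0.2, 0.05, 0.04; Comb. then Complete.
rows = {
    "GENDER_CODE":  [0.002, 0.002, 0.005, 0.005, 0.005, 0.005,
                     0.002, 0.003, 0.003, 0.009, 0.002, 0.012],
    "AGE_GROUP":    [1.6, 5.2, 5.4, 14.2, 6.5, 16.4,
                     0.81, 3.8, 2.8, 10.6, 3.4, 12.2],
    "PROV_XXX":     [0.07, 3.11, 0.2, 7.4, 0.25, 8.3,
                     0.06, 0.3, 0.16, 0.5, 0.19, 0.6],
    "MRDx":         [12.6, 14.4, 26.8, 27, 29.4, 29.4,
                     8.6, 10.7, 19.4, 21.4, 21.6, 23.5],
    "CMG_CODE":     [12.6, 14.4, 26.8, 27, 29.4, 29.4,
                     8.6, 10.7, 19.4, 21.4, 21.6, 23.5],
    "DIAG_BLOCK":   [8.5, 4.8, 19.8, 9.5, 22.2, 10.5,
                     5.5, 3.6, 13.5, 7.5, 15.2, 8.3],
    "DIAG_CHAPTER": [1.5, 0.11, 3.8, 0.3, 4.3, 0.4,
                     0.87, 0.07, 2.5, 0.3, 2.8, 0.31],
    "CCI_CODE":     [5.9, 8.13, 10.7, 13.2, 11.5, 14.14,
                     4.4, 6.8, 8.8, 11.9, 9.7, 12.7],
    "SHORT_CCI":    [5.9, 8.13, 10.7, 13.2, 11.5, 14.14,
                     4.4, 6.8, 8.8, 11.9, 9.7, 12.7],
}

# "Total % Cells Suppressed" row as reported in the table.
reported_total = [5.4, 6.5, 11.6, 12.4, 12.8, 13.6,
                  3.7, 4.75, 8.4, 9.5, 9.4, 10.4]


def column_means(table):
    """Unweighted mean of each column across all variables (hypothetical helper)."""
    n_cols = len(next(iter(table.values())))
    return [sum(values[i] for values in table.values()) / len(table)
            for i in range(n_cols)]


means = column_means(rows)

# Largest absolute gap between the recomputed mean and the reported total.
max_gap = max(abs(m - t) for m, t in zip(means, reported_total))

for mean, total in zip(means, reported_total):
    print(f"recomputed {mean:6.2f}  reported {total:6.2f}")
```

The recomputed means agree with the reported totals to within rounding of the published figures, so the total can be read as an average suppression rate across the nine quasi-identifiers.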