Skip to main content

Table 2 Statistics about predictive properties obtained for low-dimensional datasets

From: Efficient and effective pruning strategies for health data de-identification

Data

Trans.

Checked

Property

 

Inserts

Hits

Antichain

Adult

12,960

1,180 (9.10 %)

Insufficient quality

↑

1,062

74.69 %

73.54 %

   

Insufficient protection

↓

887

15.37 %

93.01 %

Cup

45,000

1,524 (3.39 %)

Insufficient quality

↑

1,435

80.93 %

76.31 %

   

Insufficient protection

↓

1,172

24.75 %

96.84 %

Fars

20,736

1,342 (6.47 %)

Insufficient quality

↑

1,161

75.44 %

61.84 %

   

Insufficient protection

↓

752

10.81 %

88.43 %

Atus

34,992

1,022 (2.92 %)

Insufficient quality

↑

903

82.53 %

59.25 %

   

Insufficient protection

↓

561

5.23 %

95.37 %

Ihis

25,920

1,574 (6.07 %)

Insufficient quality

↑

1,341

73.58 %

42.80 %

   

Insufficient protection

↓

679

7.91 %

90.28 %

  1. We report the size of the solution space, the percentage of transformations checked as well as the number of inserts, the number of hits and the maximal size of the antichain for each predictive property. The size of the antichain is expressed relatively to the number of inserts