Status
|
Predicted not bad
|
Predicted bad
|
Total
|
% predicted bad
|
---|
Unknown
|
5623
|
2501
|
8124
|
30.7
|
Good
|
15975
|
901
|
16876
|
5.33
|
Bad
|
337
|
11698
|
12035
|
97.2
|
- The logistic classifier derived to identify bad clusters (not bad refers to a single individual within a cluster, bad refers to more than one individual), shown in Table 4, was applied to a further random sample of 25,000 clusters obtained after initial record linkage. These were classified into 'good' 'unknown status' and 'bad' using rules, as described in Table 4 Legend and methods. The classifier performance on this validation set is shown.