Skip to main content

Table 3 Distribution of records among the three CKD stages within the datasets used in our study. The number of records associated with each of the three stages is shown for each of the eight training sets as well as for the test set. Each of the training sets listed was obtained by considering the records left in the dataset while progressively truncating the early years of patient history included (the range of years covered by each set is shown in the respective column header). The rightmost column provides the number of records per stage in the test set, which was fixed to contain records gathered during 2015

From: Chronic Kidney Disease stratification using office visit records: Handling data imbalance via hierarchical meta-classification

CKD stages

Training set distribution

Test set

2007–2014

2008–2014

2009–2014

2010–2014

2011–2014

2012–2014

2013–2014

2014

2015

Stage 3

73,425

72,808

70,127

65,326

57,863

46,881

33,072

17,273

8,419

Stage 4

6,976

6,903

6,579

6,060

5,385

4,439

3,101

1,624

782

Stage 5

3,241

3,184

3,052

2,821

2,515

2,068

1,471

767

375

Total

83,642

82,895

79,758

74,207

65,763

53,388

37,644

19,664

9,576