Skip to main content

Table 1 Characteristics that were used the analysis: all patients (Entire cohort) and split by training and testing datasetsa

From: Identifying clinically important COPD sub-types using data-driven approaches in primary care population based electronic health records

Covariate

Level

Entire cohort

Training cohort

Test cohort

n

 

30,961

23,275

7686

Sex (male)

n (%)

16,885 (54.54)

12,723 (54.66)

4163 (54.15)

BMI

<  18.5

1305 (4.21)

978 (4.2)

327 (4.25)

≥ 18.5, <  25

9926 (32.06)

7461 (32.06)

2465 (32.07)

≥ 25, <  30

10,358 (33.45)

7758 (33.33)

2600 (33.83)

≥ 30

9372 (30.27)

7078 (30.41)

2294 (29.85)

CRS

n (%)

590 (1.91)

445 (1.91)

145 (1.89)

Anxiety

n (%)

3123 (10.09)

2375 (10.2)

748 (9.73)

Atopy

n (%)

3809 (12.3)

2868 (12.32)

941 (12.24)

Depression

n (%)

3413 (11.02)

2605 (11.19)

808 (10.51)

Diabetes

n (%)

5001 (16.15)

3789 (16.28)

1212 (15.77)

Eosinophils > 2%

n (%)

20,363 (65.77)

15,299 (65.73)

5064 (65.89)

GERD

 

2759 (8.91)

2108 (9.06)

651 (8.47)

GOLD

1

8077 (26.09)

6017 (25.85)

2060 (26.8)

2

15,536 (50.18)

11,749 (50.48)

3787 (49.27)

3

6322 (20.42)

4730 (20.32)

1592 (20.71)

4

1026 (3.31)

779 (3.35)

247 (3.21)

Heart failure

n (%)

4685 (15.13)

3579 (15.38)

1106 (14.39)

Hypertension

n (%)

10,515 (33.96)

7906 (33.97)

2609 (33.94)

IHD

n (%)

7134 (23.04)

5379 (23.11)

1755 (22.83)

Smoking

ex

14,447 (46.66)

10,920 (46.92)

3527 (45.89)

current

16,514 (53.34)

12,355 (53.08)

4159 (54.11)

Therapy type

none

11,621 (37.53)

8775 (37.7)

2846 (37.03)

mono

4071 (13.15)

3018 (12.97)

1053 (13.7)

dual

10,261 (33.14)

7722 (33.18)

2539 (33.03)

triple

5008 (16.18)

3760 (16.15)

1248 (16.24)

  1. aBMI Body mass index, CRS Chronic rhinosinusitis, GERD Gastroesophageal reflux disease, IHD Ischaemic heart disease, GOLD Global initiative for chronic obstructive lung disease