Skip to main content

Table 7 Top four most importance variable for the eight disease categories

From: Predicting disease risks from highly imbalanced data using random forest

Disease

Variable 1

Variable 2

Variable 3

1. Breast cancer

Age

Sex

Secondary malignant Secondary malignant sddsmalignant malignant

2. Diabetes no complication

Age

Hypertension

Hyperlipidemia

3. Diabetes with/complication

Age

Normal

pregnancy

Fluid-electrolyte

imbalance

4. Hypertension

Age

Hyperlipidemia

Diabetes without compl.

5. Coronary atherosclerosis

Age

Hypertension

Hyperlipidemia

6. Peripheral atherosclerosis

Age

Coronary

Atherosclerosis

Hypertension

7. Other circulatory diseases

Age

Dysthymia

Anemia

8. Osteoporosis

Age

Race

Hypertension