Skip to main content

Table 7 Top four most importance variable for the eight disease categories

From: Predicting disease risks from highly imbalanced data using random forest

Disease Variable 1 Variable 2 Variable 3
1. Breast cancer Age Sex Secondary malignant Secondary malignant sddsmalignant malignant
2. Diabetes no complication Age Hypertension Hyperlipidemia
3. Diabetes with/complication Age Normal
pregnancy
Fluid-electrolyte
imbalance
4. Hypertension Age Hyperlipidemia Diabetes without compl.
5. Coronary atherosclerosis Age Hypertension Hyperlipidemia
6. Peripheral atherosclerosis Age Coronary
Atherosclerosis
Hypertension
7. Other circulatory diseases Age Dysthymia Anemia
8. Osteoporosis Age Race Hypertension