Table 1 Comparison of CV-AUCs obtained by standard random forest (RF), random forest with under-sampling (RF_under), random forest with over-sampling (RF_over), and random forest with inverse sampling probability weights (RF_ipw), including results obtained without variable screening and results obtained with variable screening

From: Improving random forest predictions in small datasets from two-phase sampling designs

 

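| Marker set | RF (no screening) | RF_under (no screening) | RF_over (no screening) | RF_ipw (no screening) | RF (screening) | RF_under (screening) | RF_over (screening) | RF_ipw (screening) |
|---|---|---|---|---|---|---|---|---|
| All markers | 0.679 | 0.732 | 0.711 | 0.657 | 0.824 | 0.806 | 0.806 | 0.824 |
| T cell markers | 0.718 | 0.714 | 0.715 | 0.708 | 0.812 | 0.780 | 0.799 | 0.819 |
| Antibody markers | 0.605 | 0.656 | 0.628 | 0.579 | 0.708 | 0.722 | 0.696 | 0.711 |
| No markers | 0.442 | 0.452 | 0.448 | 0.443 | 0.442 | 0.452 | 0.448 | 0.443 |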

  1. Clinical covariates (age, BMI, and a risk behavior score) are always included