Skip to main content

A risk prediction model based on machine learning algorithm for parastomal hernia after permanent colostomy

Abstract

Objective

To develop a machine learning-based risk prediction model for postoperative parastomal hernia (PSH) in colorectal cancer patients undergoing permanent colostomy, assisting nurses in identifying high-risk groups and devising preventive care strategies.

Methods

A case-control study was conducted on 495 colorectal cancer patients who underwent permanent colostomy at the Second Affiliated Hospital of Anhui Medical University from June 2017 to June 2023, with a 1-year follow-up period. Patients were categorized into PSH and non-PSH groups based on PSH occurrence within 1-year post-operation. Data were split into training (70%) and testing (30%) sets. Variable selection was performed using Least Absolute Shrinkage and Selection Operator (LASSO) regression, and binary classification prediction models were established using Logistic Regression (LR), Support Vector Classification (SVC), K Nearest Neighbor (KNN), Random Forest (RF), Light Gradient Boosting Machine (LGBM), and Extreme Gradient Boosting (XgBoost). The binary classification label denoted 1 for PSH occurrence and 0 for no PSH occurrence. Parameters were optimized via 5-fold cross-validation. Model performance was evaluated using Area Under Curve (AUC), specificity, sensitivity, accuracy, positive predictive value, negative predictive value, and F1-score. Clinical utility was evaluated using decision curve analysis (DCA), model explanation was enhanced using shapley additive explanation (SHAP), and model visualization was achieved using a nomogram.

Results

The incidence of PSH within 1 year was 29.1% (144 patients). Among the models tested, the RF model demonstrated the highest discrimination capability with an AUC of 0.888 (95% CI: 0.881–0.935), along with superior specificity, accuracy, sensitivity, and F1 score. It also showed the highest clinical net benefit on the DCA curve. SHAP analysis identified the top 10 influential variables associated with PSH risk: body mass index (BMI), operation duration, history and status of chronic obstructive pulmonary disease (COPD), prealbumin, tumor node metastasis (TNM) staging, stoma site, thickness of rectus abdominis muscle (TRAM), C-reactive protein CRP, american society of anesthesiologists physical status classification (ASA), and stoma diameter. These insights from SHAP plots illustrated how these factors influence individual PSH outcomes. The nomogram was used for model visualization.

Conclusion

The Random Forest model demonstrated robust predictive performance and clinical relevance in forecasting colonic PSH. This model aids in early identification of high-risk patients and guides preventive care.

Peer Review reports

Background

Colorectal cancer (CRC) significantly impacts both survival and quality of life worldwide. According to the International Agency for Research on Cancer (IARC), CRC ranks third among all cancers in terms of incidence, with approximately 1.8 million new cases annually, and is the second leading cause of cancer-related deaths [1, 2]. About 60% of CRC patients undergo permanent colostomy (PC) surgery [3]. Parastomal hernia (PSH) is a common complication where intra-abdominal tissue protrudes around the stoma site, occurring in about 30% of cases within 1 year post-surgery. This incidence increases to approximately 42% three years after operation [4]. PSH can lead to various issues such as abdominal pain and distension, fecal water leakage, and skin ulceration. These symptoms not only reduce social engagement but also increase economic burdens and cause significant physical and mental pains. Moreover, around 15% of patients with PSH may develop serious complications like incarceration, intestinal obstruction, or intestinal perforation [5]. Surgical repair of PSH remains challenging with current methods like Keyhole, Sugarbaker, and Sandwich, which still see recurrence rates exceeding 30% [6].

Colostomy maintenance is predominantly overseen by nurses, emphasizing preventive nursing practices over reactionary treatments or surgical interventions, which aligns with the high-quality nursing principles [7]. Early identification of high-risk patients is crucial for implementing effective preventive care strategies, thereby enhancing stoma care quality. At present, there is a deficiency in effective prediction tools or scoring sheets for guiding nursing practice in managing PSH. The pathogenic mechanism of PSH is complex, involving factors such as disease status, stoma, and operation conditions. It involves multiple pathophysiological mechanisms and the predictive efficacy by a single index is insufficient. Therefore, constructing a prediction model using multi-dimensional indicators is more aligned with the needs of stoma care. Lopez-Cano M et al. conducted a meta-analysis to evaluate the effectiveness of prophylactic mesh in preventing post-colostomy PSH, highlighting both its potential benefits and risks [8]. They employed a rigorous statistical method to aggregate data from multiple studies, finding that while prophylactic meshes significantly reduced the incidence of PSH, there was also an increased risk of mesh-related complications. Donahue TF et al. investigated risk factors for parastomal hernias following radical cystectomy to help physicians better manage this complication [9]. Using a retrospective cohort study design, they identified several key risk factors such as age, BMI, and stomal location, which could inform targeted preventive strategies. Funahashi K et al. analyzed risk factors for PSH in Japanese patients undergoing permanent colostomy, focusing on identifying effective preventive measures [10]. Their case-control study revealed that patient-specific factors like abdominal adiposity and ostomy-related factors such as the size of the fascial opening were critical in determining PSH risk. Liu H et al. developed a nomogram model for predicting PSH risk, offering clinicians a valuable assessment tool [11]. This model, based on machine learning techniques, demonstrated superior predictive accuracy compared to traditional risk assessment methods and has the potential to be further enhanced with more extensive data.

Despite these advancements, there remains a compelling case for the continued application of machine learning (ML) approaches in predicting and managing PSH. The existing literature has shown promising results with ML models, particularly in enhancing predictive accuracy and personalizing patient care. However, limitations such as dataset size and model generalizability suggest that further exploration with more extensive and diverse datasets could lead to even more accurate and robust predictions. Moreover, the incorporation of recent ML innovations might unlock new pathways for preventing and managing PSH, underlining the potential value of continued research in this domain. ML is an advanced artificial intelligence technique designed for data processing. Unlike traditional statistical methods, ML excels in handling complex datasets associated with intricate disease mechanisms, incorporating numerous candidate parameters. These datasets often exhibit challenges like multicollinearity, interaction effects, nonlinear parameter relationships and outcome events, and intermediary effects with outcome events [12]. Methodologically, ML aligns with the characteristics of PSH and possesses a certain level of sophistication. At present, the application of ML algorithms remains relatively limited in nursing research, especially as a primary method for studying risk prediction of PSH after permanent colostomy surgery, with scarce existing literature indicating potential for major innovations.

Therefore, this study aims to utilize perioperative clinical indicators as a basis, employ ML algorithms for data screening, and establish an effective multidimensional model for predicting PSH risk. This endeavor seeks to enhance the quality and efficiency of ostomy care, provide early identification of high-risk patients, and support the development of preventive care strategies for timely intervention.

Materials and methods

General information

This retrospective case-control analysis included 495 patients with permanent colostomies who underwent surgical treatment at the Second Affiliated Hospital of Anhui Medical University from June 2017 to June 2023, with a one-year follow-up period. The latest patient surgery inclusion was in May 2022. All participants had complete one-year follow-up records at the colostomy outpatient clinic (Fig. 1). This study received approval from the Ethics Committee of the Second Affiliated Hospital of Anhui Medical University (YX2023-003).

Fig. 1
figure 1

Flowchart of case data collection

Inclusion and exclusion criteria

Inclusion criteria: Patients who were diagnosed with colorectal cancer for the first time and underwent permanent colostomy surgery; Patients with no prior history of colostomy surgery; Age ≥ 18 years.

Exclusion criteria: Incomplete baseline data; Lack of complete one-year follow-up data post-surgery; Subsequent ileostomy or colostomy revision surgery unrelated to specified reasons; Presence of severe infection, septic shock, severe postoperative bleeding or hemorrhagic shock, severe postoperative heart failure or cardiogenic shock, active phases of hematological and rheumatic autoimmune diseases during the postoperative period.

Sample size estimation

In this study, we aimed to construct a nomogram using the top 10 ranked variables in the model. According to literature on PSH incidence [4], a minimum of 334 patients must be included to adhere to the Events Per Variable (EPV) principle for predictive model studies [13]. Ultimately, our study included a total of 495 cases, meeting the required sample size criteria.

Diagnostic methods

The diagnosis of CRC diagnosis was based on the 2021 guidelines from the Chinese Society of Clinical Oncology (CSCO) for the diagnosis and treatment of CRC. tumor node metastasis (TNM) staging was assessed according to the eighth edition of the American Joint Committee on Cancer (AJCC) staging system. PSH was diagnosed by a full-time Enterostomal Therapist (ET) following the 2017 European Society of Hernia guidelines, using physical examination (Valsalva maneuver) combined with abdominal computed tomography (CT).

Included variables

In this study, data collection took place around the perioperative period, and a total of 48 clinical indicators were gathered from patients through literature review, expert guidance, and other sources. These include: Demographic and medical history data: gender, age, body mass index (BMI), smoking history, drinking history, hypertension history, diabetes history, chronic obstructive pulmonary disease (COPD) history, heart failure (HF) history, abdominal surgery history, and constipation history; Clinical data: cause of disease, subcutaneous fat thickness (SFT), thickness of rectus abdominis muscle (TRAM), long-term hormone use, postoperative blood transfusion, American Society of Anesthesiologists physical status classification (ASA), nutritional risk screening 2002 (NRS2002), Barthel Index (BI), neoadjuvant chemotherapy (NACT), tumor node metastasis (TNM) staging, and postoperative incision infection; Operation and stoma characteristics: timing of operation, operation method, operation type, operation duration, preoperative stoma localization (PSL), stoma route, stoma site, stoma diameter, and stoma venting time; Auxiliary examination data (before surgery): C-reactive protein (CRP), hemoglobin (HB), alanine transaminase (ALT), aspartate aminotransferase (AST), total bilirubin (TBIL), creatinine (Cr), blood urea nitrogen (BUN), prealbumin (PAB), albumin (ALB), glycosylated hemoglobin (HBA1c), creatine kinase (CK), total cholesterol (TC), triglyceride (TG), uric acid (UA), low-density lipoprotein cholesterol (LDL-C), carcinoembryonic antigen (CEA), and carbohydrate antigen 199 (CA-199).

Data processing

(1) Since no variable had more than 25% missing values after inclusion andexclusion, all 48 variables were retained. The remaining missing values (limited to continuous variables) were imputed using multiple imputation (MI). (2) Based on follow-up records, the included cases were categorized into the PSH group and the Non-PSH group depending on whether PSH occurred within 1 year after the operation. (3) Ordinal variables were ranked in order, while binary and multivariate variables were assigned based on the current situation [14, 15] (Tables 1 and 2). (4) The dataset was split into a training set and a test set using a 70:30 ratio, with the training set utilized for constructing different ML algorithm models. (5) The training set variables underwent initial screening to exclude collinearity using a 10-fold cross-validation with the Least Absolute Selection and Shrinkage Operator (LASSO). (6) Utilizing the training dataset, predictive models for PSH were developed employing four machine learning algorithms: Logistic Regression (LR), Support Vector Classification (SVC), K-Nearest Neighbors (KNN), Random Forest (RF), Light Gradient Boosting Machine (LGBM), and Extreme Gradient Boosting (XgBoost). Hyperparameter optimization of the models was conducted using 5-fold cross-validation. (7) The test set was utilized for model validation, and the optimal algorithm model was selected to evaluate the predictive efficacy using metrics such as Area Under Curve (AUC), specificity (SPE), sensitivity (SEN), accuracy (ACC), positive predictive value (PPV), negative predictive value (NPV) and F1-score. Clinical applicability was evaluated using decision curve analysis (DCA). (8) Shapley additive explanation (SHAP) was used for explanatory analysis of model variables and attribution analysis of individual-level effects. (9) Using the training set, a nomogram was constructed featuring the top 10 predictor variables from the model as evaluation metrics. This nomogram served as a tool for quantitatively visualize the risk of PSH.

Table 1 Assignment table of dichotomous and multivariate variables
Table 2 Assignment of ordinal variables

Statistical analysis

Descriptive statistics were performed using SPSS 25.0. Continuous variables, which were not normally distributed, are presented as median (quartile). The Mann-Whitney U test was utilized for their comparison. Categorical variables are represented as counts (percentages %) and were compared using the chi-square test. A two-tailed P-value of less than 0.05 was considered statistically significant. Multiple data imputation was performed using SPSS 25.0, followed by LASSO regression analysis using the Glmnet package in the R. ML model establishment, ROC curve plotting, and DCA curve plotting were carried out using Python 3.10, and the Scikit-learn package (https://scikit-learn.org). Variable importance and ranking in the predictive model were quantified using SHAP. The direction of impact of variables on the outcome event was determined based on SHAP values, and individual-level visual explanations were generated. A nomogram was created using the RMS package in R.

Results

Screening of variables

495 patients were ultimately included in the study, with a total of 48 variables, and no variables were excluded, leaving 48 variables in total. Among them, 144 (29.1%) developed PSH within one year after the operation. The training dataset underwent preliminary variable selection using LASSO regression with 10-fold cross-validation. When λ equaled 0.0171, 23 optimal predictive variables were identified. These variables are BMI, TNM staging, ASA classification, history of diabetes, history and status of COPD, history of constipation, NRS2002, operation duration, stoma route, stoma site, stoma diameter, stoma venting time, timing of operation, TBIL, CRP, TC, TG, HBA1c, LDL-C, SFT, TRAM, ALB, and PAB (Fig. 2A and B).

Fig. 2
figure 2

A) LASSO regression coefficient plot. B) LASSO regression subset selection plot

Model performance comparison

The RF prediction model demonstrated superior performance across various metrics in the study, achieving an AUC of 0.888 (95% CI: 0.881–0.935), SEN of 0.851, SPE of 0.804, ACC of 0.819, PPV of 0.921, NPV of 0.667, and F1-score of 0.813. In a comprehensive analysis, it could be concluded that the RF model exhibited the best predictive performance (Fig. 3; Table 3). DCA further supported the RF model’s efficacy by demonstrating the highest net benefit across different risk probability thresholds. Specifically, the RF model demonstrated the highest net benefit within the threshold probability range of 0.30 to 0.95 (Fig. 4). Therefore, RF was selected as the optimal algorithm for constructing the PSH prediction model.

Fig. 3
figure 3

ROC curves for LR, KNN, RF, SVC, LGBM, and XgBoost models

Table 3 Performance parameters of different machine learning algorithms for constructing predictive models
Fig. 4
figure 4

DCA curves for LR, KNN, RF, SVC, LGBM, and XgBoost models

Random forest algorithm setup

A total of 100 decision trees were employed to construct the Random Forest (RF) classifier model. Information entropy was utilized as the criterion for splitting, and to prevent overfitting, a maximum tree depth of 7 was imposed. Ensuring a balance between model complexity and performance, each tree required a minimum of 2 samples per node to undergo splitting.

Interpretation of prediction model

The SHAP bar plot, based on mean SHAP mean values, displayed the average impact magnitude of variables on the model output, ranked in descending order (Fig. 5). The top 10 variables influencing the prediction of PSH included BMI, operation duration, history and status of COPD, PAB, TNM staging, stoma site, TRAM, CRP, ASA classification, and stoma diameter. The SHAP waterfall plot, in conjunction with assignment situations, further illustrated the positive or negative associations of variables with the occurrence of PSH. It indicated that BMI, history and status of COPD, TNM staging, CRP level, and ASA classification are positively correlated with PSH. Conversely, operation duration ≤ 4 h, PAB, stoma site (through rectus abdominis muscle), TRAM measurements, and stoma diameter ≤ 30 mm are negatively correlated with PSH (Fig. 6). The SHAP Explanatory plot, employing two patient cases, provided detailed insights into how the RF prediction model analyzed individual-level predictive factors for occurrences and non-occurrences of PSH (Fig. 7A and B). In patients who experienced PSH, significant contributing factors included ALB levels at 27.28 g/L, BMI indicating overweight, CRP levels at 6.92 mg/L, TRAM measurements at 1.35 cm, PAB levels at 248.7 mg/L, and TNM stage II. Protective factors were the absence of COPD history and operation duration ≤ 4 h. In patients who did not experience PSH, protective factors included operation duration ≤ 4 h, no history of COPD, TRAM measurements at 1.01 cm, PAB levels at 231.3 mg/L, CRP levels at 5.4 mg/L, stoma site through the rectus abdominis muscle, and a healthy ASA classification. Factors promoting PSH included not being overweight in terms of BMI, stoma diameter > 30 mm, transperitoneal stoma route, and TNM stage II.

Fig. 5
figure 5

SHAP bar plot for variable ranking in the random forest model

Fig. 6
figure 6

SHAP waterfall plot for variables in the random forest model

Fig. 7
figure 7

(A) PSH patients - single-sample SHAP explanatory plot. (B) Non-PSH patients - single-sample SHAP explanatory plot

Predictive model presentation

The top 10 variables were assessed, and the predictive model was quantified using a nomagram to evaluate the risk of PSH in colostomy patients (Fig. 8). Its interpretation is as follows: For a given patient, a vertical line was drawn upwards from the horizontal axis corresponding to each variable, indicating a specific score on the horizontal axis. These scores from all variables were then summed to derive a total score. Subsequently, a vertical line was drawn downwards from this total score to the horizontal axis labeled “Risk Probability,” representing the predicted risk for PSH for that patient.

Fig. 8
figure 8

The nomogram of the PSH risk prediction model

The nomogram included the following variables: BMI, operation duration, COPD history and status, PAB, TNM staging, stoma site, TRAM, CRP, ASA classification, and stoma diameter. Firstly, you can get a specific value for each variable. BMI: 2. Operation duration: 0.5 h. History of COPD. PAB: 300. TNM stage: 2. Stoma position: 0. TRAM: (1) CRP: 5. ASA score: (2) Stoma diameter: 1. Secondly, according to the value of each variable, the integral is found on the corresponding axis. BMI = 2 corresponds to about 25 points. Operation time = 0.5 h corresponds to about 8 points. Have no history of COPD = 0 points. PAB = 300 corresponds to about 15 points. TNM stage = 2 corresponds to about 20 points. Stoma site = 0 is 20 points. TRAM = 1 corresponds to about 15 points. CRP = 5 corresponds to about 24 points. ASA classification = 2 corresponds to about 15 points. stoma diameter = 1 corresponds to 0 points. Then, you add up all the integrals and you get a total of 100. Based on a total score of 142, you find it on the “total score” axis, then corresponding down to the risk probability axis. For example, it might correspond to 0.6, indicating a 60% risk probability.

Discussion

In this study, we analyzed perioperative clinical data from 495 patients with permanent colostomy. Among them, 144 patients developed PSH within 1 year after the operation, resulting in an incidence of 29.1%, which was consistent with previous literature reports. Subsequently, LASSO regression and six machine learning algorithms were employed to analyze the variables. Ultimately, it was determined that the RF-based PSH prediction model exhibited the best performance in terms of predictive efficiency and clinical applicability. This model could effectively identify high-risk PSH patients early, providing significant clinical benefits. SHAP is used as a post-hoc explanation and analysis method for black-box models. It provides an intuitive way to assess the contribution and direction of the impact of variables on outcome events [16]. According to SHAP, the top 10 most important variables influencing PSH included history of COPD, SFT, ALB, BMI, PAB, CRP, stoma diameter, stoma route, TRAM, and operation duration. SHAP waterfall plots demonstrated the positive or negative relationship between these variables and the occurrence of PSH. From a disease perspective, these indicators are theoretically linked to PSH and align with the underlying pathophysiological mechanisms. They are not merely results of statistical analysis but encompass multiple dimensions, including stoma characteristics, nutritional status, medical history, and surgical factors. Together, they significantly enhance predictive accuracy. Moreover, these indicators are easily obtainable in routine nursing practices and do not require special methods or tools, thereby ensuring the clinical applicability of the model. Importantly, some of these indicators can be modified through nursing interventions, enabling the guidance of clinical preventive care plans.

From a methodological perspective, although Logistic Regression remains prominent in predictive modeling research as a traditional method for multifactor analysis, it faces limitations such as suboptimal accuracy, susceptibility to underfitting, and the requirement for linear relationships in data, along with limited capability to handle collinear data [17]. In contrast, machine learning, as a representative technology of artificial intelligence, excels at managing large datasets with complex relationships like collinearity, interactions, and mediation effects. It can effectively screen significant variables from numerous variables to construct more robust and efficient predictive models [18]. Among them, RF stands out as a supervised machine learning algorithm that integrates decision trees with ensemble learning techniques. It achieves higher accuracy by combining predictions from multiple decision trees, each trained on randomly sampled data and variables. This approach effectively mitigates overfitting and enhances classification performance [19, 20]. This study employed machine learning algorithms for data analysis, which is methodologically more advanced compared to traditional statistical methods. Osborne et al. developed a PSH risk assessment scale, which collected data through questionnaire surveys. However, it is essential to acknowledge potential limitations in data authenticity and objectivity [21]. In this study, all included variables were clinical objective indicators, to some extent, mitigating model underfitting due to information bias.

From the analysis of the pathogenic mechanism of PSH, its primary causes include the extent of abdominal wall defect and the intensity of intra-abdominal pressure. In this study, SHAP plots were utilized to rank and display the variables of the RF model based on their contribution levels. Additionally, the relationship between each variable and the outcome event was clarified. Regarding physical and medical history, BMI serves as an indicator used to measure the degree of obesity. Obese patients often exhibit weakened abdominal wall muscles and fascial tissues, resulting in insufficient abdominal wall strength. Subcutaneous fat will increase the longitudinal tension of the abdominal wall, which may result in suture ischemia, liquefaction, and delayed healing. Furthermore, excessive visceral fat deposition can elevate intra-abdominal pressure. Obese patients are prone to fat liquefaction, edema, necrosis secondary infection, and other adverse conditions in the surgical incision. Several studies have demonstrated a corresponding increased risk of postoperative PSH in patients with BMI > 25 kg/m2 [22, 23]. COPD is a respiratory condition characterized by recurrent coughing and wheezing. Frequent coughing and vigorous expectoration can significantly elevate intra-abdominal pressure. Furthermore, COPD, being a catabolic disease often accompanied by malnutrition, resulting in weakened abdominal wall muscles and fascial tissues, which makes surgical incisions less likely to heal. Liu et al. have confirmed that COPD independently increases the risk of PSH [24]. TRAM is directly correlated with PSH occurrence. Thicker rectus abdominis muscles can effectively resist tension around the incision site, thus reducing the extent of abdominal wall defect, buffering intra-abdominal pressure, and preventing herniation of abdominal contents [25].

In terms of disease status, TNM staging of CRC serves as a criterion for evaluating the degree of tumor infiltration and metastasis. Higher TNM stages indicate greater tumor invasion of the intestinal wall and surrounding lymph nodes. Patients with high TNM stage often suffer from severe malnutrition and weakened rectus abdominis due to factors such as intestinal absorption dysfunction, insufficient intake, and cancer cell consumption [26, 27]. Additionally, these patients frequently require more extensive surgical resections and prolonged postoperative chemotherapy. Consequently, patients with high TNM staging of stoma are more susceptible to PSH complications. ASA classification is an indicator used to evaluate the physical condition of CRC patients from the perspective of anesthesia safety. A higher ASA classification denotes poorer nutritional status, reduced activity ability, and other compromised general conditions, as well as an increased likelihood of combined organ dysfunction. Patients with high ASA classifications have diminished tissue repair capabilities, higher rates of stoma infection, poor healing of abdominal wall incisions, and decreased abdominal wall strength, all of which can increase the incidence of PSH [28, 29].

In terms of surgery and stoma conditions, the duration of the operation can reflect the complexity of the procedure, including factors such as intraoperative adhesions, complex tumor structures, and challenges in hemostasis. These complexities can lead to postoperative complications like wound infections and deteriorating postoperative conditions. Prolonged laparoscopic insufflation can damage the abdominal wall, causing aseptic inflammation. Additionally, since stoma creation typically occurs at the end of surgery, extended operation duration can lead to surgeon fatigue, potentially affecting the quality of the stoma. A prospective cohort study by Pennings et al. indicated a significant increase in PSH occurrence within 1 year after surgery for patients whose operation duration exceeded 395 min [5]. The abdominal wall stoma site is usually chosen beside or through the rectus abdominis muscle. Due to the thickness of the rectus abdominis muscle and its easy fixation with the intestinal tract, it can increase the strength of the abdominal wall adjacent to the stoma. Shiraishi et al. demonstrated that the incidence of PSH through rectus abdominis muscle stoma is lower than that beside rectus abdominis muscle stoma [30]. The stoma diameter not only represents the size of the abdominal wall defect, but its enlargement also indicates that the elastic retraction force of the tissue or scar around the stoma is compromised, preventing the maintenance of normal abdominal wall tension. Studies have shown that increased stoma diameter is positively associated with the incidence of PSH [15, 31, 32]. Current guidelines or consensus do not specify an optimal diameter due to the need for patient individualization [4]. However, excessively small stomas can easily lead to postoperative intestinal obstruction and intestinal ischemic necrosis.

In terms of blood test indicators, PAB reflects a patient’s nutritional status. Malnutrition can lead to periwound edema and poor healing, thereby increasing the risk of PSH [9]. Additionally, malnutrition may impair incision granulation tissue formation and weaken muscle fibers around the incision by reducing collagen production [33]. CRP, as an inflammatory marker, indicates elevated levels that may correlate with conditions such as perioperative infections and organ dysfunction. Increased inflammation not only hinders wound recovery but also exacerbates risks associated with colorectal tumor-related intestinal infections and intestinal obstructions, which elevate intra-abdominal pressure and complicate wound healing. Meta-analysis results by Niu et al. highlighted that persistent postoperative inflammation is a risk factor for PSH occurrence [34].

From a preventive nursing perspective, developing scientifically sound and effective nursing measures is crucial in mitigating PSH risks. In this study, SHAP plots were utilized with the RF model to analyze variables and provide individualized presentations. This approach facilitates the development of personalized care plans. Among the top 10 variables with significant impact, BMI, PAB, history and status of COPD, CRP, and TRAM were modifiable through nursing interventions. Additionally, their overall impact surpassed that of the remaining factors. Strengthening preoperative respiratory exercises, advising smoking cessation, enhancing postoperative airway care, performing back patting during position changes, and providing appropriate oxygen therapy can effectively decrease cough frequency, alleviate severity, and enhance expectoration in COPD patients. Implementing scientifically designed exercise plans during the perioperative period and after discharge, alongside appropriate dietary interventions, promotes reduction in body fat, optimal weight management, enhanced nutritional status, and increased rectus abdominis muscle strength. Enhancing preoperative psychological care can reduce patient anxiety and tension, thereby potentially reducing inflammatory responses in the body.

Conclusion

This study developed a predictive model for PSH using machine learning algorithms, elucidating the relevant promoting or protective factors of PSH and their respective impact levels. This approach facilitates early identification of high-risk patients and guides the formulation of preventive care strategies for PSH. However, this study is retrospective and sourced from a single center, thus limiting the strength of its findings and necessitating further prospective validation. Currently, the application of machine learning algorithms in PSH risk prediction models and other nursing research remains exploratory. Nevertheless, within the context of advancing nursing information technology and strengthening interdisciplinary collaboration between nursing and engineering, machine learning presents new opportunities for nursing research in the era of big data, warranting further investigation.

Data availability

The datasets generated and/or analyzed during the current study are not publicly available due to patient privacy concerns. However, they can be obtained from the corresponding author upon reasonable request.

Abbreviations

CRC:

Colorectal cancer

IARC:

International Agency for Research on Cancer

PC:

Permanent colostomy

PSH:

Parastomal hernia

ML:

Machine learning

EPV:

Events Per Variable

CSCO:

Chinese Society of Clinical Oncology

TNM:

Tumor node metastasis

AJCC:

American Joint Committee on Cancer

ET:

Enterostomal Therapist

CT:

Computed tomography

BMI:

Body mass index

COPD:

Chronic obstructive pulmonary disease

GOLD:

Global initiative for chronic obstructive lung disease

FEV1:

Forced expiratory volume in the first second

HF:

Heart failure

SFT:

Subcutaneous fat thickness

TRAM:

Thickness of rectus abdominis muscle

ASA:

Anesthesiologists physical status classification

NRS2002:

Nutritional risk screening 2002

BI:

Barthel index

NACT:

Neoadjuvant chemotherapy

PSL:

Preoperative stoma localization

CRP:

C-reactive protein

HB:

Hemoglobin

ALT:

Alanine transaminase

AST:

Aspartate aminotransferase

TBIL:

Total bilirubin

Cr:

Creatinine

BUN:

Blood urea nitrogen

PAB:

Prealbumin

ALB:

Albumin

HBA1c:

Glycosylated hemoglobin

CK:

Creatine kinase

TC:

Total cholesterol

TG:

Triglyceride

UA:

Uric acid

LDL:

C-low-density lipoprotein cholesterol

CEA:

Carcinoembryonic antigen

CA:

199-carbohydrate antigen 199

MI:

Multiple imputation

LASSO:

Least absolute selection and shrinkage operator

ROC:

Receiver operating characteristic

LR:

Logistic Regression

SVC:

Support Vector Classification

KNN:

K-Nearest Neighbors

RF:

Random Forest

LGBM:

Light Gradient Boosting Machine

XgBoost:

Extreme Gradient Boosting

AUC:

Area Under Curve

SPE:

specificity

SEN:

Sensitivity

ACC:

Accuracy

CI:

Confidence interval

SHAP:

Shapley additive explanation

PPV:

Positive predictive value

NPV:

Negative predictive value

References

  1. Siegel RL, Miller KD, Goding Sauer A, Fedewa SA, Butterly LF, Anderson JC, et al. Colorectal cancer statistics, 2020. CA Cancer J Clin. 2020;70(3):145–64.

    Article  PubMed  Google Scholar 

  2. Dekker E, Tanis PJ, Vleugels JLA, Kasi PM, Wallace MB. Colorectal cancer. Lancet. 2019;394(10207):1467–80.

    Article  PubMed  Google Scholar 

  3. Schiergens TS, Hoffmann V, Schobel TN, Englert GH, Kreis ME, Thasler WE, et al. Long-term quality of life of patients with permanent end ileostomy: results of a nationwide cross-sectional survey. Dis Colon Rectum. 2017;60(1):51–60.

    Article  PubMed  Google Scholar 

  4. Antoniou SA, Agresta F, Garcia Alamino JM, Berger D, Berrevoet F, Brandsma HT, et al. European Hernia Society guidelines on prevention and treatment of parastomal hernias. Hernia. 2018;22(1):183–98.

    Article  CAS  PubMed  Google Scholar 

  5. Pennings JP, Kwee TC, Hofman S, Viddeleer AR, Furnée EJB, Van Ooijen PMA, et al. Clinical and radiologic predictors of parastomal hernia development after end colostomy. AJR Am J Roentgenol. 2021;216(1):94–103.

    Article  PubMed  Google Scholar 

  6. Hansson BM, Bleichrodt RP, De Hingh IH. Laparoscopic parastomal hernia repair using a keyhole technique results in a high recurrence rate. Surg Endosc. 2009;23(7):1456–9.

    Article  CAS  PubMed  Google Scholar 

  7. Hansson BM. Parastomal hernia: treatment and prevention 2013; where do we go from here? Colorectal Dis. 2013;15(12):1467–70.

    Article  CAS  PubMed  Google Scholar 

  8. López-Cano M, Brandsma HT, Bury K, Hansson B, Kyle-Leinhase I, Alamino JG, et al. Prophylactic mesh to prevent parastomal hernia after end colostomy: a meta-analysis and trial sequential analysis. Hernia. 2017;21(2):177–89.

    Article  PubMed  Google Scholar 

  9. Donahue TF, Bochner BH, Sfakianos JP, Kent M, Bernstein M, Hilton WM, et al. Risk factors for the development of parastomal hernia after radical cystectomy. J Urol. 2014;191(6):1708–13.

    Article  PubMed  Google Scholar 

  10. Funahashi K, Suzuki T, Nagashima Y, Matsuda S, Koike J, Shiokawa H, et al. Risk factors for parastomal hernia in Japanese patients with permanent colostomy. Surg Today. 2014;44(8):1465–9.

    Article  PubMed  Google Scholar 

  11. Liu H, Wang W, Qin C, Wang H, Qi W, Wei Y, et al. Development and validation of a nomogram prediction model for the risk of parastomal hernia. Intell Med. 2024;4(2):128–33.

    Article  Google Scholar 

  12. Lee YW, Choi JW, Shin EH. Machine learning model for predicting malaria using clinical information. Comput Biol Med. 2021;129:104151.

    Article  PubMed  Google Scholar 

  13. Riley RD, Ensor J, Snell KIE, Harrell FE Jr., Martin GP, Reitsma JB, et al. Calculating the sample size required for developing a clinical prediction model. BMJ. 2020;368:m441.

    Article  PubMed  Google Scholar 

  14. Hussein AA, Ahmed YE, May P, Ali T, Ahmad B, Raheem S, et al. Natural history and predictors of parastomal hernia after robot-assisted radical cystectomy and ileal conduit urinary diversion. J Urol. 2018;199(3):766–73.

    Article  PubMed  Google Scholar 

  15. Hong SY, Oh SY, Lee JH, Kim DY, Suh KW. Risk factors for parastomal hernia: based on radiological definition. J Korean Surg Soc. 2013;84(1):43–7.

    Article  PubMed  Google Scholar 

  16. Tseng PY, Chen YT, Wang CH, Chiu KM, Peng YS, Hsu SP, et al. Prediction of the development of acute kidney injury following cardiac surgery by machine learning. Crit Care. 2020;24(1):478.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Sufriyana H, Husnayain A, Chen YL, Kuo CY, Singh O, Yeh TY, et al. Comparison of multivariable logistic regression and other machine learning algorithms for prognostic prediction studies in pregnancy care: systematic review and meta-analysis. JMIR Med Inf. 2020;8(11):e16503.

    Article  Google Scholar 

  18. Sidey-Gibbons JaM, Sidey-Gibbons CJ. Machine learning in medicine: a practical introduction. BMC Med Res Methodol. 2019;19(1):64.

    Article  PubMed  Google Scholar 

  19. Chiofolo C, Chbat N, Ghosh E, Eshelman L, Kashani K. Automated continuous acute kidney injury prediction and surveillance: a random forest model. Mayo Clin Proc. 2019;94(5):783–92.

    Article  PubMed  Google Scholar 

  20. Lin K, Hu Y, Kong G. Predicting in-hospital mortality of patients with acute kidney injury in the ICU using random forest model. Int J Med Inf. 2019;125:55–61.

    Article  Google Scholar 

  21. Osborne W, North J, Williams J. Using a risk assessment tool for parastomal hernia prevention. Br J Nurs. 2018;27(5):15–9.

    Article  PubMed  Google Scholar 

  22. Temple B, Farley T, Popik K, Ewanyshyn C, Beyer E, Dufault B. Prevalence of parastomal hernia and factors associated with its development. J Wound Ostomy Cont Nurs. 2016;43(5):489–93.

    Article  Google Scholar 

  23. Kojima K, Nakamura T, Sato T, Matsubara Y, Naito M, Yamashita K, et al. Risk factors for parastomal hernia after abdominoperineal resection for rectal cancer. Asian J Endosc Surg. 2017;10(3):276–81.

    Article  PubMed  Google Scholar 

  24. Liu L, Zheng L, Zhang M, Hu J, Lu Y, Wang D. Incidence and risk factors for parastomal hernia with a permanent colostomy. J Surg Oncol. 2022;126(3):535–43.

    Article  CAS  PubMed  Google Scholar 

  25. Luo J, Singh D, Zhang F, Yang X, Zha X, Jiang H, et al. Comparison of the extraperitoneal and transperitoneal routes for permanent colostomy: a meta-analysis with RCTs and systematic review. World J Surg Oncol. 2022;20(1):82.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Zhang Y, Zhu Y. Development and validation of risk prediction model for Sarcopenia in patients with colorectal cancer. Front Oncol. 2023;13:1172096.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Dos Santos ALS, Santos BC, Frazão LN, Miranda AL, Fayh APT, Silva FM, et al. Validity of the GLIM criteria for the diagnosis of malnutrition in patients with colorectal cancer: a multicenter study on the diagnostic performance of different indicators of reduced muscle mass and disease severity. Nutrition. 2024;119:112324.

    Article  PubMed  Google Scholar 

  28. Hoek VT, Buettner S, Sparreboom CL, Detering R, Menon AG, Kleinrensink GJ, et al. A preoperative prediction model for anastomotic leakage after rectal cancer resection based on 13.175 patients. Eur J Surg Oncol. 2022;48(12):2495–501.

    Article  CAS  PubMed  Google Scholar 

  29. Altamimi A, Hassanain M, Nouh T, Ateeq K, Aljiffry M, Nawawi A, et al. Predictors of morbidity and mortality post emergency abdominal surgery: a national study. Saudi J Gastroenterol. 2018;24(5):282–8.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Shiraishi T, Nishizawa Y, Ikeda K, Tsukada Y, Sasaki T, Ito M. Risk factors for parastomal hernia of loop stoma and relationships with other stoma complications in laparoscopic surgery era. BMC Surg. 2020;20(1):141.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Ho KK, Economou T, Smart NJ, Daniels IR. Radiological progression of end colostomy trephine diameter and area. BJS Open. 2019;3(1):112–8.

    Article  CAS  PubMed  Google Scholar 

  32. Ambe PC. The safety of surgical technique for ileostomy and colostomy in preventing parastomal hernias: an in vitro experimental simulation study. Patient Saf Surg. 2021;15(1):26.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Krishnamurty DM, Blatnik J, Mutch M. Stoma complications. Clin Colon Rectal Surg. 2017;30(3):193–200.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Niu N, Du S, Yang D, Zhang L, Wu B, Zhi X, et al. Risk factors for the development of a parastomal hernia in patients with enterostomy: a systematic review and meta-analysis. Int J Colorectal Dis. 2022;37(3):507–19.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

The authors extend their sincere gratitude to the editor for promptly handling this paper and to the anonymous reviewers for their constructive comments that significantly contributed to the quality of this paper.

Funding

This research was supported by the 2021 Anhui Nursing Society Scientific Research Program Youth Project (AHHL a202116), and the 2020 Anhui Medical University Scientific Research Fund Youth Project (2020xkj136).

Author information

Authors and Affiliations

Authors

Contributions

TD: Research design, execution, data collection, and paper writing (original draft). MZ: research coordination, design, paper reviewing, administrative support. MB: Paper reviewing and administrative support. ZW: administrative support. JT: research design and administrative support. ZL: research design, paper editing, reviewing, and data analysis. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Zeyan Liu.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the Ethics Committee of the Second Affiliated Hospital of Anhui Medical University (YX2023-003). All methods were carried out in accordance with relevant guidelines and regulations. The researcher confirming that informed consent was obtained from all subjects and/or their legal guardian(s).

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Dai, T., Bao, M., Zhang, M. et al. A risk prediction model based on machine learning algorithm for parastomal hernia after permanent colostomy. BMC Med Inform Decis Mak 24, 224 (2024). https://doi.org/10.1186/s12911-024-02627-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12911-024-02627-8

Keywords