A risk prediction model based on machine learning algorithm for parastomal hernia after permanent colostomy

Dai, Tian; Bao, Manzhen; Zhang, Miao; Wang, Zonggui; Tang, JingJing; Liu, Zeyan

doi:10.1186/s12911-024-02627-8

Research
Open access
Published: 08 August 2024

A risk prediction model based on machine learning algorithm for parastomal hernia after permanent colostomy

Tian Dai ORCID: orcid.org/0000-0003-0223-6829^1,4,
Manzhen Bao^2,4,
Miao Zhang²,
Zonggui Wang^3,4,
JingJing Tang^1,4 &
…
Zeyan Liu ORCID: orcid.org/0000-0002-8807-368X⁵

BMC Medical Informatics and Decision Making volume 24, Article number: 224 (2024) Cite this article

239 Accesses
Metrics details

Abstract

Objective

To develop a machine learning-based risk prediction model for postoperative parastomal hernia (PSH) in colorectal cancer patients undergoing permanent colostomy, assisting nurses in identifying high-risk groups and devising preventive care strategies.

Methods

A case-control study was conducted on 495 colorectal cancer patients who underwent permanent colostomy at the Second Affiliated Hospital of Anhui Medical University from June 2017 to June 2023, with a 1-year follow-up period. Patients were categorized into PSH and non-PSH groups based on PSH occurrence within 1-year post-operation. Data were split into training (70%) and testing (30%) sets. Variable selection was performed using Least Absolute Shrinkage and Selection Operator (LASSO) regression, and binary classification prediction models were established using Logistic Regression (LR), Support Vector Classification (SVC), K Nearest Neighbor (KNN), Random Forest (RF), Light Gradient Boosting Machine (LGBM), and Extreme Gradient Boosting (XgBoost). The binary classification label denoted 1 for PSH occurrence and 0 for no PSH occurrence. Parameters were optimized via 5-fold cross-validation. Model performance was evaluated using Area Under Curve (AUC), specificity, sensitivity, accuracy, positive predictive value, negative predictive value, and F1-score. Clinical utility was evaluated using decision curve analysis (DCA), model explanation was enhanced using shapley additive explanation (SHAP), and model visualization was achieved using a nomogram.

Results

The incidence of PSH within 1 year was 29.1% (144 patients). Among the models tested, the RF model demonstrated the highest discrimination capability with an AUC of 0.888 (95% CI: 0.881–0.935), along with superior specificity, accuracy, sensitivity, and F1 score. It also showed the highest clinical net benefit on the DCA curve. SHAP analysis identified the top 10 influential variables associated with PSH risk: body mass index (BMI), operation duration, history and status of chronic obstructive pulmonary disease (COPD), prealbumin, tumor node metastasis (TNM) staging, stoma site, thickness of rectus abdominis muscle (TRAM), C-reactive protein CRP, american society of anesthesiologists physical status classification (ASA), and stoma diameter. These insights from SHAP plots illustrated how these factors influence individual PSH outcomes. The nomogram was used for model visualization.

Conclusion

The Random Forest model demonstrated robust predictive performance and clinical relevance in forecasting colonic PSH. This model aids in early identification of high-risk patients and guides preventive care.

Peer Review reports

Background

Colorectal cancer (CRC) significantly impacts both survival and quality of life worldwide. According to the International Agency for Research on Cancer (IARC), CRC ranks third among all cancers in terms of incidence, with approximately 1.8 million new cases annually, and is the second leading cause of cancer-related deaths [1, 2]. About 60% of CRC patients undergo permanent colostomy (PC) surgery [3]. Parastomal hernia (PSH) is a common complication where intra-abdominal tissue protrudes around the stoma site, occurring in about 30% of cases within 1 year post-surgery. This incidence increases to approximately 42% three years after operation [4]. PSH can lead to various issues such as abdominal pain and distension, fecal water leakage, and skin ulceration. These symptoms not only reduce social engagement but also increase economic burdens and cause significant physical and mental pains. Moreover, around 15% of patients with PSH may develop serious complications like incarceration, intestinal obstruction, or intestinal perforation [5]. Surgical repair of PSH remains challenging with current methods like Keyhole, Sugarbaker, and Sandwich, which still see recurrence rates exceeding 30% [6].

Colostomy maintenance is predominantly overseen by nurses, emphasizing preventive nursing practices over reactionary treatments or surgical interventions, which aligns with the high-quality nursing principles [7]. Early identification of high-risk patients is crucial for implementing effective preventive care strategies, thereby enhancing stoma care quality. At present, there is a deficiency in effective prediction tools or scoring sheets for guiding nursing practice in managing PSH. The pathogenic mechanism of PSH is complex, involving factors such as disease status, stoma, and operation conditions. It involves multiple pathophysiological mechanisms and the predictive efficacy by a single index is insufficient. Therefore, constructing a prediction model using multi-dimensional indicators is more aligned with the needs of stoma care. Lopez-Cano M et al. conducted a meta-analysis to evaluate the effectiveness of prophylactic mesh in preventing post-colostomy PSH, highlighting both its potential benefits and risks [8]. They employed a rigorous statistical method to aggregate data from multiple studies, finding that while prophylactic meshes significantly reduced the incidence of PSH, there was also an increased risk of mesh-related complications. Donahue TF et al. investigated risk factors for parastomal hernias following radical cystectomy to help physicians better manage this complication [9]. Using a retrospective cohort study design, they identified several key risk factors such as age, BMI, and stomal location, which could inform targeted preventive strategies. Funahashi K et al. analyzed risk factors for PSH in Japanese patients undergoing permanent colostomy, focusing on identifying effective preventive measures [10]. Their case-control study revealed that patient-specific factors like abdominal adiposity and ostomy-related factors such as the size of the fascial opening were critical in determining PSH risk. Liu H et al. developed a nomogram model for predicting PSH risk, offering clinicians a valuable assessment tool [11]. This model, based on machine learning techniques, demonstrated superior predictive accuracy compared to traditional risk assessment methods and has the potential to be further enhanced with more extensive data.

Despite these advancements, there remains a compelling case for the continued application of machine learning (ML) approaches in predicting and managing PSH. The existing literature has shown promising results with ML models, particularly in enhancing predictive accuracy and personalizing patient care. However, limitations such as dataset size and model generalizability suggest that further exploration with more extensive and diverse datasets could lead to even more accurate and robust predictions. Moreover, the incorporation of recent ML innovations might unlock new pathways for preventing and managing PSH, underlining the potential value of continued research in this domain. ML is an advanced artificial intelligence technique designed for data processing. Unlike traditional statistical methods, ML excels in handling complex datasets associated with intricate disease mechanisms, incorporating numerous candidate parameters. These datasets often exhibit challenges like multicollinearity, interaction effects, nonlinear parameter relationships and outcome events, and intermediary effects with outcome events [12]. Methodologically, ML aligns with the characteristics of PSH and possesses a certain level of sophistication. At present, the application of ML algorithms remains relatively limited in nursing research, especially as a primary method for studying risk prediction of PSH after permanent colostomy surgery, with scarce existing literature indicating potential for major innovations.

Therefore, this study aims to utilize perioperative clinical indicators as a basis, employ ML algorithms for data screening, and establish an effective multidimensional model for predicting PSH risk. This endeavor seeks to enhance the quality and efficiency of ostomy care, provide early identification of high-risk patients, and support the development of preventive care strategies for timely intervention.

Materials and methods

General information

This retrospective case-control analysis included 495 patients with permanent colostomies who underwent surgical treatment at the Second Affiliated Hospital of Anhui Medical University from June 2017 to June 2023, with a one-year follow-up period. The latest patient surgery inclusion was in May 2022. All participants had complete one-year follow-up records at the colostomy outpatient clinic (Fig. 1). This study received approval from the Ethics Committee of the Second Affiliated Hospital of Anhui Medical University (YX2023-003).

Inclusion and exclusion criteria

Inclusion criteria: ①Patients who were diagnosed with colorectal cancer for the first time and underwent permanent colostomy surgery; ②Patients with no prior history of colostomy surgery; ③Age ≥ 18 years.

Exclusion criteria: ①Incomplete baseline data; ②Lack of complete one-year follow-up data post-surgery; ③Subsequent ileostomy or colostomy revision surgery unrelated to specified reasons; ④Presence of severe infection, septic shock, severe postoperative bleeding or hemorrhagic shock, severe postoperative heart failure or cardiogenic shock, active phases of hematological and rheumatic autoimmune diseases during the postoperative period.

Sample size estimation

In this study, we aimed to construct a nomogram using the top 10 ranked variables in the model. According to literature on PSH incidence [4], a minimum of 334 patients must be included to adhere to the Events Per Variable (EPV) principle for predictive model studies [13]. Ultimately, our study included a total of 495 cases, meeting the required sample size criteria.

Diagnostic methods

The diagnosis of CRC diagnosis was based on the 2021 guidelines from the Chinese Society of Clinical Oncology (CSCO) for the diagnosis and treatment of CRC. tumor node metastasis (TNM) staging was assessed according to the eighth edition of the American Joint Committee on Cancer (AJCC) staging system. PSH was diagnosed by a full-time Enterostomal Therapist (ET) following the 2017 European Society of Hernia guidelines, using physical examination (Valsalva maneuver) combined with abdominal computed tomography (CT).

Included variables

In this study, data collection took place around the perioperative period, and a total of 48 clinical indicators were gathered from patients through literature review, expert guidance, and other sources. These include: ①Demographic and medical history data: gender, age, body mass index (BMI), smoking history, drinking history, hypertension history, diabetes history, chronic obstructive pulmonary disease (COPD) history, heart failure (HF) history, abdominal surgery history, and constipation history; ②Clinical data: cause of disease, subcutaneous fat thickness (SFT), thickness of rectus abdominis muscle (TRAM), long-term hormone use, postoperative blood transfusion, American Society of Anesthesiologists physical status classification (ASA), nutritional risk screening 2002 (NRS2002), Barthel Index (BI), neoadjuvant chemotherapy (NACT), tumor node metastasis (TNM) staging, and postoperative incision infection; ③Operation and stoma characteristics: timing of operation, operation method, operation type, operation duration, preoperative stoma localization (PSL), stoma route, stoma site, stoma diameter, and stoma venting time; ④Auxiliary examination data (before surgery): C-reactive protein (CRP), hemoglobin (HB), alanine transaminase (ALT), aspartate aminotransferase (AST), total bilirubin (TBIL), creatinine (Cr), blood urea nitrogen (BUN), prealbumin (PAB), albumin (ALB), glycosylated hemoglobin (HBA1c), creatine kinase (CK), total cholesterol (TC), triglyceride (TG), uric acid (UA), low-density lipoprotein cholesterol (LDL-C), carcinoembryonic antigen (CEA), and carbohydrate antigen 199 (CA-199).

Data processing

(1) Since no variable had more than 25% missing values after inclusion andexclusion, all 48 variables were retained. The remaining missing values (limited to continuous variables) were imputed using multiple imputation (MI). (2) Based on follow-up records, the included cases were categorized into the PSH group and the Non-PSH group depending on whether PSH occurred within 1 year after the operation. (3) Ordinal variables were ranked in order, while binary and multivariate variables were assigned based on the current situation [14, 15] (Tables 1 and 2). (4) The dataset was split into a training set and a test set using a 70:30 ratio, with the training set utilized for constructing different ML algorithm models. (5) The training set variables underwent initial screening to exclude collinearity using a 10-fold cross-validation with the Least Absolute Selection and Shrinkage Operator (LASSO). (6) Utilizing the training dataset, predictive models for PSH were developed employing four machine learning algorithms: Logistic Regression (LR), Support Vector Classification (SVC), K-Nearest Neighbors (KNN), Random Forest (RF), Light Gradient Boosting Machine (LGBM), and Extreme Gradient Boosting (XgBoost). Hyperparameter optimization of the models was conducted using 5-fold cross-validation. (7) The test set was utilized for model validation, and the optimal algorithm model was selected to evaluate the predictive efficacy using metrics such as Area Under Curve (AUC), specificity (SPE), sensitivity (SEN), accuracy (ACC), positive predictive value (PPV), negative predictive value (NPV) and F1-score. Clinical applicability was evaluated using decision curve analysis (DCA). (8) Shapley additive explanation (SHAP) was used for explanatory analysis of model variables and attribution analysis of individual-level effects. (9) Using the training set, a nomogram was constructed featuring the top 10 predictor variables from the model as evaluation metrics. This nomogram served as a tool for quantitatively visualize the risk of PSH.

Table 1 Assignment table of dichotomous and multivariate variables

Full size table

Table 2 Assignment of ordinal variables

Full size table

Statistical analysis

Descriptive statistics were performed using SPSS 25.0. Continuous variables, which were not normally distributed, are presented as median (quartile). The Mann-Whitney U test was utilized for their comparison. Categorical variables are represented as counts (percentages %) and were compared using the chi-square test. A two-tailed P-value of less than 0.05 was considered statistically significant. Multiple data imputation was performed using SPSS 25.0, followed by LASSO regression analysis using the Glmnet package in the R. ML model establishment, ROC curve plotting, and DCA curve plotting were carried out using Python 3.10, and the Scikit-learn package (https://scikit-learn.org). Variable importance and ranking in the predictive model were quantified using SHAP. The direction of impact of variables on the outcome event was determined based on SHAP values, and individual-level visual explanations were generated. A nomogram was created using the RMS package in R.

Results

Screening of variables

495 patients were ultimately included in the study, with a total of 48 variables, and no variables were excluded, leaving 48 variables in total. Among them, 144 (29.1%) developed PSH within one year after the operation. The training dataset underwent preliminary variable selection using LASSO regression with 10-fold cross-validation. When λ equaled 0.0171, 23 optimal predictive variables were identified. These variables are BMI, TNM staging, ASA classification, history of diabetes, history and status of COPD, history of constipation, NRS2002, operation duration, stoma route, stoma site, stoma diameter, stoma venting time, timing of operation, TBIL, CRP, TC, TG, HBA1c, LDL-C, SFT, TRAM, ALB, and PAB (Fig. 2A and B).

Model performance comparison

The RF prediction model demonstrated superior performance across various metrics in the study, achieving an AUC of 0.888 (95% CI: 0.881–0.935), SEN of 0.851, SPE of 0.804, ACC of 0.819, PPV of 0.921, NPV of 0.667, and F1-score of 0.813. In a comprehensive analysis, it could be concluded that the RF model exhibited the best predictive performance (Fig. 3; Table 3). DCA further supported the RF model’s efficacy by demonstrating the highest net benefit across different risk probability thresholds. Specifically, the RF model demonstrated the highest net benefit within the threshold probability range of 0.30 to 0.95 (Fig. 4). Therefore, RF was selected as the optimal algorithm for constructing the PSH prediction model.

Table 3 Performance parameters of different machine learning algorithms for constructing predictive models

Full size table

Random forest algorithm setup

A total of 100 decision trees were employed to construct the Random Forest (RF) classifier model. Information entropy was utilized as the criterion for splitting, and to prevent overfitting, a maximum tree depth of 7 was imposed. Ensuring a balance between model complexity and performance, each tree required a minimum of 2 samples per node to undergo splitting.

Interpretation of prediction model

①The SHAP bar plot, based on mean SHAP mean values, displayed the average impact magnitude of variables on the model output, ranked in descending order (Fig. 5). The top 10 variables influencing the prediction of PSH included BMI, operation duration, history and status of COPD, PAB, TNM staging, stoma site, TRAM, CRP, ASA classification, and stoma diameter. ②The SHAP waterfall plot, in conjunction with assignment situations, further illustrated the positive or negative associations of variables with the occurrence of PSH. It indicated that BMI, history and status of COPD, TNM staging, CRP level, and ASA classification are positively correlated with PSH. Conversely, operation duration ≤ 4 h, PAB, stoma site (through rectus abdominis muscle), TRAM measurements, and stoma diameter ≤ 30 mm are negatively correlated with PSH (Fig. 6). ③The SHAP Explanatory plot, employing two patient cases, provided detailed insights into how the RF prediction model analyzed individual-level predictive factors for occurrences and non-occurrences of PSH (Fig. 7A and B). In patients who experienced PSH, significant contributing factors included ALB levels at 27.28 g/L, BMI indicating overweight, CRP levels at 6.92 mg/L, TRAM measurements at 1.35 cm, PAB levels at 248.7 mg/L, and TNM stage II. Protective factors were the absence of COPD history and operation duration ≤ 4 h. In patients who did not experience PSH, protective factors included operation duration ≤ 4 h, no history of COPD, TRAM measurements at 1.01 cm, PAB levels at 231.3 mg/L, CRP levels at 5.4 mg/L, stoma site through the rectus abdominis muscle, and a healthy ASA classification. Factors promoting PSH included not being overweight in terms of BMI, stoma diameter > 30 mm, transperitoneal stoma route, and TNM stage II.

Predictive model presentation

The top 10 variables were assessed, and the predictive model was quantified using a nomagram to evaluate the risk of PSH in colostomy patients (Fig. 8). Its interpretation is as follows: For a given patient, a vertical line was drawn upwards from the horizontal axis corresponding to each variable, indicating a specific score on the horizontal axis. These scores from all variables were then summed to derive a total score. Subsequently, a vertical line was drawn downwards from this total score to the horizontal axis labeled “Risk Probability,” representing the predicted risk for PSH for that patient.

The nomogram included the following variables: BMI, operation duration, COPD history and status, PAB, TNM staging, stoma site, TRAM, CRP, ASA classification, and stoma diameter. Firstly, you can get a specific value for each variable. BMI: 2. Operation duration: 0.5 h. History of COPD. PAB: 300. TNM stage: 2. Stoma position: 0. TRAM: (1) CRP: 5. ASA score: (2) Stoma diameter: 1. Secondly, according to the value of each variable, the integral is found on the corresponding axis. BMI = 2 corresponds to about 25 points. Operation time = 0.5 h corresponds to about 8 points. Have no history of COPD = 0 points. PAB = 300 corresponds to about 15 points. TNM stage = 2 corresponds to about 20 points. Stoma site = 0 is 20 points. TRAM = 1 corresponds to about 15 points. CRP = 5 corresponds to about 24 points. ASA classification = 2 corresponds to about 15 points. stoma diameter = 1 corresponds to 0 points. Then, you add up all the integrals and you get a total of 100. Based on a total score of 142, you find it on the “total score” axis, then corresponding down to the risk probability axis. For example, it might correspond to 0.6, indicating a 60% risk probability.

Discussion

In this study, we analyzed perioperative clinical data from 495 patients with permanent colostomy. Among them, 144 patients developed PSH within 1 year after the operation, resulting in an incidence of 29.1%, which was consistent with previous literature reports. Subsequently, LASSO regression and six machine learning algorithms were employed to analyze the variables. Ultimately, it was determined that the RF-based PSH prediction model exhibited the best performance in terms of predictive efficiency and clinical applicability. This model could effectively identify high-risk PSH patients early, providing significant clinical benefits. SHAP is used as a post-hoc explanation and analysis method for black-box models. It provides an intuitive way to assess the contribution and direction of the impact of variables on outcome events [16]. According to SHAP, the top 10 most important variables influencing PSH included history of COPD, SFT, ALB, BMI, PAB, CRP, stoma diameter, stoma route, TRAM, and operation duration. SHAP waterfall plots demonstrated the positive or negative relationship between these variables and the occurrence of PSH. From a disease perspective, these indicators are theoretically linked to PSH and align with the underlying pathophysiological mechanisms. They are not merely results of statistical analysis but encompass multiple dimensions, including stoma characteristics, nutritional status, medical history, and surgical factors. Together, they significantly enhance predictive accuracy. Moreover, these indicators are easily obtainable in routine nursing practices and do not require special methods or tools, thereby ensuring the clinical applicability of the model. Importantly, some of these indicators can be modified through nursing interventions, enabling the guidance of clinical preventive care plans.

From a methodological perspective, although Logistic Regression remains prominent in predictive modeling research as a traditional method for multifactor analysis, it faces limitations such as suboptimal accuracy, susceptibility to underfitting, and the requirement for linear relationships in data, along with limited capability to handle collinear data [17]. In contrast, machine learning, as a representative technology of artificial intelligence, excels at managing large datasets with complex relationships like collinearity, interactions, and mediation effects. It can effectively screen significant variables from numerous variables to construct more robust and efficient predictive models [18]. Among them, RF stands out as a supervised machine learning algorithm that integrates decision trees with ensemble learning techniques. It achieves higher accuracy by combining predictions from multiple decision trees, each trained on randomly sampled data and variables. This approach effectively mitigates overfitting and enhances classification performance [19, 20]. This study employed machine learning algorithms for data analysis, which is methodologically more advanced compared to traditional statistical methods. Osborne et al. developed a PSH risk assessment scale, which collected data through questionnaire surveys. However, it is essential to acknowledge potential limitations in data authenticity and objectivity [21]. In this study, all included variables were clinical objective indicators, to some extent, mitigating model underfitting due to information bias.

From the analysis of the pathogenic mechanism of PSH, its primary causes include the extent of abdominal wall defect and the intensity of intra-abdominal pressure. In this study, SHAP plots were utilized to rank and display the variables of the RF model based on their contribution levels. Additionally, the relationship between each variable and the outcome event was clarified. Regarding physical and medical history, BMI serves as an indicator used to measure the degree of obesity. Obese patients often exhibit weakened abdominal wall muscles and fascial tissues, resulting in insufficient abdominal wall strength. Subcutaneous fat will increase the longitudinal tension of the abdominal wall, which may result in suture ischemia, liquefaction, and delayed healing. Furthermore, excessive visceral fat deposition can elevate intra-abdominal pressure. Obese patients are prone to fat liquefaction, edema, necrosis secondary infection, and other adverse conditions in the surgical incision. Several studies have demonstrated a corresponding increased risk of postoperative PSH in patients with BMI > 25 kg/m² [22, 23]. COPD is a respiratory condition characterized by recurrent coughing and wheezing. Frequent coughing and vigorous expectoration can significantly elevate intra-abdominal pressure. Furthermore, COPD, being a catabolic disease often accompanied by malnutrition, resulting in weakened abdominal wall muscles and fascial tissues, which makes surgical incisions less likely to heal. Liu et al. have confirmed that COPD independently increases the risk of PSH [24]. TRAM is directly correlated with PSH occurrence. Thicker rectus abdominis muscles can effectively resist tension around the incision site, thus reducing the extent of abdominal wall defect, buffering intra-abdominal pressure, and preventing herniation of abdominal contents [25].

In terms of disease status, TNM staging of CRC serves as a criterion for evaluating the degree of tumor infiltration and metastasis. Higher TNM stages indicate greater tumor invasion of the intestinal wall and surrounding lymph nodes. Patients with high TNM stage often suffer from severe malnutrition and weakened rectus abdominis due to factors such as intestinal absorption dysfunction, insufficient intake, and cancer cell consumption [26, 27]. Additionally, these patients frequently require more extensive surgical resections and prolonged postoperative chemotherapy. Consequently, patients with high TNM staging of stoma are more susceptible to PSH complications. ASA classification is an indicator used to evaluate the physical condition of CRC patients from the perspective of anesthesia safety. A higher ASA classification denotes poorer nutritional status, reduced activity ability, and other compromised general conditions, as well as an increased likelihood of combined organ dysfunction. Patients with high ASA classifications have diminished tissue repair capabilities, higher rates of stoma infection, poor healing of abdominal wall incisions, and decreased abdominal wall strength, all of which can increase the incidence of PSH [28, 29].

In terms of surgery and stoma conditions, the duration of the operation can reflect the complexity of the procedure, including factors such as intraoperative adhesions, complex tumor structures, and challenges in hemostasis. These complexities can lead to postoperative complications like wound infections and deteriorating postoperative conditions. Prolonged laparoscopic insufflation can damage the abdominal wall, causing aseptic inflammation. Additionally, since stoma creation typically occurs at the end of surgery, extended operation duration can lead to surgeon fatigue, potentially affecting the quality of the stoma. A prospective cohort study by Pennings et al. indicated a significant increase in PSH occurrence within 1 year after surgery for patients whose operation duration exceeded 395 min [5]. The abdominal wall stoma site is usually chosen beside or through the rectus abdominis muscle. Due to the thickness of the rectus abdominis muscle and its easy fixation with the intestinal tract, it can increase the strength of the abdominal wall adjacent to the stoma. Shiraishi et al. demonstrated that the incidence of PSH through rectus abdominis muscle stoma is lower than that beside rectus abdominis muscle stoma [30]. The stoma diameter not only represents the size of the abdominal wall defect, but its enlargement also indicates that the elastic retraction force of the tissue or scar around the stoma is compromised, preventing the maintenance of normal abdominal wall tension. Studies have shown that increased stoma diameter is positively associated with the incidence of PSH [15, 31, 32]. Current guidelines or consensus do not specify an optimal diameter due to the need for patient individualization [4]. However, excessively small stomas can easily lead to postoperative intestinal obstruction and intestinal ischemic necrosis.

In terms of blood test indicators, PAB reflects a patient’s nutritional status. Malnutrition can lead to periwound edema and poor healing, thereby increasing the risk of PSH [9]. Additionally, malnutrition may impair incision granulation tissue formation and weaken muscle fibers around the incision by reducing collagen production [33]. CRP, as an inflammatory marker, indicates elevated levels that may correlate with conditions such as perioperative infections and organ dysfunction. Increased inflammation not only hinders wound recovery but also exacerbates risks associated with colorectal tumor-related intestinal infections and intestinal obstructions, which elevate intra-abdominal pressure and complicate wound healing. Meta-analysis results by Niu et al. highlighted that persistent postoperative inflammation is a risk factor for PSH occurrence [34].

From a preventive nursing perspective, developing scientifically sound and effective nursing measures is crucial in mitigating PSH risks. In this study, SHAP plots were utilized with the RF model to analyze variables and provide individualized presentations. This approach facilitates the development of personalized care plans. Among the top 10 variables with significant impact, BMI, PAB, history and status of COPD, CRP, and TRAM were modifiable through nursing interventions. Additionally, their overall impact surpassed that of the remaining factors. Strengthening preoperative respiratory exercises, advising smoking cessation, enhancing postoperative airway care, performing back patting during position changes, and providing appropriate oxygen therapy can effectively decrease cough frequency, alleviate severity, and enhance expectoration in COPD patients. Implementing scientifically designed exercise plans during the perioperative period and after discharge, alongside appropriate dietary interventions, promotes reduction in body fat, optimal weight management, enhanced nutritional status, and increased rectus abdominis muscle strength. Enhancing preoperative psychological care can reduce patient anxiety and tension, thereby potentially reducing inflammatory responses in the body.

Conclusion

This study developed a predictive model for PSH using machine learning algorithms, elucidating the relevant promoting or protective factors of PSH and their respective impact levels. This approach facilitates early identification of high-risk patients and guides the formulation of preventive care strategies for PSH. However, this study is retrospective and sourced from a single center, thus limiting the strength of its findings and necessitating further prospective validation. Currently, the application of machine learning algorithms in PSH risk prediction models and other nursing research remains exploratory. Nevertheless, within the context of advancing nursing information technology and strengthening interdisciplinary collaboration between nursing and engineering, machine learning presents new opportunities for nursing research in the era of big data, warranting further investigation.

Data availability

The datasets generated and/or analyzed during the current study are not publicly available due to patient privacy concerns. However, they can be obtained from the corresponding author upon reasonable request.

Abbreviations

CRC:: Colorectal cancer
IARC:: International Agency for Research on Cancer
PC:: Permanent colostomy
PSH:: Parastomal hernia
ML:: Machine learning
EPV:: Events Per Variable
CSCO:: Chinese Society of Clinical Oncology
TNM:: Tumor node metastasis
AJCC:: American Joint Committee on Cancer
ET:: Enterostomal Therapist
CT:: Computed tomography
BMI:: Body mass index
COPD:: Chronic obstructive pulmonary disease
GOLD:: Global initiative for chronic obstructive lung disease
FEV1:: Forced expiratory volume in the first second
HF:: Heart failure
SFT:: Subcutaneous fat thickness
TRAM:: Thickness of rectus abdominis muscle
ASA:: Anesthesiologists physical status classification
NRS2002:: Nutritional risk screening 2002
BI:: Barthel index
NACT:: Neoadjuvant chemotherapy
PSL:: Preoperative stoma localization
CRP:: C-reactive protein
HB:: Hemoglobin
ALT:: Alanine transaminase
AST:: Aspartate aminotransferase
TBIL:: Total bilirubin
Cr:: Creatinine
BUN:: Blood urea nitrogen
PAB:: Prealbumin
ALB:: Albumin
HBA1c:: Glycosylated hemoglobin
CK:: Creatine kinase
TC:: Total cholesterol
TG:: Triglyceride
UA:: Uric acid
LDL:: C-low-density lipoprotein cholesterol
CEA:: Carcinoembryonic antigen
CA:: 199-carbohydrate antigen 199
MI:: Multiple imputation
LASSO:: Least absolute selection and shrinkage operator
ROC:: Receiver operating characteristic
LR:: Logistic Regression
SVC:: Support Vector Classification
KNN:: K-Nearest Neighbors
RF:: Random Forest
LGBM:: Light Gradient Boosting Machine
XgBoost:: Extreme Gradient Boosting
AUC:: Area Under Curve
SPE:: specificity
SEN:: Sensitivity
ACC:: Accuracy
CI:: Confidence interval
SHAP:: Shapley additive explanation
PPV:: Positive predictive value
NPV:: Negative predictive value

References

Siegel RL, Miller KD, Goding Sauer A, Fedewa SA, Butterly LF, Anderson JC, et al. Colorectal cancer statistics, 2020. CA Cancer J Clin. 2020;70(3):145–64.
Article PubMed Google Scholar
Dekker E, Tanis PJ, Vleugels JLA, Kasi PM, Wallace MB. Colorectal cancer. Lancet. 2019;394(10207):1467–80.
Article PubMed Google Scholar
Schiergens TS, Hoffmann V, Schobel TN, Englert GH, Kreis ME, Thasler WE, et al. Long-term quality of life of patients with permanent end ileostomy: results of a nationwide cross-sectional survey. Dis Colon Rectum. 2017;60(1):51–60.
Article PubMed Google Scholar
Antoniou SA, Agresta F, Garcia Alamino JM, Berger D, Berrevoet F, Brandsma HT, et al. European Hernia Society guidelines on prevention and treatment of parastomal hernias. Hernia. 2018;22(1):183–98.
Article CAS PubMed Google Scholar
Pennings JP, Kwee TC, Hofman S, Viddeleer AR, Furnée EJB, Van Ooijen PMA, et al. Clinical and radiologic predictors of parastomal hernia development after end colostomy. AJR Am J Roentgenol. 2021;216(1):94–103.
Article PubMed Google Scholar
Hansson BM, Bleichrodt RP, De Hingh IH. Laparoscopic parastomal hernia repair using a keyhole technique results in a high recurrence rate. Surg Endosc. 2009;23(7):1456–9.
Article CAS PubMed Google Scholar
Hansson BM. Parastomal hernia: treatment and prevention 2013; where do we go from here? Colorectal Dis. 2013;15(12):1467–70.
Article CAS PubMed Google Scholar
López-Cano M, Brandsma HT, Bury K, Hansson B, Kyle-Leinhase I, Alamino JG, et al. Prophylactic mesh to prevent parastomal hernia after end colostomy: a meta-analysis and trial sequential analysis. Hernia. 2017;21(2):177–89.
Article PubMed Google Scholar
Donahue TF, Bochner BH, Sfakianos JP, Kent M, Bernstein M, Hilton WM, et al. Risk factors for the development of parastomal hernia after radical cystectomy. J Urol. 2014;191(6):1708–13.
Article PubMed Google Scholar
Funahashi K, Suzuki T, Nagashima Y, Matsuda S, Koike J, Shiokawa H, et al. Risk factors for parastomal hernia in Japanese patients with permanent colostomy. Surg Today. 2014;44(8):1465–9.
Article PubMed Google Scholar
Liu H, Wang W, Qin C, Wang H, Qi W, Wei Y, et al. Development and validation of a nomogram prediction model for the risk of parastomal hernia. Intell Med. 2024;4(2):128–33.
Article Google Scholar
Lee YW, Choi JW, Shin EH. Machine learning model for predicting malaria using clinical information. Comput Biol Med. 2021;129:104151.
Article PubMed Google Scholar
Riley RD, Ensor J, Snell KIE, Harrell FE Jr., Martin GP, Reitsma JB, et al. Calculating the sample size required for developing a clinical prediction model. BMJ. 2020;368:m441.
Article PubMed Google Scholar
Hussein AA, Ahmed YE, May P, Ali T, Ahmad B, Raheem S, et al. Natural history and predictors of parastomal hernia after robot-assisted radical cystectomy and ileal conduit urinary diversion. J Urol. 2018;199(3):766–73.
Article PubMed Google Scholar
Hong SY, Oh SY, Lee JH, Kim DY, Suh KW. Risk factors for parastomal hernia: based on radiological definition. J Korean Surg Soc. 2013;84(1):43–7.
Article PubMed Google Scholar
Tseng PY, Chen YT, Wang CH, Chiu KM, Peng YS, Hsu SP, et al. Prediction of the development of acute kidney injury following cardiac surgery by machine learning. Crit Care. 2020;24(1):478.
Article PubMed PubMed Central Google Scholar
Sufriyana H, Husnayain A, Chen YL, Kuo CY, Singh O, Yeh TY, et al. Comparison of multivariable logistic regression and other machine learning algorithms for prognostic prediction studies in pregnancy care: systematic review and meta-analysis. JMIR Med Inf. 2020;8(11):e16503.
Article Google Scholar
Sidey-Gibbons JaM, Sidey-Gibbons CJ. Machine learning in medicine: a practical introduction. BMC Med Res Methodol. 2019;19(1):64.
Article PubMed Google Scholar
Chiofolo C, Chbat N, Ghosh E, Eshelman L, Kashani K. Automated continuous acute kidney injury prediction and surveillance: a random forest model. Mayo Clin Proc. 2019;94(5):783–92.
Article PubMed Google Scholar
Lin K, Hu Y, Kong G. Predicting in-hospital mortality of patients with acute kidney injury in the ICU using random forest model. Int J Med Inf. 2019;125:55–61.
Article Google Scholar
Osborne W, North J, Williams J. Using a risk assessment tool for parastomal hernia prevention. Br J Nurs. 2018;27(5):15–9.
Article PubMed Google Scholar
Temple B, Farley T, Popik K, Ewanyshyn C, Beyer E, Dufault B. Prevalence of parastomal hernia and factors associated with its development. J Wound Ostomy Cont Nurs. 2016;43(5):489–93.
Article Google Scholar
Kojima K, Nakamura T, Sato T, Matsubara Y, Naito M, Yamashita K, et al. Risk factors for parastomal hernia after abdominoperineal resection for rectal cancer. Asian J Endosc Surg. 2017;10(3):276–81.
Article PubMed Google Scholar
Liu L, Zheng L, Zhang M, Hu J, Lu Y, Wang D. Incidence and risk factors for parastomal hernia with a permanent colostomy. J Surg Oncol. 2022;126(3):535–43.
Article CAS PubMed Google Scholar
Luo J, Singh D, Zhang F, Yang X, Zha X, Jiang H, et al. Comparison of the extraperitoneal and transperitoneal routes for permanent colostomy: a meta-analysis with RCTs and systematic review. World J Surg Oncol. 2022;20(1):82.
Article PubMed PubMed Central Google Scholar
Zhang Y, Zhu Y. Development and validation of risk prediction model for Sarcopenia in patients with colorectal cancer. Front Oncol. 2023;13:1172096.
Article CAS PubMed PubMed Central Google Scholar
Dos Santos ALS, Santos BC, Frazão LN, Miranda AL, Fayh APT, Silva FM, et al. Validity of the GLIM criteria for the diagnosis of malnutrition in patients with colorectal cancer: a multicenter study on the diagnostic performance of different indicators of reduced muscle mass and disease severity. Nutrition. 2024;119:112324.
Article PubMed Google Scholar
Hoek VT, Buettner S, Sparreboom CL, Detering R, Menon AG, Kleinrensink GJ, et al. A preoperative prediction model for anastomotic leakage after rectal cancer resection based on 13.175 patients. Eur J Surg Oncol. 2022;48(12):2495–501.
Article CAS PubMed Google Scholar
Altamimi A, Hassanain M, Nouh T, Ateeq K, Aljiffry M, Nawawi A, et al. Predictors of morbidity and mortality post emergency abdominal surgery: a national study. Saudi J Gastroenterol. 2018;24(5):282–8.
Article PubMed PubMed Central Google Scholar
Shiraishi T, Nishizawa Y, Ikeda K, Tsukada Y, Sasaki T, Ito M. Risk factors for parastomal hernia of loop stoma and relationships with other stoma complications in laparoscopic surgery era. BMC Surg. 2020;20(1):141.
Article PubMed PubMed Central Google Scholar
Ho KK, Economou T, Smart NJ, Daniels IR. Radiological progression of end colostomy trephine diameter and area. BJS Open. 2019;3(1):112–8.
Article CAS PubMed Google Scholar
Ambe PC. The safety of surgical technique for ileostomy and colostomy in preventing parastomal hernias: an in vitro experimental simulation study. Patient Saf Surg. 2021;15(1):26.
Article PubMed PubMed Central Google Scholar
Krishnamurty DM, Blatnik J, Mutch M. Stoma complications. Clin Colon Rectal Surg. 2017;30(3):193–200.
Article PubMed PubMed Central Google Scholar
Niu N, Du S, Yang D, Zhang L, Wu B, Zhi X, et al. Risk factors for the development of a parastomal hernia in patients with enterostomy: a systematic review and meta-analysis. Int J Colorectal Dis. 2022;37(3):507–19.
Article PubMed Google Scholar

Download references

Acknowledgements

The authors extend their sincere gratitude to the editor for promptly handling this paper and to the anonymous reviewers for their constructive comments that significantly contributed to the quality of this paper.

Funding

This research was supported by the 2021 Anhui Nursing Society Scientific Research Program Youth Project (AHHL a202116), and the 2020 Anhui Medical University Scientific Research Fund Youth Project (2020xkj136).

Author information

Authors and Affiliations

Department of General Surgery (Ward one), The Second Affiliated Hospital of Anhui Medical University, Hefei, Anhui, 230601, China
Tian Dai & JingJing Tang
Nursing Department, The Second Affiliated Hospital of Anhui Medical University, Hefei, Anhui, 230601, China
Manzhen Bao & Miao Zhang
Department of Orthopedics (Ward two), The Second Affiliated Hospital of Anhui Medical University, Hefei, Anhui, 230601, China
Zonggui Wang
Wound and Stoma Nursing Working Group, The Second Affiliated Hospital of Anhui Medical University, Hefei, Anhui, 230601, China
Tian Dai, Manzhen Bao, Zonggui Wang & JingJing Tang
Department of Emergency Internal Medicine, The Second Affiliated Hospital of Anhui Medical University, Hefei, Anhui, 230601, China
Zeyan Liu

Authors

Tian Dai
View author publications
You can also search for this author in PubMed Google Scholar
Manzhen Bao
View author publications
You can also search for this author in PubMed Google Scholar
Miao Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zonggui Wang
View author publications
You can also search for this author in PubMed Google Scholar
JingJing Tang
View author publications
You can also search for this author in PubMed Google Scholar
Zeyan Liu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

TD: Research design, execution, data collection, and paper writing (original draft). MZ: research coordination, design, paper reviewing, administrative support. MB: Paper reviewing and administrative support. ZW: administrative support. JT: research design and administrative support. ZL: research design, paper editing, reviewing, and data analysis. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Zeyan Liu.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the Ethics Committee of the Second Affiliated Hospital of Anhui Medical University (YX2023-003). All methods were carried out in accordance with relevant guidelines and regulations. The researcher confirming that informed consent was obtained from all subjects and/or their legal guardian(s).

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Dai, T., Bao, M., Zhang, M. et al. A risk prediction model based on machine learning algorithm for parastomal hernia after permanent colostomy. BMC Med Inform Decis Mak 24, 224 (2024). https://doi.org/10.1186/s12911-024-02627-8

Download citation

Received: 15 September 2023
Accepted: 01 August 2024
Published: 08 August 2024
DOI: https://doi.org/10.1186/s12911-024-02627-8

A risk prediction model based on machine learning algorithm for parastomal hernia after permanent colostomy

Abstract

Objective

Methods

Results

Conclusion

Background

Materials and methods

General information

Inclusion and exclusion criteria

Sample size estimation

Diagnostic methods

Included variables

Data processing

Statistical analysis

Results

Screening of variables

Model performance comparison

Random forest algorithm setup

Interpretation of prediction model

Predictive model presentation

Discussion

Conclusion

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Electronic supplementary material

Supplementary Material 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Medical Informatics and Decision Making

Contact us