Skip to main content

Development of machine learning models for detection of vision threatening Behçet’s disease (BD) using Egyptian College of Rheumatology (ECR)–BD cohort

Abstract

Background

Eye lesions, occur in nearly half of patients with Behçet’s Disease (BD), can lead to irreversible damage and vision loss; however, limited studies are available on identifying risk factors for the development of vision-threatening BD (VTBD). Using an Egyptian college of rheumatology (ECR)-BD, a national cohort of BD patients, we examined the performance of machine-learning (ML) models in predicting VTBD compared to logistic regression (LR) analysis. We identified the risk factors for the development of VTBD.

Methods

Patients with complete ocular data were included. VTBD was determined by the presence of any retinal disease, optic nerve involvement, or occurrence of blindness. Various ML-models were developed and examined for VTBD prediction. The Shapley additive explanation value was used for the interpretability of the predictors.

Results

A total of 1094 BD patients [71.5% were men, mean ± SD age 36.1 ± 10 years] were included. 549 (50.2%) individuals had VTBD. Extreme Gradient Boosting was the best-performing ML model (AUROC 0.85, 95% CI 0.81, 0.90) compared with logistic regression (AUROC 0.64, 95%CI 0.58, 0.71). Higher disease activity, thrombocytosis, ever smoking, and daily steroid dose were the top factors associated with VTBD.

Conclusions

Using information obtained in the clinical settings, the Extreme Gradient Boosting identified patients at higher risk of VTBD better than the conventional statistical method. Further longitudinal studies to evaluate the clinical utility of the proposed prediction model are needed.

Peer Review reports

Background

Behçet's disease (BD) is a chronic systemic immune-mediated vasculitis of unknown cause. Major manifestations include oral and genital ulcers, skin and ocular lesions [1]. Ocular lesions, occur in nearly 48–75% of BD patients [2], are characterized by iridocyclitis, vitritis, retinitis, occlusive retinal vasculitis, and optic disc edema. Poor visual outcome as a result of irreversible ischemic damage of the retina, and optic disc, commonly leads to vision threatening complications in BD (VTBD). Despite available treatment, the rate of poor visual acuity is reported in more than one-third of BD patients [3], and about a quarter of BD patients become blind [4, 5]. According to the BD damage index (BDI), ocular domain represents the top organ contributed to the total BDI score [6]. Accurate identification of patients with BD at risk for vision threatening complications allows initiation of effective treatments to prevent ocular morbidity and save the sight.

Few studies have reported the factors associated with poor ocular outcomes in patients with BD [3, 7, 8]. Higher frequency and longer duration of ocular attacks (uveitis and retinal vasculitis) were among the risk factors for poor visual outcomes and blindness [3, 8]. On the other hand, the presence of systemic vasculitis and genital ulcer was found to be negatively associated with the development VTBD [7]. These findings were limited in their clinical utility by the inclusion of only ocular related variables, the small numbers of patients enrolled, and focusing only on a particular group of BD patients. Prior studies used conventional analysis for the prediction which lack the ability to capture complex interactions among multiple predictors which may limit their use.

To overcome conventional statistical methods’ limitations, machine learning (ML), data analysis technique that develops algorithms by “learning” from data, hold the promise to improve patient classification, predict outcome, and treatment response [9,10,11,12]. The ML-based approach was used for diagnosing BD [13], and classification of specific features in patients with BD [14, 15]. Focusing on ocular involvement, ML using multinomial logistic regression was used to determine the misclassification rate of BD-uveitis among 1012 cases of panuveitis [15]. The overall accuracy for BD-panuveitis was 96.3% and 94.0% in the training and validation set respectively.

Given the importance of identifying patients at risk for development of VTBD and no enough data exist, we aimed to develop machine learning models using Egyptian College of Rheumatology (ECR)-BD cohort to predict VTBD and to compare their performance with that of logistic regression. Then, we examined the clinical importance and directionality of each factor for predicting the VTBD using the best performing ML model.

Methods

Patients

This population data were derived from ECR-BD cohort, a national study group was created by specialized rheumatologists representing 26 specialized rheumatology centers from 15 major governates around the country from north to south during 2017–2018 [16].

The original ECR-BD database included 1526 adult BD patients (new and existing cases). Inclusion criteria were adults (age ≥ 18 years old) satisfying the diagnostic criteria published by the International Study Group for Behçet’s Disease [17] who presented to one of the included centers. Any patients with other  autoimmune diseases or vasculitis rather than BD or subjects without available ocular outcome data (N = 477) were excluded. This study was approved by the Institutional Review Board of Cairo University, Cairo, Egypt (IRB No.: 47-SReC-RCU2021) and the informed consent was obtained from all participants for the original ECR-BD study [16] and the subsequent secondary data analysis.

Variables (features) and outcome

Predictors (features)

Data were collected on a standard sheet and stored in an electronic database. Patients were subjected to full history taking, clinical examination, and skin pathergy test if required. Medications received by the patients at the time of enrollment were recorded. Disease activity was assessed using the Behçet Disease Current Activity Form (BDCAF) score [14], and laboratory markers were determined for all patients. The data used for this study were fully anonymized. All methods were carried out in accordance with relevant guideline and regulations. All variables reflect patients' status at the time of data entry.

Outcome (target) variable

Presence of VTBD (active state at the time of data entry) was diagnosed by ophthalmologists. The full ophthalmological examination included examination of anterior segment by slit lamp, and posterior segment by indirect ophthalmoscopy were conducted for all patients. Fundus fluorescein angiography was done only if needed to confirm posterior segment findings. Patients with retinal disease and/or optic nerve involvement and/or occurrence of blindness were classified as having a VTBD. While, patients with any other form of ocular disease (i.e., episcleritis, cataract, anterior uveitis) were identified as having a non-vision threatening form of the disease (non-VTBD).

Supervised ML approaches

ML analyses involved the following steps: data pre-processing, variables (features) selection, model creation, model evaluation, feature importance derivation, and interpretation of results (a positive or negative relationship of each variable with the target variable).

Data pre-processing and features selection

Data has an overall good quality as the cleaning and transformation process has been made in the primary project [16]. Then, variables of relevance were selected based on clinical expertise, data availability in the database, and literature review. Variables containing a high proportion of missing values (> 30%) were excluded from the analysis. For the included variables with some data missing (< 30%), we applied two techniques; (1) features with missing values were included in the analyses as their own without imputation, and (2) the numerical variables were filled with the mean, and the categorical variables were filled with the mode. Twenty-six variables, which were routinely and easily measured in the clinical setting, were included as inputs to the models (Table 1).

Table 1 List of features that were used to building the machine learning algorithms

Data partition (train and test data)

The total dataset contained 1049 subjects. Prior to training of the algorithms, the data were first split into a training (80%, N = 840) and a test set (20%, N = 209) using a random split. Each model was trained using the training set and evaluated on the test set (i.e., patients not previously seen by the model).

ML models development

Various ML methods, including extreme gradient boosting (XGBoost) [18], extra Tree Classifier, random forest (RF) [19], support vector machine (SVM), artificial neural networks (ANNs), and multi-layer perceptron (MLP) were applied to classify patients into categories associated with having VTBD. These models are commonly used for binary classification problems in medicine, and to ensure that best possible ML model will be selected. Each model was feeded with the same input variables. A brief summary of the models is presented below:

Extreme gradient boosting machine is an ensemble tree-based ML method that includes a chain of classification and regression trees, with each tree created to predict the outcomes misclassified by the previous tree. Thus, the gradient boosting process focuses on predicting more difficult cases and corrects its own weakness. This “boosting” process continues using repeated cross validation, and an ensemble was included to improve robustness when applied to an external dataset. Extra Tree classifier generates randomized multiple decision trees with different sub-samples without bootstrapping. It avoids the problem of over-fitting and results in better accuracy. Compared to random forest, extra tree classifier randomly choose the attributes and split the values for tree construction. Random forest utilizes multiple classification and regression trees to generate a mean prediction model of case status based on variable importance. When fitting a tree, the random forest algorithm considers a random subset of the predictors at each node and iteratively identifies optimal splits to separate the outcome into two groups with the least disparate outcome probabilities. Random forest accounts for interactions and nonlinear relationships among a large group of factors simultaneously to determine the importance of individual variables, and potentially increasing prediction accuracy. Support vector machine is the large margin classifier which classifies the positive and negative data points with a large boundary between them. SVM classifier does not suffer from overfitting problem unlike other similar classifiers. Artificial neural networks, non-linear models, consist of a series of layers: the input layer (features), a hidden layer, and an output layer (outcome). Each layer is composed of several units called neurons whose value depends on the connections with the other neurons. Multi-layer perceptron is a feedforward artificial neural network that maps input data sets to a set of appropriate outputs. An MLP consists of multiple layers and each layer is fully connected to the next one. The nodes of the layers are neurons with nonlinear activation functions. Between the input and the output layers there may be one or more nonlinear hidden layers.

Logistic regression

A logistic regression model, representing the simplest of all conventional classifiers, was chosen to create a reference model against the performance of other machine models.

Models evaluation

In order to evaluate and select the most accurate model, we used the receiver operating characteristic curve (AUROC) (95% confidence intervals (CI)), maximizing AUCROC indicates a satisfying classification. Additionally, the performance of the models was evaluated by accuracy, sensitivity, specificity, negative predictive value (NPV), and positive predictive value (PPV) of each model. We compared models using these criteria of both training and test datasets.

To validate the best models’ results, we used nested K-fold validation in both RF and XGBoost models. Nested K-fold validation ensures our model doesn’t overfit on the training set. The process divided the dataset into five independent folds, and the model is successively trained on four folds and evaluated on the last fold. The evaluation fold rotates so that the process outputs 5 different AUROC. The highest level of accuracy was selected.

Feature importance

Finally, the interpretation of results in the classifications was evaluated SHapley Additive exPlanation (SHAP) [20]. The SHAP provides the importance and direction of each variable contributing to the model. We present this analysis using SHAP summary and feature importance plots as a method of visual representation.

Statistical analysis

All data analyses were conducted using Stata statistical software version 15 (Stata-Corp), and Python language (ver.3.7.12). Normally distributed variables were summarized using the mean ± standard deviation (SD), and non-normally distributed variables by the median and interquartile range (IQR). Frequencies were expressed by percentage. Mean characteristics between patients with and without vision threatening complications were compared using a two-sample t-test, and proportions were compared using the chi-square test. Two-sided P < 0.05 were considered statistically significant. The SHAP analysis was performed on the cohort subdivided by gender to identify the difference in the affecting factors to the VTBD.

For the included variables with some data missing (< 30%), we applied two techniques; (1) features with missing values were included in the analyses as their own without imputation, and (2) missing values were imputed to avoid removing important variables from the dataset. Imputation followed this strategy: in binary variables missing values were substituted by the mode of each class, while in numeric features they were substituted by the mean of each class, a widely used technique [21]. The dictionary contains the scripts, input and output variables for the different analysis included in the manuscript is provided in the following link (https://github.com/aly202012/Beh-et-s-disease-with-Machine-Learning). Schematic presentation of the main process of the machine learning path presents in Fig. 1.

Fig. 1
figure 1

Schematic presentation of the main path of machine learning process used in the current study

Results

Characteristics of the patients’ cohort

In all, 1049 BD patients were analyzed. Among the participants, the mean ± SD age was 36.1 ± 10 years; 71.5% were male; and mean disease duration was 6.7 ± 4.9 years. The mean BDCAF was 4.9 ± 4.5. There were 393 (42.0%) who were smokers, mostly males. 92.1% of the patients were receiving systemic steroids, 83.5% colchicine, 40.7% azathioprine, 26.8% cyclosporine A, 19.4% cyclophosphamide, 7.1% methotrexate, and 7.8% were receiving biologic therapy. 158 (15.7%) were receiving anticoagulants. The main clinical, laboratory and therapeutic features of the whole cohort were described in Table 2.

Table 2 Characteristics of Bechet’s disease patients according to the vision threatening complications

Overall 78.0% of patients had any form of ocular involvements.Vision threatening manifestations were identified in 549 (50.2%) patients. Most frequent ocular manifestations were anterior uveitis in 542 (50.2%), posterior uveitis in 575 (53.6%), retinal occlusion in 273 (28.7%), cataract and conjunctivitis in 318 (41.9%), optic nerve involvement in 106 (24.1%), and 37 (7.9%) patients were blind.

Performance of various prediction models for predicting VTBD

Table 3 and Additional file 1: Figure S1 summarize the discrimination performance of the models. In all models, the AUCROC for predicting VTBD ranged from 0.64 to 0.85. The AUCROC of the conventional logistic regression model was 0.64 (95% CI 0.58, 0.71). On the other hand, XGBoost showed the best performance ML model in predicting VTBD with an overall AUROC of 0.85 (95% CI 0.81, 0.90). The specificity, sensitivity, and accuracy for this model were 0.86, 0.85, and 0.85, respectively. Random forest performance followed XGBoost model (AUROC = 0.83, 95% CI 0.79, 0.89). Then using K-fold validation method, the highest accuracy was 84.0% for RF and 83.1% for XGBoost models.

Table 3 Results of the model performance on training and test sets

Feature importance and model interpretation

Figure 2 shows the influence of variables to VTBD in the prediction model. Disease activity, thrombocytosis, smoking status, and daily steroid dose were among the top estimators for the presence of VTBD. On the other hand, oral ulcers, chlorambucil and methotrexate use, and gastrointestinal involvement were less important factors compared to the other selected features.

Fig. 2
figure 2

Overall SHAP values for the variables in Shapely plots to display both the feature importance and feature contribution to the model prediction. Shapley plots show the SHAP values in the order of the important variables that contribute to VTBD. The x-axis represents the marginal contribution of a feature to the change in the predicted probability of development of VTBD. Colors indicate the value of the variable: red represents higher numerical values of the variable and blue represent lower numerical values. As all categorical variables were converted into binary indicators, zero (i.e., absence) is indicated with blue dots and one (i.e., presence) is represented by red dots

In term of interpretation of the SHAP dots, for example, the blue (lighter) colors for disease activity (BDCAF) represent lower values, whereas the red (darker) colors represent higher values. Positive SHAP values with red colors indicate that higher BDCAF values related to VTBD, whereas negative SHAP values with lighter colors indicate that lower BDCAF values are strongly protective for presence of VTBD. For the categorical features, patients with musculoskeletal manifestations were more likely to have VTBD as suggested by the larger spread of the red dots on the right.

The associations and average contributions of these features to the absolute predicted probability of VTBD are presented in Fig. 3. BDCAF was the most important predictor and it increased the absolute values of the predicted probability of VTBD by an average of 0.66. Both high ESR and thrombocytosis were associated with a small but positive increase in the predicted probability of potentially VTBD (0.23 and 0.13; respectively). The younger age of disease onset was associated with lower predicted probabilities of existence of VTBD (− 0.54).

Fig. 3
figure 3

XGBoost variable importance for predicting VTBD. The x-axis shows how much each feature added or subtracted to the final probability value for VTBD development. Please note that the numbers presented are average contributions for each feature to the model prediction

Important variables for predicting VTBD stratified by gender

The basic characteristics of the cohort stratified by gender are presented in Additional file 2: Table S1. Figure 4 shows degree and direction of contribution of the variables to VTBD from the Shapley plot separated by gender. In women with BD; BDCAF score, genital ulcer, thrombocytosis, and musculoskeletal manifestations were the top variables that contribute to the VTBD model (Fig. 4A). Higher BDCAF score and presence of diabetes mellitus were associated with higher probability of VTBD (Fig. 4B). For men with BD; thrombocytosis, older age, and presence of gastrointestinal manifestations were associated with a higher probability of VTBD (Fig. 4C). Contradictory to women, genital ulcers were associated with less probability for VTBD in men indicated by more red dots on the left (SHAP values − 0.09) as shown in Fig. 4D.

Fig. 4
figure 4

XGBoost variable importance and overall SHAP values for predicting VTBD among women and men with BD. Figure shows degree and direction of contribution of the variables to VTBD from the Shapley plot separated by gender. Shapley plots show the SHAP values in the order of the important variables that contribute to VTBD (left side) and by the direction of the contribution (right side). A Represents the variables importance in women, while B shows the average probability value of each variables in the contribution of VTBD in women. C Represents the variables importance and D shows the average probability value of each variables in the contribution of VTBD in men

Discussion

Predicting the outcome of ocular disorders in patients with BD, and providing proper and effective care for these patients can improve the visual acuity prognosis. We used a machine learning approach for predicting this severe complication selection, which is suited to data with high dimensionality. In this study, machine learning approaches outperformed the traditional statistical methods in the detection of vision threatening complications among patients with BD. In addition, we identified the important factors that were associated with VTBD risk. An advantage of this approach is that it builds on basic predictors available in the routine setting making the prediction model easily implemented in clinical practice. Further reproducible studies to validate these exploratory ML models in a larger number of BD patients are needed.

Machine learning is more suitable than conventional statistical approaches when: (1) there is no great prior knowledge on the topic under  investigation; and (2) the number of observations largely exceeds the allowed number of input variables [22]. The study demonstrates that XGBoost, the best performing ML mode (accuracy 85.0%) was superior to LR (accuracy 64.0%), which suggests that the relationship between input variables and the predictive outcome is nonlinear. This is in line with previous studies in the field of prediction task of medicine [10, 23, 24]. Although, there are no other studies related to developing ML algorithms for detecting the visual outcome of BD-affected patients, researchers successfully applied ML for a range of BD manifestations’ discrimination [14, 15]. For example, the application of ML classification to discriminate 194 cases of BD with uveitis against other inflammatory uveitis showed an overall accuracy for panuveitides was 96.3% in the training set and 94.0% in the validation set [15]. Using K-fold validation, the XGBoost model recorded a slightly lower level of accuracy than RF (83.1% vs 84.0%). This result can be explained by the small size of the data in each fold when the model builds the tree that records a higher level of accuracy compared to the previous fold. The data in the Fold itself does not contain properties that can be used to build the most appropriate tree until the model exceeds the specified accuracy level.

As opposed to the “black-box” problem of ML, the feature analysis based on the Shapley values, was recently applied  to improve the interpretability of predicting complex models [25]. The current study was identified that thrombocytosis among the important factors contribute to presence of VTBD. Previous research has shown the potential role of platelet either the count parameter (thrombocytosis) or function parameter (mean platelet volume) in BD patients [26]. Although the underlying pathophysiology is not well-understood, platelets have a central role in the pathogenesis of thrombosis. BD is characterized by the venous as well as arterial thrombosis [27]. As platelet counts measurement is low in costs, and readily available in the clinical settings, they could be valuable potential markers in the evaluation of ocular disorders progression.

Interestingly, the feature importance analysis shows an important role for BDCAF score in the estimation of VTBD; higher BDCAF score is associated with a higher probability for VTBD. Disease activity is the presence of any ongoing expression of vasculitis that may precede the disease damage [28]. There was no significant correlation between BDCAF score with the total disease damage assessed either by BDI or vasculitis damage index [6, 29]. A prospective study measures BDCAF score over longitudinal period is preferred to examine the impact of disease activity on the development of VTBD.

This work shows that genital ulcer, systemic vasculitis, and oral ulcers had the lowest probability to be associated with VTBD. Among 249 subjects with BD, these three systemic factors have a predictive value on the development of non-VTBD, defined as a milder form of eye involvement [7]. In other studies, local ocular factors: such as higher frequency of ocular attacks and longer duration of uveitis and retinal vasculitis were the main risk factors for blindness among patients with BD [3, 8]. Although, traditional statistical approaches have suggested some associations, no previous work from a high quality, large dataset using wide spectrum of variables collected in a clinical setting was determined.

There are well-known gender differences in the clinical manifestations and severity of BD [16, 30]; however, few data are available concerning gender difference in the BD associated ocular disorders and their predictors. Different variables were identified as the top features for detecting VTBD in both sexes. For men, thrombocytosis, older age, and gastrointestinal involvement were the top features associated with VTBD, while, in women, higher BDCAF, presence of genital ulcers, and musculoskeletal involvement were the important features for VTBD. Interestingly, genital ulcer was associated with lower predicted probabilities of VTBD in women, but with higher probability of VTBD in men. The course of the BD ocular manifestations is known to be more severe in the male population [30,31,32]; however, Davatchi et al. [33] have shown that the severity of ocular BD had the same outcome and improvement under treatment in the two sexes. Although the etiology of ocular manifestations is unknown, both genetic and environmental factors (smoking, infection, and vitamin D ) have been blamed [34, 35]. While, smoking was the 4th ranked important feature in men, it was the 17th ranked feature in women associated with less probability of VTBD (Fig. 4). Smoking was more common among male patients with BD in some studies raising the question of possible association [36, 37]. Identification of gender difference risk factors for VTBD may indeed be responsible for more severe disease in men.

Although RF presented a comparable accuracy, the XGBoost was considered the ideal model in terms of the results being easier to interpret, and thus allowing to better understand the factors influencing the prediction result [38]. The random forest model aggregates multiple decision trees grown on bootstrap subsamples of the training set, while the XGBoost successively build decision trees, learning from the mistakes of the previous ones. The XGBoost method often achieve the best results on structured data [39] as available in our study. The XGBoost model presented here, if validated for use, is of great clinical interest because it requires demographic and clinical variables only, with no genetic or biomarkers data. In daily practice, prognostic models that would be available at the time of decision-making are preferred. In this way, the rheumatologist can select patients for recommended ophthalmologic counselling, and can decide the appropriate treatment for these patients.

This study has several strengths. First, we used well-curated data available from national BD cohort to generate our classification models. Second, given that the patients were from multicenter and because the partitions were non-random, this approach is considered a type of validation. Third, the use of ensemble tree-based model was a strength owing to its capability to examine a large number of variables, accounting for all others simultaneously. Finally, we provided a Shapley plot that can be easily explained visually and easily to understand. This study still has some limitations. The cross-sectional design, thus the realization of a longitudinal analysis, is certainly needed. Analyses with a large number of variables are susceptible to collinearity among the variables; however, XGBoost classifier is designed to account for collinearity. Patients with BD were excluded if they were missing outcome data in their databases, which may introduce bias to the analysis. In the current study, there is male predilection (71.5%), which is consistent with previous reports from Middle East (REF). However the analysis were stratified by gender differences, further validation of ML models in different gender distribution dataset is suggested. Finally, the decision about what to do when the value of a patient’s variable was missing was a challenge. Our choice to not replace missing values was supported by the similar results we observed with statistical imputation (Additional file 2: Table S2).

Conclusion

In conclusion, we identified that Extreme Gradient Boosting model could reliably identify features associated with BDVT risk better than the conventional statistical method. Furthermore, higher disease activity, thrombocytosis, ever smoking, and daily steroid dose were the top factors associated with VTBD. The identification of VTBD-key contributors improve the multimodal treatment strategies. Such approach could be further validated on an external dataset, and once validated, it would be easy to implement at the point of care to individualize and tailor therapeutic regimens.

Availability of data and materials

The datasets generated during and/or analysed during the current study, and the code required for replicating the results in the paper are available upon request on request to the corresponding author.

Abbreviations

BD:

Behçet’s disease

VTBD:

Vision threatening complications in BD

BDI:

BD damage index

ML:

Machine learning

ECR:

Egyptian College of Rheumatology

BDCAF:

Behçet’s Disease Current Activity Form

XGBoost:

Extreme gradient boosting

ETC:

Extra tree classifier

RF:

Random forest

SVM:

Support vector machine

ANNs:

Artificial neural networks

MLP:

Multi-layer perceptron

AUROC:

Receiver operating characteristic curve

CI:

Confidence intervals

NPV:

Negative predictive value

PPV:

Positive predictive value

SHAP:

SHapley additive explanation

SD:

Standard deviation

IQR:

Interquartile range (IQR)

References

  1. Mat MC, Sevim A, Fresko I, Tüzün Y. Behçet’s disease as a systemic disease. Clin Dermatol. 2014;32(3):435–42.

    Article  PubMed  Google Scholar 

  2. Calamia KT, Wilson FC, Icen M, Crowson CS, Gabriel SE, Kremers HM. Epidemiology and clinical characteristics of Behçet’s disease in the US: a population-based study. Arthritis Care Res. 2009;61(5):600–4. https://doi.org/10.1002/art.24423.

    Article  Google Scholar 

  3. Takeuchi M, Hokama H, Tsukahara R, Kezuka T, Goto H, Sakai JI, et al. Risk and prognostic factors of poor visual outcome in Behcet’s disease with ocular involvement. Graefes Arch Clin Exp Ophthalmol. 2005;243(11):1147–52.

    Article  PubMed  Google Scholar 

  4. Zhang Z, Peng J, Hou X, Dong Y. Clinical manifestations of Behcet’s disease in Chinese patients. APLAR J Rheumatol. 2006;9(3):244–7. https://doi.org/10.1111/j.1479-8077.2006.00208.x.

    Article  Google Scholar 

  5. Kitaichi N, Miyazaki A, Iwata D, Ohno S, Stanford MR, Chams H. Ocular features of Behcet’s disease: an international collaborative study. Br J Ophthalmol. 2007;91(12):1579–82.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Gheita TA, Hammam N, Fawzy SM, Abd El-Latif E, El-Gazzar II, Samy N, et al. Development and validation of a Behçet’s disease damage index for adults with BD: an explicit, composite and rated (ECR) tool. Int J Rheum Dis. 2021;24(8):1071–9.

    Article  PubMed  Google Scholar 

  7. Hussein MA, Eissa IM, Dahab AA. Vision-threatening Behcet’s disease: severity of ocular involvement predictors. J Ophthalmol. 2018;2018:9518065.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Shams H, Lasheyei A, Javadian A, Karkhaneh R, Shahram F, Davachi F. The risk factors and causes for blindness in Behcet’s Disease. 2008.

  9. Pandit A, Radstake TRDJ. Machine learning in rheumatology approaches the clinic. Nat Rev Rheumatol. 2020;16(2):69–70.

    Article  PubMed  Google Scholar 

  10. Lee S, Kang S, Eun Y, Won HH, Kim H, Lee J, et al. Machine learning-based prediction model for responses of bDMARDs in patients with rheumatoid arthritis and ankylosing spondylitis. Arthritis Res Ther. 2021;23(1):254. https://doi.org/10.1186/s13075-021-02635-3.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Hügle M, Omoumi P, van Laar JM, Boedecker J, Hügle T. Applied machine learning and artificial intelligence in rheumatology. Rheumatol Adv Pract. 2020;4(1):rkaa005.

    Article  PubMed  PubMed Central  Google Scholar 

  12. An introduction to machine learning and analysis of its use in rheumatic diseases. Nat Rev Rheumatol. https://www.nature.com/articles/s41584-021-00708-w?proof=t%25C2%25A0.

  13. Isik YE, Gormez Y, Aydin Z, Bakir-Gungor B. The determination of distinctive single nucleotide polymorphism sets for the diagnosis of Behçet’s disease. IEEE/ACM Trans Comput Biol Bioinform. 2021.

  14. Kim JM, Kang JG, Kim S, Cheon JH. Deep-learning system for real-time differentiation between Crohn’s disease, intestinal Behçet’s disease, and intestinal tuberculosis. J Gastroenterol Hepatol. 2021;36(8):2141–8.

    Article  PubMed  Google Scholar 

  15. Standardization of Uveitis Nomenclature (SUN) Working Group. Classification criteria for Behçet disease uveitis. Am J Ophthalmol. 2021;228:80–8.

  16. Gheita TA, El-Latif EA, El-Gazzar II, Samy N, Hammam N, Abdel Noor RA, et al. Behçet’s disease in Egypt: a multicenter nationwide study on 1526 adult patients and review of the literature. Clin Rheumatol. 2019;38(9):2565–75.

    Article  PubMed  Google Scholar 

  17. International Team for the Revision of the International Criteria for Behçet’s Disease (ITR-ICBD). The International Criteria for Behçet’s Disease (ICBD): a collaborative study of 27 countries on the sensitivity and specificity of the new criteria. J Eur Acad Dermatol Venereol. 2014;28(3):338–47.

  18. Borstelmann SM. Machine learning principles for radiology investigators. Acad Radiol. 2020;27(1):13–25.

    Article  PubMed  Google Scholar 

  19. Strobl C, Malley J, Tutz G. An Introduction to recursive partitioning: rationale, application and characteristics of classification and regression trees, bagging and random forests. Psychol Methods. 2009;14(4):323–48.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Lundberg SM, Lee SI. A unified approach to interpreting model predictions. In: Advances in neural information processing systems. Curran Associates, Inc.; 2017. Available from: https://proceedings.neurips.cc/paper/2017/hash/8a20a8621978632d76c43dfd28b67767-Abstract.html.

  21. Han J, Pei J, Kamber M. Data mining: concepts and techniques. Elsevier; 2011.

  22. Rajula HSR, Verlato G, Manchia M, Antonucci N, Fanos V. Comparison of conventional statistical methods with machine learning in medicine: diagnosis, drug development, and treatment. Medicina. 2020;56(9):455.

    Article  PubMed  PubMed Central  Google Scholar 

  23. A comparison of machine learning models versus clinical evaluation for mortality prediction in patients with sepsis. Available from: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0245157

  24. Comparison of machine learning methods with traditional models for use of administrative claims with electronic medical records to predict heart failure outcomes|Cardiology|JAMA Network Open|JAMA Network [Internet]. [cited 2022 Jan 19]. Available from: https://jamanetwork.com/journals/jamanetworkopen/fullarticle/2758475.

  25. Interpretation of machine learning models using shapley values: application to compound potency and multi-target activity predictions | SpringerLink [Internet]. [cited 2022 Jan 19]. Available from: https://link.springer.com/article/10.1007/s10822-020-00314-0

  26. Acikgoz N, Karincaoglu Y, Ermis N, Yagmur J, Atas H, Kurtoglu E, et al. Increased mean platelet volume in Behçet’s disease with thrombotic tendency. Tohoku J Exp Med. 2010;221(2):119–23.

    Article  CAS  PubMed  Google Scholar 

  27. La Regina M, Gasparyan AY, Orlandini F, Prisco D. Behçet’s disease as a model of venous thrombosis. Open Cardiovasc Med J. 2010;4:71–7.

    PubMed  PubMed Central  Google Scholar 

  28. Merkel PA. Defining disease activity and damage in patients with small-vessel vasculitis. Cleve Clin J Med. 2012;79(Suppl 3):S11-15.

    Article  PubMed  Google Scholar 

  29. Vasculitis damage index in Behçet’s disease|Advances in Rheumatology|Full Text [Internet]. [cited 2022 Jan 19]. Available from: https://advancesinrheumatology.biomedcentral.com/articles/10.1186/s42358-021-00193-5.

  30. Tugal-Tutkun I, Onal S, Altan-Yaycioglu R, Altunbas HH, Urgancioglu M. Uveitis in Behçet disease: an analysis of 880 patients. Am J Ophthalmol. 2004;138(3):373–80.

    Article  PubMed  Google Scholar 

  31. Yang P, Fang W, Meng Q, Ren Y, Xing L, Kijlstra A. Clinical features of Chinese patients with Behçet’s disease. Ophthalmology. 2008;115(2):312-318.e4.

    Article  PubMed  Google Scholar 

  32. Bang DS, Oh SH, Lee KH, Lee ES, Lee SN. Influence of sex on patients with Behçet’s disease in Korea. J Korean Med Sci. 2003;18(2):231–5.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Davatchi F, Shahram F, Chams C, Chams H, Nadji A, Jamshidi AR, et al. The influence of gender on the frequency of clinical symptoms in Behçet’s disease. Adv Exp Med Biol. 2003;528:65–6.

    Article  CAS  PubMed  Google Scholar 

  34. Gul A. Behcet’s disease: an update on the pathogenesis. Clin Exp Rheumatol. 2001;19:S6.

    CAS  PubMed  Google Scholar 

  35. Gul A. Behçet’s disease as an autoinflammatory disorder. Curr Drug Targets Inflam Allergy. 2005;4(1):81–3.

    Article  Google Scholar 

  36. Özer H, Güneşaçar R, Dinkçi S, Özbalkan Z, Yildiz F, Erken E. The impact of smoking on clinical features of Behçet’s disease patients with glutathione S-transferase polymorphisms. Clin Exp Rheumatol. 2012;30(3):S14.

    PubMed  Google Scholar 

  37. Aramaki K, Kikuchi H, Hirohata S. HLA-B51 and cigarette smoking as risk factors for chronic progressive neurological manifestations in Behçet’s disease. Mod Rheumatol. 2007;17(1):81–2.

    Article  PubMed  Google Scholar 

  38. Zhang Z, Ho KM, Hong Y. Machine learning for the prediction of volume responsiveness in patients with oliguric acute kidney injury in critical care. Crit Care. 2019;23(1):112.

    Article  PubMed  PubMed Central  Google Scholar 

  39. Duquesne J, Bouget V, Cournède PH, Fautrel B, Guillemin F, de Jong PHP, et al. Machine learning identifies a profile of inadequate responder to methotrexate in rheumatoid arthritis. Rheumatology (Oxford). 2022;keac645.

Download references

Acknowledgements

Not applicable.

Funding

Open access funding provided by The Science, Technology & Innovation Funding Authority (STDF) in cooperation with The Egyptian Knowledge Bank (EKB). The authors did not receive any funding related to this work.

Author information

Authors and Affiliations

Authors

Contributions

NH and TG conceived and designed the research. NH and AB generated the machine learning models and analyzed data NH and TG interpreted the results. All authors performed clinical data collection and review. NH wrote the manuscript. All authors reviewed the manuscript. All authors read and approved the final manuscript. Nevin Hammam: Former Specialist at University of California San Francisco, CA, USA. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Nevin Hammam.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the Institutional Review Board of Cairo University, Cairo, Egypt (IRB No.: 47-SReC-RCU2021) and the informed consent was obtained from all participants. The data used for this study were fully anonymized. All methods were carried out in accordance with relevant guideline and regulations.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Supplementary figure 1: Receiver operating characteristic (ROC) curve analysis of machine learning algorithms for prediction of VTBD in the training (Left figure) and testing (Right figure) sets.

Additional file 2.

Supplementary table 1: Characteristics of Bechet’s disease patients stratified by gender.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hammam, N., Bakhiet, A., El-Latif, E.A. et al. Development of machine learning models for detection of vision threatening Behçet’s disease (BD) using Egyptian College of Rheumatology (ECR)–BD cohort. BMC Med Inform Decis Mak 23, 37 (2023). https://doi.org/10.1186/s12911-023-02130-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12911-023-02130-6

Keywords