Skip to main content

Machine learning to predict virological failure among HIV patients on antiretroviral therapy in the University of Gondar Comprehensive and Specialized Hospital, in Amhara Region, Ethiopia, 2022

Abstract

Background

Treatment with effective antiretroviral therapy (ART) reduces viral load as well as HIV-related morbidity and mortality in HIV-positive patients. Despite the expanded availability of antiretroviral therapy around the world, virological failure remains a serious problem for HIV-positive patients. Thus, Machine learning predictive algorithms have the potential to improve the quality of care and predict the needs of HIV patients by analyzing huge amounts of data, and enhancing prediction capabilities. This study used different machine learning classification algorithms to predict the features that cause virological failure in HIV-positive patients.

Method

An institution-based secondary data was used to conduct patients who were on antiretroviral therapy at the University of Gondar Comprehensive and Specialized Hospital from January 2020 to May 2022. Patients’ data were extracted from the electronic database using a structured checklist and imported into Python version three software for data pre-processing and analysis. Then, seven supervised classification machine-learning algorithms for model development were trained. The performances of the predictive models were evaluated using accuracy, sensitivity, specificity, precision, f1-score, and AUC. Association rule mining was used to generate the best rule for the association between independent features and the target feature.

Result

Out of 5264 study participants, 1893 (35.06%) males and 3371 (64.04%) females were included. The random forest classifier (sensitivity = 1.00, precision = 0.987, f1-score = 0.993, AUC = 0.9989) outperformed in predicting virological failure among all selected classifiers. Random forest feature importance and association rules identified the top eight predictors (Male, younger age, longer duration on ART, not taking CPT, not taking TPT, secondary educational status, TDF-3TC-EFV, and low CD4 counts) of virological failure based on the importance ranking, and the CD-4 count was recognized as the most important predictor feature.

Conclusion

The random forest classifier outperformed in predicting and identifying the relevant predictors of virological failure. The results of this study could be very helpful to health professionals in determining the optimal virological outcome.

Peer Review reports

Background

According to the Global HIV/AIDS Statistics, around 38 million individuals are living with HIV, 28.7 million people were accessing antiretroviral therapy, 1.5 million persons are newly infected with HIV, and 650,000 people died from AIDS-related illnesses around the world in 2021 [1]. Approximately 67% of HIV-positive people live in Africa [2]. In Eastern and Southern Africa, 20.6 million people are living with HIV, and 16.2 million individuals receiving antiretroviral therapy [1]. UNAIDS hoped that by the end of 2020, 90% of people living with HIV knew their HIV status, 90% of those who knew their HIV-positive status were on treatment, and 90% of those on treatment had suppressed viral load [1]. Since this target was not achieved, UNAIDS modified it to 95-95-95 targets to end the HIV/AIDS pandemic by 2030 [3]. Nonetheless, it remains a serious global public health issue; 40 million people have died from AIDS-related illnesses since the epidemic began [4].

Furthermore, Ethiopia is one of the most affected countries in Sub-Saharan Africa by the HIV epidemic, with an estimated 617,921 people living with HIV, 78% of adults and children receiving ART. More than 60% of new infections occur in the Amhara, Oromia, SNNP, and Tigray regions, with the Amhara region having the highest number of HIV patients at around 191,480. The prevalence of HIV among adults in Ethiopia is 0.9% [5,6,7]. According to the Ethiopian National HIV Care Guideline, virological failure is defined as a viral load greater than 1000 copies/ml based on two consecutive viral load tests in three months, with improved adherence support following the first viral load test [8]. This guideline is approved by WHO that viral load monitoring is the best standard for determining the efficacy of treatment [9]. Virological failure is a common problem encountered by HIV patients after the initiation of treatment [10]. WHO guidelines recommend that virological failure should not exceed 10% after the initiation of treatment [10, 11]. However, studies show that viral failure is much higher than expected, and viral replication remains a major challenge for HIV-infected patients [12,13,14]. Patients who experienced virologic failure were at increased risk of clinical progression to AIDS and death compared with patients who had a complete virological response [15].

In Sub-Saharan African countries, HIV viral load testing has been used and scaled up to monitor patients on antiretroviral therapy (ART). However, more than 24% of patients receiving first-line ART experience virologic failure within 1 year of starting therapy [16, 17]. Failure to achieve and maintain viral suppression can lead to resistance and an increased risk of viral infection [17, 18]. Therefore, with delayed early identification of virological failure, patients with HIV are more likely to have illness progression and death [2, 19, 20]. As a result, effective antiretroviral therapy reduces the incidence and mortality in HIV-positive patients [4]. Furthermore, HIV viral load suppression is the most important indicator of successful antiretroviral therapy, progression, and death, as well as the most effective monitoring strategy for confirming a diagnosis and proving ARV treatment failure [21]. Despite the expanded availability of antiretroviral treatment (ART) around the world, virological failure remains a significant challenge for HIV patients [22, 23]. Delays in commencing ART can have serious consequences, particularly for people with tuberculosis or extensive immunosuppression who are at high risk of mortality [21, 24]. A study indicates that viral suppression was much lower than expected at the first test, and viral replication remains a major issue for HIV patients [25]. According to recent research conducted in Sub-Saharan Africa, the proportion of individuals who experienced virological failure ranged from 11 to 47% [26,27,28,29]. Furthermore, in Ethiopia, studies reported that virologic failure rates ranged from 5.4 to 19% [30,31,32]. Several studies have found that socio-demographic predictors (age, gender, marital status, religion), clinical-related predictors (low BMI, TB co-infection), and treatment-related predictors (excessive dose frequency, poor medication adherence) can all influence virological failure in HIV patients [30, 33,34,35,36]. However, the predictors of virological failure varied between studies.

Understanding the causes that lead to virological failure in HIV patients on antiretroviral therapy (ART) is essential for HIV/AIDS prevention and control programs Also minimize these adverse outcomes, clinical prediction models that use patient-level evidence can be used in medical decision-making [37]. Meanwhile, traditional statistical modeling was created for data with a few dozen input variables and sample sizes that would be considered small to medium. The complexity of data may make traditional statistical inference less manageable. As a result, machine learning was used more effectively [38]. Machine learning methods can identify and discover patterns in complex datasets and predict future HIV treatment outcomes with excellent predictive performance [39,40,41]. Machine learning predictive algorithms have the potential to improve the quality of care and predict the needs of HIV patients by analyzing huge amounts of data, and enhancing prediction capabilities [42, 43]. Furthermore, machine learning is used to identify HIV patients who are at high risk of failing to adhere and being a virological failure [44] and able to learn from domain experts quickly and accurately, then apply that knowledge to identify HIV risk behaviors in a huge dataset [45]. Ethiopia has a severe HIV pandemic with increasing virological failure [46]. Despite these facts, researchers using data from this country have only used statistical methods to identify factors of viral failure [46,47,48]. Although previous studies have been conducted to investigate the determinants of virological failure in different regions of Ethiopia, most of them have used classical statistical models to identify important predictors of virological failure. Therefore, advanced predictive modeling is required to improve the targeting of interventions through differentiated care models, to be more cost-effective, to improve patient outcomes, and to initiate treatment based on achieving maximum patient survival and clinical benefit. Therefore, this study took advantage of the unique opportunity provided by access to the secondary dataset in the ART clinic of the University of Gondar comprehensive and specialized hospital. Then, we compare the relative performances of various machine learning algorithms for predicting virological failure using ART datasets. Identifying the risk factors for virological failure in HIV-positive patients also helps to prevent the virus from spreading and reduces the possibility of treatment failure.

Method

Study design

This study used a quantitative research method with a machine learning approach and an institution-based cross-sectional study was conducted at the University of Gondar Comprehensive and Specialized Hospital.

Study area and period

This study was conducted at the University of Gondar Comprehensive and Specialized Hospital using secondary data from January 2020 to May 2022. The University of Gondar Comprehensive and Specialized Hospital is located in the Amhara National Regional State, Gondar City Administration. It is located 748 km northwest of Ethiopia’s capital, Addis Ababa. The hospital, which is Ethiopia’s oldest academic institution, is a teaching hospital that provides teaching activities to medical and health science students. The hospital provides medical education, training, medical services, and many other services to more than 7 million people in Gondar province and neighboring areas. One of the services provided is the ART service, and through its service, both children and adults can get a free diagnosis, treatment, and monitoring. The hospital started a free ART service in March 2005. At the University of Gondar hospital, the ART clinic reported that 15,933 patients were enrolled for ART. Of those, 5523 patients are actively taking the treatment.

Source population and study population

All adult (≥ 18 years) HIV-positive patients on antiretroviral therapy (ART) who attend ART clinics at the University of Gondar Comprehensive and Specialized Hospital were the source population, while all adult HIV-positive patients who have been on antiretroviral therapy (ART) for at least 6 months at the University of Gondar Comprehensive and Specialized Hospital were the study population.

Inclusion and exclusion criteria

All adult HIV-positive patients who have been on antiretroviral therapy (ART) for at least 6 months, have had viral load tests, and have records in the ART computerized database at the University of Gondar Comprehensive and Specialized Hospital were included. However, the records that had significant missing feature values (like CD4 counts, Adherence, and WHO Stage) were excluded.

Sample size and sampling procedure

Out of all the ART patients who received treatment at the time this study was undertaken, 5264 adult HIV-positive patients from the University of Gondar Comprehensive Specialized Hospital’s ART clinic were included for the model building and prediction of virological failure.

Study variables

Dependent and independent variables

In this study, the dependent variable was a virological failure while the independent variables were sociodemographic factors (age, sex, marital status, educational status, residence), clinical-related factors (BMI, WHO stages, TB screened, TB screened result, functional status), hematologic and immunological factors (CD4 count) and treatment-related factors (treatment adherence, duration on ART, ART regimen, TPT started, TPT discontinued, CPT use, viral load test duration).

Operational definition

Virological failure is defined as a viral load greater than 1000 copies/ml based on two consecutive viral load tests within three months, with better adherence support after the first viral load test [21]. Using the definition of HIV viral load, this study classified the outcome variable into suppressed (not virological failure) and unsuppressed (virological failure). If the patient’s viral load was less than 1000 copies/ml, the outcome variable was classified as suppressed (not virological failure), and if the viral load was larger than 1000 copies/ml, the outcome variable was classified as unsuppressed (virological failure).

Treatment adherence is defined as the percentage of a person’s medication-taking behavior that conforms to agreed-upon recommendations from a healthcare practitioner. Good adherence is defined as more than 95%, fair adherence is defined as between 85% and 94%, and poor adherence is defined as less than 85% adherence [21].

Data collection tools and procedures

The main source of data for this study was the ART clinic at the University of Gondar Comprehensive and Specialized Hospital, which had existing record data on an electronic database that contained information on HIV-positive patients. It is identified by unique ART numbers and MRN. From all ART patients, a total of 5,264 adult HIV-positive patients’ data were extracted from the electronic database using a structured checklist that was prepared in English. It was adapted from the Ethiopian Federal Ministry of Health ART clinic intake and follow-up form [49]. Multiple tables with a variety of feature integrations were present in the electronic database. The features were chosen after discussions with subject-matter experts and categorized as sociodemographic features, hematologic and immunological features, clinical-related features, and treatment-related features. Features with identifiable information such as patient names and phone numbers were removed during data extractions. After all data collection procedures were completed, twenty features and one target feature were identified for this study.

Data quality assurance

Data quality assurance is an important step for determining the quality of data. During the initial stage of data collection from the electronic database, the principal investigator provided one day of training to the ART data clerk and observed each step of deidentified record extraction for the feature range value consistency and correctness of the collective data. The principal investigator repeatedly checked the data completeness and randomly cross-checked the patient charts to ensure data similarity and consistency.

Data management and analysis

Patients’ data were extracted from the electronic database in Microsoft Excel format and then converted to comma-separated values (CSV) to make the dataset easier to pre-process in Python version 3. Machine learning needs a high-quality dataset for prediction. Due to this, handling the missing data during the pre-processing of the dataset is a crucial phase. An imputation technique was applied to handle the missing values in the dataset. As a result, this study handled the missing values using simple imputation techniques. The Simple Imputer class of the scikit-learn module was used for imputing the missing values in the dataset [50]. Data pre-processing also includes encoding data, which is a crucial and essential step. It used one-hot and label encoding to encode categorical variables. Values with two or more category values are considered categorical if they are discrete and not continuous. In this study, categorical variables were encoded using one hot encoding approach. The categorical values are replaced by a number between 0 and 1 in one hot encoding [51].

Data Analysis

In this study, descriptive statistics were used to describe the socio-demographic characteristics using frequency and percentage. Some of the data analysis parts were data pre-processing, feature selection, data split, imbalanced data handling, building the model, and testing the performance of the model. In this study, Python version 3 software was used. It is commonly used in data analysis and interactive computing. It has also been a popular choice for data analysis tasks in recent years and It has libraries for data loading, visualization, statistics, natural language processing, and image processing [52, 53].

Feature selection method

Feature selection is one of the main processes of data dimensionality reduction in machine learning. It is the method of extracting the necessary beneficial factors from a dataset to improve the accuracy of machine learning predictions [54]. A dataset with redundant, irrelevant, and inconsistent features may be reducing the model’s performance. It is essential to use feature selection methods to improve the accuracy of model performance and reduce the training time and overfitting [55].

In this study, different feature selection methods such as recursive feature elimination (RFE), random forest feature importance, and the Boruta feature selection method were used to select the relevant predictive feature. RFE is a feature selection technique that starts with all the features in the training dataset and removes features until the target number of the feature remains. Furthermore, it is a relatively efficient technique for decreasing model complexity and speeding up the processing of machine-learning algorithms [56].

Data split

A typical strategy in machine learning is to split the data into training and testing sets. K-fold cross-validation is commonly used for improved model selection in classification prediction (50 ). This study used the stratified tenfold cross-validation method, which partitioned the entire dataset into ten folds. Nine folds were used to train the model and then the remaining one fold was used to test the model. The procedure is repeated ten times with the testing fold [57, 58]. In addition, cross-validation was a useful method for this study because the available trained dataset was small and the total performance of the model is the average of all 10 folds [59].

Imbalance data handling

Many real-world applications, including medical diagnosis, pattern recognition, and fraud detection, rely on imbalanced data learning. The problem occurs when there is a big gap in numbers between the majority and minority classes, which is particularly common in classes having binary values, and frequently occurs in supervised machine learning [60]. The outcome variable for this study was a binary classification of suppressed (not virological failure) and unsuppressed (virological failure). The majority class was suppressed (not virological failure) while the minority class was unsuppressed (virological failure).

Predictive accuracy is commonly used to evaluate the performance of machine learning algorithms, but the imbalance of data affects accuracy and machine learning prediction, making it difficult to determine the cause of virological failure. In this study, the Synthetic Minority Oversampling Technique (SMOTE) [61]. Adaptive Synthetic (ADASYN), and random under-sampling were used to balance the majority and minority classes. SMOTE is an oversampling approach for imbalanced datasets that is used as a preliminary step in learning algorithms and is an effective way of handling class imbalance. It creates new samples by randomly interpolating linearly between a few samples and their neighbors [62, 63]. Random under-sampling is a method of balancing the imbalance of classes in which the majority class is under-sampled by randomly discarding samples from the population of the majority class until the minority class reaches a certain percentage of the majority class [61].

ADASYN is the adaptive generation of minority data samples per their distributions. More synthetic data is generated for minority class samples that are harder to learn compared to those minority samples that are easier to learn and can minimize the bias introduced by the imbalanced data distribution [64]. The figure below shows that to balance the dataset, under-sampling and over-sampling methods [65].

Method of building a predictive model

Predictive modeling is defined as “the process of developing a mathematical tool or model that generates an accurate prediction” [66]. Classification algorithms are supervised learning techniques that divide a set of data into specific categories. In this study, seven supervised classification algorithms were applied. Some of the classification machine learning algorithms were support vector machine, random forest, decision tree, logistic regression, gradient boosting, K-nearest neighbours, and XGBoost. During the literature review, it has been observed that these algorithms are better suited for classification problems in the healthcare domains [43, 67,68,69]. The algorithms were chosen based on their accuracy, training duration, ability to handle missing data, and ease of interpretation and learning.

Performance evaluation for predictive model

After model training, the model’s performances are evaluated and compared to each other. The performance of the prediction models was evaluated based on the confusion matrix. This study used precision, sensitivity, specificity, F1-score, and the area under the receiver-operating characteristic (AUC-ROC) to assess the model performance. AUC-ROC is a popular and powerful performance metric for assessing the performance of binary classifiers and is used to evaluate the model’s predictive classification capacity. A range of 0.5 indicated no ability to predict, while 1.0 indicated exceptional or perfect predictive ability [44, 70]. The confusion matrix, which is used to represent the output of a model as a binary class, is a standard performance metric tool used in machine learning classification tasks. The confusion matrix was crucial for this study to determine the sensitivity, precision, f1-score, and performance metrics of accuracy. The matrix is made up of predictions that have been summarized into a total number of correct and incorrect predictions [70, 71] (Table 1).

Equations used to calculate Recall (sensitivity), (specificity), precision, and accuracy are given below.

$$\mathrm{Recall}\;(\mathrm{Sensitivity},\;\mathrm{true}\;\mathrm{positive}\;\mathrm{rate})\;=\frac{\mathrm{TP}}{\mathrm{TP}+\mathrm{FN}}$$
(1)
$$\mathrm{Specificity}\;(\mathrm{true}\;\mathrm{negative}\;\mathrm{rate})\;\;=\frac{\mathrm{TN}}{\mathrm{TN}+\mathrm{FP}}$$
(2)
$$\mathrm{Precision}\;(\mathrm{positive}\;\mathrm{predictive}\;\mathrm{value})\;\;=\frac{\mathrm{TP}}{\mathrm{TP}+\mathrm{FN}}$$
(3)
$$\mathrm{Accuracy}=\frac{\mathrm{TP}+\mathrm{FN}}{\mathrm{TN}+\mathrm{TP}+\mathrm{FP}+\mathrm{FN}}$$
(4)
$$\mathrm F1\;=2\ast\frac{\mathrm{Recall}\ast\mathrm{Precision}}{\mathrm{Recall}+\mathrm{Precision}}$$
(5)
Table 1 Confusion matrix show to summarizes the total number of correct and incorrect predictions and actual positive and negative

In summary, the machine learning pipeline in this study is depicted in Fig. 1.

Fig. 1
figure 1

Workflow of machine learning for virological failure prediction

Association rule

The association rule mining technique, which examines the concurrent correlations between several variables in a grouping, was first created by Agarwal

and Srikanth [72]. Therefore, this study used an additional method that supports the classification of machine learning algorithms for predicting virological failure using R software. Using the apriori algorithm [73] to discover the association between the selected features and the target feature. To identify all potential association rules, the minimal support degree was set at 0.00095 and the minimum confidence threshold at 90% because a rule is considered reliable if its confidence level is more than 80% [74].

To produce association rules, this study primarily focused on features that are implied by the target features (Antecedent = > Consequent), which is a way to classify all the variables that contribute to virological failure. These rules are often referred to as classification association rules [75]. The reason for using this method was to know the predictors that each category contributed to the virological failure.

Support, Confidence, and Lift can be represented as follows for a certain rule: Here, the sets of features represented by X and Y are mutually exclusive.

$$\mathrm{Rule}\;\mathrm X=>\mathrm Y$$
(1)
$$\mathrm{Support}\;=\frac{\mathrm{frequency}(\mathrm X,\mathrm Y)}{\mathrm N}$$
(2)
$$\mathrm{Confidence}\;=\frac{\mathrm{frequency}(\mathrm X,\mathrm Y)}{\mathrm{frequency}(\mathrm X)}$$
(3)
$$\mathrm{Lift}\;=\frac{\mathrm{frequency}(\mathrm X,\mathrm Y)}{\mathrm{frequency}(\mathrm X)\mathrm{frequency}(\mathrm Y)}$$
(4)

Results

Sociodemographic characteristics of participants

From the total number of HIV patients who had ever started antiretroviral therapy, this study includes 5,264 patients who were on antiretroviral therapy. 64% (3371) were females, and 39.31% (2,069) were between the ages of 38 and 47. More than half of HIV patients 57.26% (3014) were married and 77.79% (4095) lived in Gondar city. Almost half of the patients had secondary educational status, and 91.76% (4830) of all patients were orthodox (Table 2).

Table 2 Sociodemographic characteristics of the study participants in the University of Gondar Comprehensive and Specialized Hospital, July 2022 (n = 5264)

Data Pre-processing Results

In this study, the main pre-processing steps involved dealing with missing or null values, encoding categorical labels, and balancing the dataset. Due to the incompleteness of the initial raw data, missing values were handled by the imputation technique. Figure 2 shows the percentage of missing values for each feature. The features with the higher percentage of missing data were educational level (15.14%), religion (13.09%), CD4 count (12.82%), and marital status (12.71%). The missing values were then imputed using the simple imputation technique. Input and output features must be numerical to be used with machines. This implies that to fit our data to the model, we used one hot encoder to encode categorical variables that are present in the dataset. From this, the dataset had twenty features, with one target feature. One-hot encoding is a useful encoding technique for classification tasks. Each categorical value was transformed into a new column with one-hot encoding, and the label values were created as new columns (1 or 0) [51].

Fig. 2
figure 2

Percentage of missing values in the ART dataset

Imbalance data handling

To address the data imbalance issue and increase the effectiveness of the machine learning algorithms, imbalance data handling was an important method for this study. From the outcome feature, 5123 (97.32%) of the observations were classified as suppressed and 141 (2.68%) were unsuppressed. This work used under-resampling, SMOTE, and ADNYSN as balanced sampling methods to apply the classification algorithm to a balanced dataset. Each balanced sampling method used a different sampling technique to balance the unequal dataset by either maximizing the minority class or decreasing the majority class. This study compared the balancing technique with the accuracy and AUC using selected supervised classification machine learning algorithms. In the unbalanced dataset, logistic regression performed with a higher AUC of 0.915 as compared to another classifier, and random forest achieved a higher accuracy of 97.42%. In the ADASYN, the XGBoost classifier had a higher accuracy of 96.24% and an AUC of 0.994. Additionally, support vector machines performed better in under-sampling techniques, with training accuracy and AUC values of 84.60% and 0.94, respectively (Table 3).

Table 3 compares imbalanced data handling techniques using accuracy and Area under the curve (AUC)

In the comparison of different balanced sampling methods using a random forest classifier, the SMOTE sampling balanced method performed better than other sampling methods with an accuracy of 98.80% and an area under the curve (AUC) of 0.999. ADASYN was the second most important balanced resampling method, next to the SMOTE balanced method, with accuracy and an AUC of 95.51% and 0.991, respectively. However, of the balancing imbalanced data handling methods, the under-sampling balanced method was the least important for this study, with an AUC of 0.917 and an accuracy of 83.3% (Fig. 3).

Fig. 3
figure 3

Comparison of balanced sampling method using random forest classifier

Generally, this study had a high issue of imbalanced data. A balanced sampling method was important to overcome such a problem. Unbalanced data makes machine learning challenging and hinders the performance of the classifying algorithm because values from the minority class or rarely occurring classes are mistakenly classified as instances of the majority class. After all, the classifier is overloaded with the majority class and ignores the minority class of virological failure or unsuppressed. The total number of records increased after SMOTE was applied to the unbalanced dataset (Fig. 4). This study compared the classifier and the balanced sampling method with other classifiers using training accuracy and AUC. However, classification algorithms with high accuracy do not always interpret better performance on target datasets because imbalanced data affects the performance of accuracy. For this reason, we mainly used AUC to compare the classifier and balanced sampling method.

Fig. 4
figure 4

Before unbalanced and after balancing the target feature

Model building and model evaluation

This study conducted several experiments for model building in predicting the occurrence of unsuppressed or viral failure in patients on ART. These experiments were primarily focused on two parts: the first part trained the various classification algorithms with an unbalanced dataset using twenty features, and the second part used a balanced sampling method to identify the best model with twenty features.

The ROC curve in Fig. 5 shows that the imbalanced dataset results from stratified 10-fold cross-validation with default hyperparameter tuning performed less than the balanced dataset from all classifiers. The Support Vector Machine outperformed all other classifiers, with an AUC of 0.966, followed by a gradient boosting classifier with an AUC of 0.961, logistic regression with an AUC of 0.956, XGBoost with an AUC of 0.944, random forest with an AUC of 0.919, and KNN with an AUC of 0.761. Of all the classifiers, the decision tree classifier achieved a lower performance with an AUC of 0.634 (Fig. 5).

Fig. 5
figure 5

ROC curve shows an unbalanced dataset

The second half of this work was mainly concentrated on building the classification algorithm with a balanced dataset by utilizing different balancing techniques, including under-resampling, SMOTE, and ADASYN. As shown in Table 3, from all the balanced sampling methods, SMOTE was selected by comparing the AUC results for this study. After applying different machine learning classification algorithms to the imbalanced data using SMOTE, high accuracy, precision, sensitivity, specificity, f1-score, and AUC were obtained. In comparison to other algorithms, the random forest classifier produces the most accurate and meaningful results by comparing the performance of the model using performance metrics. For each method, all hyperparameters were left at their default values while the experiments were being conducted. The accuracy, precision, sensitivity, specificity, and f1-score were used in this study to assess how well the models performed. The ROC curve in Fig. 6 shows that the random forest classifier (AUC = 0.9989) outperformed all other classifiers, followed by the XGBoost classifier (AUC = 0.9985), support vector machine (AUC = 0.997), logistic regression (AUC = 0.992), gradient boosting classifier (AUC = 0.992), the k-nearest neighbor classifier (AUC = 0.982) and decision tree classifier (AUC = 0.983). The relationship between false positive and true positive rates is shown in the ROC curve, which states that a predictive model was more accurate for the prediction of virological failure (Fig. 6).

Fig. 6
figure 6

ROC curve shows a balanced dataset using SMOTE

The performance evaluation results with the balanced dataset in Fig. 7 show that all the classifiers of the models achieved different results. The random forest classifier achieved a sensitivity of 100%, a precision of 98.65%, and an f1-score of 99.32%. The XGBoost classifier was performed with a sensitivity of 99.60%, a precision of 96.96%, and an f1-score of 98.26%. The decision tree classifier achieved a sensitivity, precision, and f1-score of 99.80%, 96.41%, and 98.08%, respectively. The K-Nearest Neighbors Classifier achieved the highest sensitivity performance but lower precision and f1-score as compared to the random forest classifier. Support vector machine (sensitivity = 99.02%, precision = 96.38%) achieved the performance metrics; logistic regression with a sensitivity of 96.48% and precision of 95.00%, gradient boosting classifier (sensitivity = 99.02%, precision = 93.02%, f1-score = 96.02%).

Fig. 7
figure 7

Performance of Classifier using SMOTE

Generally, from the performance metrics shown in Fig. 7, random forest achieved the highest sensitivity, precision, and f1-score. Of all the classifiers, the k-nearest neighbor classifier, the decision tree classifier, and the random forest classifier had the highest sensitivity value, which means the classifier was correctly classified as true positive. Overall, it is believed that the classifiers have an acceptable performance (Fig. 7).

Random Forest Model performance

This experiment was developed to evaluate how well different classifiers predicted virological status. The objective of this analysis is to evaluate the accuracy of the predictions provided by the selected classifier. From the balanced dataset, the random forest classifier’s performance was quite strong compared to other selected classifiers. The hyper-parameter tuning and feature selection were carried out after the best model had been chosen. The important predictor of viral failure was established to compare the model’s performance.

Random forest with hyperparameter tuning

After selecting the best classifier, this study applied the hyperparameter tuning or optimation to compare it with the default hyperparameter tuning. Figure 8 shows that default hyperparameter tuning was higher performed than hyperparameter tuning using the best classifier of random forest. According to the results, the random forest classifier with tuned hyperparameters was less performed than the random forest with the default hyperparameter. Therefore, this study used a random forest with a default hyperparameter. A random forest with hyperparameter tuning of precision of 96.77%, f1-score 98.17%, the sensitivity of 99.61%, and specificity of 96.67%, as well as a random forest with default hyperparameter tuning of sensitivity, specificity, precision, and f1-score of 100%, 98.63%, 98.65%, and 99.32%, respectively. However, the random forest classifier with hyperparameter tuning surpassed all other classifiers in terms of AUC. The random forest classifier with default hyperparameter tuning had the highest sensitivity values, which means that the classifier properly identified viral unsuppressed or failed. Then this study applied the default hyperparameter tuning (Fig. 8).

Fig. 8
figure 8

Comparison of tuned and default hyperparameter using random forest classifier

Random Forest with selected features

This experiment was performed to examine the random forest classifier’s ability to predict virological failure and the impact of feature selection. Different Features selection methods were applied to compare the performance of the random forest classifier. Figure 9 shows that from the features selection method, random forest features importance was performed better than both Boruta feature selection and recursive feature elimination with a sensitivity of 100%, specificity of 98.4%, a precision of 98.5%, f1-score of 99.2%, and an AUC of 0.999.

Fig. 9
figure 9

Comparison features selection method using random forest classifier

Using these findings as a reference, for this study, random forest feature importance was chosen as the most appropriate method to select the features that most accurately predict virological failure. From the selection method, both Boruta feature selection and recursive feature elimination were less outperformed than random forest feature importance. As a result, this study used a random forest classifier as the feature importance selection method. Of all the features, CD4 count, ART regimen, age group, educational level, marital status, TPT discontinued, body mass index, regimen types, TPT started, duration of ART, sex, and CPT were highly significant for virological failure (Fig. 10).

Fig. 10
figure 10

Relevant features selected by a random forest feature importance

Association rule result

After selecting the relevant features by random forest feature importance, this study applied the association mining rules with an apriori algorithm for interpretation and a comparison of the best-chosen features. Then, from the association mining rules, nine association rules were found with more than 90% confidence value and the highest lift or interestingness. Of all the rules, the six most important rules were selected for predicting virological failure (Fig. 11).

Fig. 11
figure 11

Six important association rules

Rule 1 indicated that the studied client’s sex was male, the ART regin used TDF-3TC-EFV, has not taken Cotrimoxazole preventive therapy and the CD4 count of the patient had between 50 and 99 cells/mm3 then the possibility of developing virological failure is 92% of confidence.

Rule 2 means that being male, the client lived with ART for more than 10 years, did not take Cotrimoxazole preventive therapy and the CD4 count of the client was between 50 and 99 cells/mm3 then the client will a 91% chance of developing the virological failure.

Rule 3 means that if, a client whose sex was male and age was between 18 and 27, had a secondary educational level, has taken a first-line regimen, and had a CD4 count client had between 50 and 99 cells/mm3 then the probability of developing virological failure is 100% of confidence.

Rule 4 showed that if a client’s age was between 28 and 37, the ART regimen used TDF-3TC-EFV, the patient lived with ART for more than 10 years, not taking Cotrimoxazole preventive therapy and the CD4 count of the client was between 50 and 99 cells/mm3 then the possibility of developing virological failure is 100% of confidence.

Rule 5 means that if, a client whose sex was male and the client lived with ART for more than 10 years, has not taken Tb preventive therapy, and the CD4 count of the client was between 50 and 99 cells/mm3 then the client will be 100% confidence of developing the virological failure.

Rule 6 indicated that the studied clients’ sex was male, had a secondary educational level, have taken a first-line regimen of TDF-3TC-EFV, has not taken Cotrimoxazole preventive therapy and the CD4 count of the client was between 50 and 99 cells/mm3 then the possibility of developing virological failure is 100% of confidence.

Based on Association Rules results showed that males, aged between 18 and 27 and 28–37, secondary educational level, TDF-3TC-EFV, lived with ART for more than 10 years, not used TPT and CPT and CD4 count between 50 and 99 cells/mm3 were the most predictors of virological failure for this study. According to the above association rules, the independent and dependent features were strongly associated with one another because the lift values were more than one value.

Discussion

Ethiopia is stepping up efforts to find patients who need more services and interventions to meet the 95-95-95 targets at each stage of treatment [7]. The goal of this study has been to predict and identify the predictors of virological failure and build the best performance of a supervised machine learning classifier. Seven supervised machine learning classification algorithms such as Logistic Regression, K-Nearest Neighbors Classifier, Decision Tree Classifier, Random Forest Classifier, Gradient Boosting Classifier, XGBoost Classifier, and Support Vector Machine, were applied to predict virological failure in patients receiving antiretroviral therapy (ART), using electronic demographic, clinical, and treatment data.

To improve model prediction accuracy and generalizability, the above models were chosen to develop and verify the best predictive model using the important predictors. The classifiers have been trained on a set of training samples by splitting stratified 10-fold cross-validation with default hyperparameter tuning. Various experiments were done to find the best accuracy using both unbalanced and balanced datasets. The result showed that unbalanced data achieved low-performance metrics. Thus, this study compared imbalanced data handling techniques. A SMOTE balancing sampling technique was selected based on the performance of accuracy and AUC. The random forest classifier outperformed all other chosen classifiers with a performance evaluation of accuracy (99.31%), sensitivity (100%), precision (98.65%), f-1-score (99.32%), and AUC (0.999). As the result, this classifier was selected in our study for the prediction of virological failure. Despite the availability of limited research on virological failure with a random forest classifier, we have attempted to compare some related research on predicting HIV/AIDS status and virological response. The results showed that the random forest classifier had a better prediction performance on virological failure. This result is supported by the previous research study conducted in Switzerland in 2020 [57]. However, there was a variation in the performance evaluation metrics between our findings and the findings of Switzerland. A possible explanation might be due to the size of the dataset. A study conducted in Switzerland used a small dataset, while our study used relatively large datasets. The small dataset might affect the performance of the predictive model [76]. Another possible difference might be due to the number of features used. This study used twenty predictors, while the study conducted in Switzerland used seven predictors. The predictors could have an impact on how well the predictive model performs [55]. The study conducted in North Carolina, USA, suggested that using a random forest classifier can predict a patient’s probability of delayed linkage to care [77]. A study using a data mining application revealed that random forest outperformed predicting CD4 change [78].

This study determined the significant influencing factors for virological failure using the relevance values of independent features for the random forest classifier. Using the random forest feature selection method, the most important predictors that contribute to better performance in virological failure prediction were identified. Among all independent features, sex, age, CPT use, ART regimen used, duration of ART, TPT discontinued, educational status, and CD4 count were the significant predictors of virological failure.

Sex was a highly significant factor in predicting virological failure. Accordingly, male HIV patients were more likely to develop virological failure as compared to females. This study is in line with the research studies conducted in Kumasi, Ghana [26], Morocco [79], Dar Es Salaam, Tanzania [36], and the Tigray region, Ethiopia [80] that stated that virological failure was more common in male HIV patients than female HIV patients. The possible reasons might be excessive alcohol consumption, low clinic attendance for ART, and low health-seeking behavior in male patients [31, 36, 80].

The machine learning classifier identified that age was a highly important feature for predicting virological failure. Accordingly, younger HIV patients are more likely to develop virological failure than their counterparts. This finding is similar to the studies conducted in Mozambique [81], in localities of Ethiopia, Gondar [82], Kombolcha town [33], and Dessie [34], which reported that virological failure occurs more frequently in younger HIV patients than in older HIV patients. The possible reason could be the emotional instability of young HIV patients, which contributes to depression and a worry about telling their families, and friends, which has a negative impact on treatment outcomes [33, 34]. This study showed that educational level was significantly associated with virological failure. The patients who were educated to a secondary level were more likely to develop virological failure. These findings were supported by research conducted in Woldia, Ethiopia [83] and North West Ethiopia [84] and the study conducted on virologic outcomes in early antiretroviral treatment [85] reported that HIV patients who were educated to a secondary level had a significant association with virological failure. The possible reasons might be the level of counseling provided to educated patients who are assumed to know better, as well as the detrimental effects of social stigma and discrimination on educated individuals [83].

Duration on ART was an important feature for predicting virological failure. Patients who had a longer duration of stay on ART had a higher risk of developing virological failure. similarly, previous studies conducted in Waghimra, Northern Ethiopia, and Mettu, South West Ethiopia [22, 86] revealed that patients who had been on ART for a longer time were more susceptible to developing virological failure. The possible explanation might be due to the prolonged use of ART, which can increase the risk of developing poor adherence to treatment and unfavorable medication reactions [26].

Based on the finding of this study, the CD4 count of the respondent was found another relevant feature for predicting virological failure. As a result, patients who had low CD4 counts were more likely to develop virological failure. This finding is supported by research conducted in the Waghimra zone, Northern Ethiopia [86], Kombolcha town [33], Dessie, Northeast Ethiopia [87], and central Oromia [88], which reported that patients with low CD4 counts were more likely to experience virological failure. The effect of low CD4 counts makes patients more susceptible to opportunistic infections, which could increase the chance of virological failure [49].

Other relevant features of virological failure were co-trimoxazole preventive therapy and Tb preventive therapy. This study showed that patients who had not taken the Cotrimoxazole preventive therapy had a high chance of developing virological failure. The finding of this study is in line with the research conducted in Kenya [89] and the Amhara region of Ethiopia [90], which indicated that patients who have not received the Cotrimoxazole preventative medication are most susceptible to developing virological failure. HIV prevention drugs, such as Cotrimoxazole preventive therapy significantly reduce morbidity and mortality rates among patients with HIV [21]. However, the study conducted in China reported that using Cotrimoxazole preventive therapy was more likely to increase virological failure [91]. The possible explanation might be due to the cut-off point of the viral load. A study conducted in China showed a viral load cut-off point of greater than 400 copies/ml two or more consecutive times after 6 months of ART, while our study used a viral load cut-off point of greater than 1000 copies/ml two or more consecutive times after three to six months of ART. This research reveals that patients who have not taken TB preventive therapy are more likely to develop virological failure. The possible explanation might be that TPT plays a role in decreasing mycobacterium load and reducing the progression of latent bacilli to active TB. The increased mycobacterial load was linked to progressive impairment of mycobacterium-specific T cell response as well as an increase in the occurrence of active TB [92]. TB is an opportunistic infection for HIV patients, those patients are more likely to develop virological failure [93].

The ART regimen was highly important for predicting virological failure. This research study showed that patients who were taking the first-line ART regimen of TDF-3TC-EFV were more likely to develop virological failure. Findings from a research study which was conducted in Nepal [94] and Namibia [95] supported that patients who received the first-line ART regimen of TDF-3TC-EFV were more prone to virological failure. This idea is also supported by the WHO, which recommends starting ART treatment with DTG rather than EFV due to difficulties with the high prevalence of depression, dizziness, and treatment-related adverse effects [21].

In conclusion, our study demonstrates the feasibility of using machine learning methods to identify HIV patients at high risk of virologic failure and to determine predictive factors associated with virologic failure. Machine learning algorithms seem to work effectively for risk prediction and classification of ART treatment failure, but further refinements are needed. Nevertheless, our study model may contribute to the important public health issue of reaching and treating HIV-infected patients.

Machine learning can be a great help to clinicians who care for HIV patients. The proposed algorithm can predict viral failure in HIV patients with optimal ROC, accuracy, precision, sensitivity, and specificity. This prediction helps to optimize the use of hospital resources to treat high-risk patients, provide better quality care, and reduce medical errors caused by fatigue and long work hours in ART clinics. Designing effective predictive models may improve the quality of care and improve patient survival. Therefore, our study of predictive models of viral failure can make a significant contribution to identifying HIV patients at high risk of viral failure and adopting the most effective supportive and therapeutic regimens. This may reduce ambiguity by providing quantitative, objective, and evidence-based models for risk stratification, prediction, and ultimately care planning. In addition, the findings of this study may provide clinicians with better strategies to reduce complications and improve the chances of survival of HIV patients.

Strengths and Limitations of the study

One of the strengths of this work is that it incorporates seven supervised machine-learning classification algorithms. Despite its strength, this study is not without limitations. This study was performed in a single-site ART treatment registry database, which may limit the generalizability of the developed model. However, the dataset used in this study is a database collected at Gondar University Hospital, which provides specialized ART treatment services for HIV patients. Data were collected retrospectively, some data were incomplete or missing, and behavioral data such as smoking, alcohol consumption, exercise, and dietary habits were not analyzed in the study. For handling missing data, this study used simple imputation techniques. Moreover, this study used data that was unbalanced, meaning that the majority class and the minority class of dependent features were unequally distributed.

Conclusion

This study suggests that the random forest classifier outperformed based on the performance evaluation metrics. Male, younger age, longer time on ART, not taking CPT, not taking TPT, secondary educational level, TDF-3TC-EFV, and low CD4 counts were significant features for predicting virological failure.

Availability of data and materials

The datasets used during the current study are available from the corresponding author upon reasonable request.

Abbreviations

3TC:

Lamivudine

AIDS:

Acquired Immunodeficiency Syndrome

ART:

Antiretroviral Therapy

AUC:

Area Under Curve

AZT/ZDV:

Zidovudine

BMI:

Body Mass Index

CD4:

Cluster Differentiation 4

CPT:

Cotrimoxazole Prophylaxis Therapy

CSV:

Comma Separated Values

DTG:

Dolutegravir

DRV/r:

Ritonavir boosted Darunavir

EFV:

Efavirenz

HIV:

Human Immunodeficiency Virus

MRN:

Medical Record Number

RNA:

Ribonucleic Acid

SNNP:

South Nation Nationality People

UNAIDS:

United Nations Programme on Acquired Immune Deficiency Syndrome

WHO:

World Health Organization

XGBoost:

eXtreme Gradient Boosting

References

  1. UNAIDS, Global HIV. & AIDS statistics fact sheet. Geneva, Switzerland: Joint United Nations Programme on HIV/AIDS; 2021. Accessed on 28 July 2022. Available from: https://www.unaids.org/en/resources/fact-sheet.

  2. Lecher SL, Fonjungo P, Ellenberger D, Toure CA, Alemnji G, Bowen N, et al. HIV viral load monitoring among patients receiving antiretroviral therapy—eight Sub-Saharan Africa Countries, 2013–2018. Morb Mortal Wkly Rep. 2021;70(21):775.

    Article  CAS  Google Scholar 

  3. Joint United Nations Programme on HIV/AIDS (UNAIDS). Fast Track Strategy to end the AIDS epidemic by 2030. Geneva: UNAIDS; 2014. Accessed on 22 Mar 2022. Available from: https://www.unaids.org/sites/default/files/media_asset/JC2686_WAD2014report_en.pdf.

  4. World Health Organization. Geneva HIV/AIDS. Switzerland: WHO; 2021. Accessed on 2 Apr 2022. Available from: https://www.who.int/news-room/fact-sheets/detail/hiv-aids.

  5. Joint United Nations Programme on HIV/AIDS. HIV & AIDS statistics fact sheet. Geneva: UNAIDS; 2020. Accessed on 14 Apr 2022. Available from: https://www.unaids.org/en/regionscountries/countries/ethiopia.

  6. Central Statistical Agency - CSA/. Ethiopia ICF, Ethiopia Demographic and Health Survey 2016. Addis Ababa: CSA and ICF; 2017. Accessed on 23 Mar 2022. Available from: https://dhsprogram.com/pubs/pdf/FR328/FR328.pdf.

  7. Ethiopian Public Health Institute. HIV Estimates and projection for the year 2020 and 2021. Addis Abeba: EPHI. ; 2021. Accessed on 27 Apr 2022. Available from: https://ephi.gov.et/wp-content/uploads/2021/06/HIV-Estimates-and-projection-for-the-year-2020-and-2021.pdf.

  8. EFMOH. National consolidated guidelines for comprehensive HIV prevention, care and treatment. Addis Abeba2018. Accessed on 13 April 2022. Available from: https://www.afro.who.int/publications/national-consolidated-guidelines-comprehensive-hiv-prevention-care-and-treatment.

  9. Chhim K, Mburu G, Tuot S, Sopha R, Khol V, Chhoun P, et al. Factors associated with viral non-suppression among adolescents living with HIV in Cambodia: a cross-sectional study. AIDS Res therapy. 2018;15(1):1–10.

    Article  Google Scholar 

  10. World Health Organization. Consolidated guidelines on the use of antiretroviral drugs for treating and preventing HIV infection: recommendations for a public health approach. Geneva: World Health Organization; 2016.

    Google Scholar 

  11. World Health Organization. Guideline on when to start antiretroviral therapy and on pre-exposure prophylaxis for HIV. Geneva: World Health Organization; 2015. Accessed 14 Apr 2022. Available from: https://www.who.int/publications/i/item/9789241509565.

  12. Vidal JE, Song ATW, Matos ML, et al. High rate of virologic suppression with darunavir/ritonavir plus optimized background therapy among highly antiretroviral-experienced HIV-infected patients: results of a prospective cohort study in São Paulo, Brazil. Braz J Infect Dis. 2013;17(1):41–7.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Liégeois F, Vella C, Eymard-Duvernay S, et al. Virological failure rates and HIV-1 drug resistance patterns in patients on first-line antiretroviral treatment in semirural and rural Gabon. J Int AIDS Soc. 2012;15(2):17985.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Meriki HD, Tufon KA, Afegenwi MH, et al. Immuno-haematologic and virologic responses and predictors of virologic failure in HIV-1 infected adults on first-line antiretroviral therapy in Cameroon. Infect Dis Poverty. 2014;3:5.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Kitchen CM, Kitchen SG, Dubin JA, Gottlieb MS. Initial virological and immunologic response to highly active antiretroviral therapy predicts long-term clinical outcome. Clin Infect Dis. 2001;33(4):466–72.

    Article  CAS  PubMed  Google Scholar 

  16. McMahon JH, Elliott JH, Bertagnolio S, Kubiak R, Jordan MR. Viral suppression after 12 months of antiretroviral therapy in low-and middle-income countries: a systematic review. Bull World Health Organ. 2013;91(5):377–85.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Barth RE, van der Loeff MFS, Schuurman R, Hoepelman AI, Wensing AM. Virological follow-up of adult patients in antiretroviral treatment programmes in sub-saharan Africa: a systematic review. Lancet Infect Dis. 2010;10(3):155–66.

    Article  CAS  PubMed  Google Scholar 

  18. Abdissa A, Yilma D, Fonager J, Audelin AM, Christensen LH, Olsen MF, et al. Drug resistance in HIV patients with virological failure or slow virological response to antiretroviral therapy in Ethiopia. BMC Infect Dis. 2014;14(1):181.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Bernabé KJ, Siedner M, Tsai AC, Marconi VC, Murphy RA. Detection of HIV Virologic failure and switch to second-line therapy: a systematic review and Meta-analysis of Data from Sub-Saharan Africa. Open Forum Infect Dis. 2022;9(5):ofac121.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Petersen ML, Tran L, Geng EH, Reynolds SJ, Kambugu A, Wood R, et al. Delayed switch of antiretroviral therapy after virologic failure associated with elevated mortality among HIV-infected adults in Africa. AIDS. 2014;28(14):2097.

    Article  CAS  PubMed  Google Scholar 

  21. World Health Organization. Consolidated guidelines on HIV prevention, testing, treatment, service delivery and monitoring: recommendations for a public health approach. Geneva: World Health Organization; 2021. Accessed on 28 July 2022.

    Google Scholar 

  22. Zenu S, Tesema T, Reshad M, Abebe E. Determinants of first-line antiretroviral treatment failure among adult patients on treatment in Mettu Karl Specialized Hospital, South West Ethiopia; a case control study. PLoS ONE. 2021;16(10):e0258930.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Agegnehu CD, Merid MW, Yenit MK. Incidence and predictors of virological failure among adult HIV patients on first-line antiretroviral therapy in Amhara regional referral hospitals; Ethiopia: a retrospective follow-up study. BMC Infect Dis. 2020;20(1):460.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Murphy RA, Court R, Maartens G, Sunpath H. Second-line antiretroviral therapy in sub-saharan Africa: it is time to mind the gaps. AIDS Res Hum Retroviruses. 2017;33(12):1181–4.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Seyler L, Lacor P, Allard SD. Current challenges in the treatment of HIV. Pol Arch Intern Med. 2018;128(10):609–16.

    Article  PubMed  Google Scholar 

  26. Ansah D, Kumah E, Bawontuo V, Agyei-Baffour P, Afriyie EK. Determinants of viral load non-suppression among people living with HIV on antiretroviral therapy in Kumasi, Ghana. Ghana Med J. 2021;55(2):111–7.

    Article  Google Scholar 

  27. Bulage L, Ssewanyana I, Nankabirwa V, Nsubuga F, Kihembo C, Pande G, et al. Factors associated with virological non-suppression among HIV-positive patients on antiretroviral therapy in Uganda, August 2014–July 2015. BMC Infect Dis. 2017;17(1):1–11.

    Article  Google Scholar 

  28. Gupta-Wright A, Fielding K, van Oosterhout JJ, Alufandika M, Grint DJ, Chimbayo E, et al. Virological failure, HIV-1 drug resistance, and early mortality in adults admitted to hospital in Malawi: an observational cohort study. The Lancet HIV. 2020;7(9):e620–8.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Gumede SB, Venter WDF, Lalla-Edward ST. Understanding adherence in virally suppressed and unsuppressed human immunodeficiency virus-positive urban patients on second-line antiretroviral treatment. South Afr J HIV Med. 2020;21(1):1–0.

    Article  Google Scholar 

  30. Fentie Wendie T, Workneh BD. Prevalence and predictors of virological failure among adults living with HIV in South Wollo Zone, Northeast Ethiopia: a retrospective cohort study. HIV AIDS (Auckl). 2020;12:393–402.

    PubMed  Google Scholar 

  31. Hailu GG, Hagos DG, Hagos AK, Wasihun AG, Dejene TA. Virological and immunological failure of HAART and associated risk factors among adults and adolescents in the Tigray region of Northern Ethiopia. PLoS ONE. 2018;13(5):e0196259.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Negash H, Welay M, Legese H, Adhanom G, Mardu F, Tesfay K, et al. Increased virological failure and determinants among HIV patients on highly active retroviral therapy in Adigrat General Hospital, Northern Ethiopia, 2019: hospital-based cross-sectional study. Infect drug Resist. 2020;13:1863–72.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Meshesha HM, Nigussie ZM, Asrat A, Mulatu K. Determinants of virological failure among adults on first-line highly active antiretroviral therapy at public health facilities in Kombolcha town, Northeast, Ethiopia: a case–control study. BMJ Open. 2020;10(7):e036223.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Fentaw Z, Molla A, Wedajo S, Mebratu W. Determinants of virological failure among adult clients on first-line antiretroviral therapy in Amhara Regional State, Northeast Ethiopia. A case -control study. HIV AIDS (Auckl). 2020;12:749–56.

    PubMed  Google Scholar 

  35. Mziray SR, Kumburu HH, Assey HB, Sonda TB, Mahande MJ, Msuya SE, et al. Patterns of acquired HIV-1 drug resistance mutations and predictors of virological failure in Moshi, Northern Tanzania. PLoS ONE. 2020;15(9):e0232649.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Samizi FG, Panga OD, Mulugu SS, Gitige CG, Mmbaga EJ. Rate and predictors of HIV virological failure among adults on first-line antiretroviral treatment in Dar Es Salaam, Tanzania. J Infect Developing Ctries. 2021;15(06):853–60.

    Article  CAS  Google Scholar 

  37. Brnabic A, Hess LM. Systematic literature review of machine learning methods used in the analysis of real-world data for patient-provider decision making. BMC Med Inform Decision Making. 2021;21(1):1–19.

    Article  Google Scholar 

  38. Bzdok D, Altman N, Krzywinski M. Statistics versus machine learning. Nat Methods. 2018;15(4):233–4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Supriya M, Deepa AJ. Machine learning approach on healthcare big data: A review. Big Data Inform Anal. 2020;5(1):58–75.

    Article  Google Scholar 

  40. Ekpenyong ME, Etebong PI, Jackson TC. Fuzzy-multidimensional deep learning for efficient prediction of patient response to antiretroviral therapy. Heliyon. 2019;5(7):e02080.

    Article  PubMed  PubMed Central  Google Scholar 

  41. Bisaso KR, Anguzu GT, Karungi SA, Kiragga A, Castelnuovo B. A survey of machine learning applications in HIV clinical research and care. Comput Biol Med. 2017;91:366–71.

    Article  PubMed  Google Scholar 

  42. Orel E, Esra R, Estill J, Thiabaud A, Marchand-Maillet S, Merzouki A, et al. Prediction of HIV status based on socio-behavioural characteristics in East and Southern Africa. PLoS ONE. 2022;17(3):e0264429.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Uddin S, Khan A, Hossain ME, Moni MA. Comparing different supervised machine learning algorithms for disease prediction. BMC Med Inf Decis Mak. 2019;19(1):1–16.

    Google Scholar 

  44. Maskew M, Sharpey-Schafer K, De Voux L, Crompton T, Bor J, Rennick M, et al. Applying machine learning and predictive modeling to retention and viral suppression in South African HIV treatment cohorts. Sci Rep. 2022;12(1):12715.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Young SD, Yu W, Wang W. Toward automating HIV identification: machine learning for rapid identification of HIV-related social media data. J Acquir Immune Defic Syndr. 2017;74(Suppl 2):128.

    Article  Google Scholar 

  46. Nega J, Taye S, Million Y, Rodrigo C, Eshetie S. Antiretroviral treatment failure and associated factors among HIV patients on first-line antiretroviral treatment in Sekota, northeast Ethiopia. AIDS Res Ther. 2020;17(1):1–9.

    Article  Google Scholar 

  47. Mulisa D, Tesfa M, Mullu Kassa G, Tolossa T. Determinants of first line antiretroviral therapy treatment failure among adult patients on ART at central Ethiopia: un-matched case control study. BMC Infect Dis. 2019;19(1):1–13.

    Article  Google Scholar 

  48. Assemie MA, Alene M, Ketema DB, Mulatu S. Treatment failure and associated factors among first line patients on highly active antiretroviral therapy in Ethiopia: A systematic review and meta-analysis. Glob Health Res Policy. 2019;4(1):1–10.

    Article  Google Scholar 

  49. Ethiopian Federal Ministry of health. National Comprehensive HIV Prevention, Care and Treatment Training for Healthcare Providers. Addis Abeba; 2021. Accessed on 20 June 2022. Available from: https://repository.iifphc.org/handle/123456789/995.

  50. Pedregosa FaV G, Gramfort A, Michel V, Thirion BaG O, Blondel M, Prettenhofer P, Weiss RaD V, Vanderplas J, Passos A, Cournapeau DaB M, Perrot M, Duchesnay E. Scikit-learn Machine Learning in Python. 2011;12:2825–30.

    Google Scholar 

  51. Al-Shehari T, Alsowail RA. An insider data leakage detection using one-hot encoding, synthetic minority oversampling and machine learning techniques. Entropy. 2021;23(10):1258.

    Article  PubMed  PubMed Central  Google Scholar 

  52. Muller AC, Guido S. Introduction to machine learning with Python: a guide for data scientists. USA: O’Reilly Media, Inc.; 2016.

    Google Scholar 

  53. McKinney W. Python for data analysis: Data wrangling with Pandas, NumPy, and IPython. USA: O’Reilly Media, Inc.; 2012.

    Google Scholar 

  54. Gnana DAA, Balamurugan SAA, Leavline EJ. Literature review on feature selection methods for high-dimensional data. Int J Comput Appl. 2016;136(1):9–17.

    Google Scholar 

  55. Cai J, Luo J, Wang S, Yang S. Feature selection in machine learning: a new perspective. Neurocomputing. 2018;300:70–9.

    Article  Google Scholar 

  56. Saxena A, Ganguly A, Shrivastava AK. Predicting Chronic Kidney Disease Risk Using Recursive Feature Elimination and Machine Learning. 2020;4(5):1–7.

    Google Scholar 

  57. Kamal S, Urata J, Cavassini M, Liu H, Kouyos R, Bugnon O, et al. Random forest machine learning algorithm predicts virologic outcomes among HIV infected adults in Lausanne, Switzerland using electronically monitored combined antiretroviral treatment adherence. AIDS care. 2021;33(4):530-6.

    Article  PubMed  Google Scholar 

  58. Olayemi Olufunke C, Olasehinde Olayemi O, Agbelusi O. Predictive model of pediatric hiv/aids survival in nigeria using support vector machine. Communications. 2016;5:29–36.

    Google Scholar 

  59. Mohammed M, Khan MB, Bashier EBM. Machine Learning: Algorithms and Applications (1st ed). CRC Press. 2016. https://doi.org/10.1201/9781315371658.

  60. Alghamdi M, Al-Mallah M, Keteyian S, Brawner C, Ehrman J, Sakr S. Predicting diabetes mellitus using SMOTE and ensemble machine learning approach: the Henry Ford ExercIse Testing (FIT) project. PLoS ONE. 2017;12(7):e0179805.

    Article  PubMed  PubMed Central  Google Scholar 

  61. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. 2002;16:321–57.

    Article  Google Scholar 

  62. Desalegn M, Seyoum D, Tola EK, Tsegaye Gayesa R. Determinants of first-line antiretroviral treatment failure among adult HIV patients at Nekemte Specialized Hospital, Western Ethiopia: Unmatched case-control study. SAGE Open Med. 2021;9:20503121211030184.

    Article  PubMed  PubMed Central  Google Scholar 

  63. Wang S, Dai Y, Shen J, Xuan J. Research on expansion and classification of imbalanced data based on SMOTE algorithm. Sci Rep. 2021;11(1):1–11.

    Article  Google Scholar 

  64. Haibo He, Yang Bai, Garcia EA, Shutao Li. "ADASYN: Adaptive synthetic sampling approach for imbalanced learning" 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence). Hong Kong. 2008;1322-1328. https://doi.org/10.1109/IJCNN.2008.4633969.

  65. Kaur H, Pannu HS, Malhi AK. A systematic review on imbalanced data challenges in machine learning: applications and solutions. ACM Comput Surv (CSUR). 2019;52(4):1–36.

    Google Scholar 

  66. Kuhn M, Johnson K. Applied predictive modeling. New York: Springer; 2013.

    Book  Google Scholar 

  67. Bisaso KR, Anguzu GT, Karungi SA, Kiragga A, Castelnuovo B. A survey of machine learning applications in HIV clinical research and care. Comput Biol Med. 2017;91:366–71.

    Article  PubMed  Google Scholar 

  68. Cai Q, Yuan R, He J, Li M, Guo Y. Predicting HIV drug resistance using weighted machine learning method at target protein sequence-level. Mol Diversity. 2021;25(3):1541–51.

    Article  CAS  Google Scholar 

  69. Olatosi B, Sun X, Chen S, Zhang J, Liang C, Weissman S, et al. Application of machine-learning techniques in classification of HIV medical care status for people living with HIV in South Carolina. Aids. 2021;35(Suppl 1):S19-S28.

    Article  PubMed  Google Scholar 

  70. Saito T, Rehmsmeier M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS ONE. 2015;10(3):e0118432.

    Article  PubMed  PubMed Central  Google Scholar 

  71. Chicco D, Jurman G. The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genomics. 2020;21(1):1–13.

    Article  Google Scholar 

  72. Agrawal R, Imieliński T, Swami A. Mining association rules between sets of items in large databases. ACM SIGMOD Record. 1993;22(2):207–16.

    Article  Google Scholar 

  73. Harahap M, Husein AM, Aisyah S, Lubis FR, Wijaya BA, editors. Mining association rule based on the diseases population for recommendation of medicine need. J Physics Conf Ser; 2018. p. 012017.

  74. Altaf W, Shahbaz M, Guergachi A. Applications of association rule mining in health informatics: a survey. Artif Intell Rev. 2017;47(3):313–40.

    Article  Google Scholar 

  75. Khare S, Gupta D. “Association rule analysis in cardiovascular disease” 2016 Second International Conference on Cognitive Computing and Information Processing (CCIP). Mysuru, India. 2016;1–6. https://doi.org/10.1109/CCIP.2016.7802881.

  76. Lateh Masitah Abdul, Muda Azah Kamilah, Yusof Zeratul Izzah M, Muda Noor Azilah, Azmi Mohd Sanusi, editors. Handling a Small Dataset Problem in Prediction Model by employ Artificial Data Generation Approach. Rev J Phys Conf Ser. IOP Publishing; 2017.

  77. Chen S, Owolabi Y, Dulin M, Robinson P, Witt B, Samoff E. Applying a machine learning modelling framework to predict delayed linkage to care in patients newly diagnosed with HIV in Mecklenburg County, North Carolina, USA. AIDS. 2021;35(Suppl 1):29–S38.

    Article  Google Scholar 

  78. Kebede M, Zegeye DT, Zeleke BM. Predicting CD4 count changes among patients on antiretroviral treatment: application of data mining techniques. Comput Methods Programs Biomed. 2017;152:149–57.

    Article  PubMed  Google Scholar 

  79. Hicham T, Ilyas E, Tarik H, Noureddine B, Omar B, Rachid F, et al. Risk factors associated with unsuppressed viral load in HIV-1 infected patients at the first antiretroviral therapy in Morocco. Int J Mycobacteriology. 2019;8(2):113–7.

    Google Scholar 

  80. Desta AA, Woldearegay TW, Futwi N, Gebrehiwot GT, Gebru GG, Berhe AA, et al. HIV virological non-suppression and factors associated with non-suppression among adolescents and adults on antiretroviral therapy in northern Ethiopia: a retrospective study. BMC Infect Dis. 2020;20(1):1–10.

    Article  Google Scholar 

  81. Rupérez M, Pou C, Maculuve S, Cedeno S, Luis L, Rodríguez J, et al. Determinants of virological failure and antiretroviral drug resistance in Mozambique. J Antimicrob Chemother. 2015;70(9):2639–47.

    Article  PubMed  Google Scholar 

  82. Bayu B, Tariku A, Bulti AB, Habitu YA, Derso T, Teshome DF. Determinants of virological failure among patients on highly active antiretroviral therapy in University of Gondar Referral Hospital, Northwest Ethiopia: a case–control study. Hiv/aids (Auckland NZ). 2017;9:153–9.

    Google Scholar 

  83. Babo YD, Alemie GA, Fentaye FW. Predictors of first-line antiretroviral therapy failure amongst HIV-infected adult clients at Woldia Hospital, Northeast Ethiopia. PLoS ONE. 2017;12(11):e0187694.

    Article  PubMed  PubMed Central  Google Scholar 

  84. Agezew T, Tadesse A, Derseh L, Yimer M. Incidence and predictors of first line anti-retroviral therapy failure among adults receiving HIV care in North West Ethiopia: a hospital-based follow-up study. J Infect Dis Epidemiol. 2019;5(2):345.

    Google Scholar 

  85. Eshleman SH, Wilson EA, Zhang XC, Ou S-S, Piwowar-Manning E, Eron JJ, et al. Virologic outcomes in early antiretroviral treatment: HPTN 052. HIV Clin Trial. 2017;18(3):100–9.

    Article  CAS  Google Scholar 

  86. Emagnu A, Abay Z, Bulti AB, Animut Y. Determinants of virologic failure among adult HIV patients on first-line antiretroviral therapy at waghimra zone, northern Ethiopia: a case-control study. Advances in Public Health. 2020;2020:1–8.

    Article  Google Scholar 

  87. Ahmed M, Merga H, Jarso H. Predictors of virological treatment failure among adult HIV patients on first-line antiretroviral therapy in Woldia and Dessie hospitals, Northeast Ethiopia: a case-control study. BMC Infect Dis. 2019;19(1):1–7.

    Article  Google Scholar 

  88. Abdullahi IJ, Deybasso HA, Adlo AM. Determinants of virological failure among patients on first-line antiretroviral therapy in central Oromia, Ethiopia: a case–control study. Hiv/aids (Auckland NZ). 2020;12:931–9.

    Google Scholar 

  89. Polyak CS, Yuhas K, Singa B, Khaemba M, Walson J, Richardson BA, et al. Cotrimoxazole prophylaxis discontinuation among antiretroviral-treated HIV-1-infected adults in Kenya: a randomized non-inferiority trial. PLoS Med. 2016;13(1):e1001934.

    Article  PubMed  PubMed Central  Google Scholar 

  90. Agegnehu CD, Merid MW, Yenit MK. Incidence and predictors of virological failure among adult HIV patients on first-line antiretroviral therapy in Amhara regional referral hospitals; Ethiopia: a retrospective follow-up study. BMC Infect Dis. 2020;20(1):1–14.

    Article  Google Scholar 

  91. Qin S, Lai J, Zhang H, Wei D, Lv Q, Pan X, et al. Predictive factors of viral load high-risk events for virological failure in HIV/AIDS patients receiving long-term antiviral therapy. BMC Infect Dis. 2021;21(1):1–11.

    Article  Google Scholar 

  92. Ayele HT, Mourik MSv, Debray TP, Bonten MJ. Isoniazid prophylactic therapy for the prevention of tuberculosis in HIV infected adults: a systematic review and meta-analysis of randomized trials. PLoS ONE. 2015;10(11):e0142290.

    Article  PubMed  PubMed Central  Google Scholar 

  93. Getaneh T, Negesse A, Dessie G, Desta M. The impact of tuberculosis co-infection on virological failure among adults living with HIV in Ethiopia: A systematic review and meta-analysis. J Clin Tuberc Other Mycobact Dis. 2022;27:100310.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  94. Ojha CR, Shakya G, Dumre SP. Virological and immunological status of the people living with HIV/AIDS undergoing ART treatment in Nepal. Biomed Res Int. 2016;2016:6817325.

    Article  PubMed  PubMed Central  Google Scholar 

  95. Mouton DC, Gonzo M, Mukesi M. Viral load suppression as a surrogate marker of treatment success in patients on antiretroviral therapy in Namibia. Edorium J Biomed Sci. 2017;2:1–8.

    Google Scholar 

Download references

Acknowledgements

We would like to thank the University of Gondar, the College of Medicine and Health Science, and the Institution of Public Health for allowing us to undertake this study and the University of Gondar Comprehensive and Specialized Hospital for providing us with data collection to undertake this research study. Our gratitude goes to those who work in the ART service at the University of Gondar Hospital, especially the data clerks who work in the ART clinic for their willingness to support the extraction of data from the database.

Funding

No funding was obtained for this study

Author information

Authors and Affiliations

Authors

Contributions

MF and TB contributed to data collection. DN performed the data management, data extraction, and data analysis. TM and YS participated in the data interpretation. DN and MS drafted the original manuscript.TM, YS, MF, MS TB, and AD revised all subsequent versions of the manuscript. All authors read and approved the final manuscript. The authors ( Agmasie Damtew Walle) had substantial contributions during the revision of the manuscript, especially in responding to reviewers’ comments. Hence, we need to add him to this manuscript for their contribution.

Corresponding author

Correspondence to Daniel Niguse Mamo.

Ethics declarations

Ethics approval and consent to participate

The University of Gondar’s Institutional Ethical Review Committee Board (Reference No: IPH/2107/2014) gave the study approval, and the board granted its Ethical clearance. Due to the retrospective nature of the study, informed consent was not used. However, the confidentiality of the data was maintained by keeping the extracted information private and by ensuring that it could not be used for any other purpose than study-related purposes. The Institutional Ethical Review Board of University of Gondar in Ethiopia approved participant’s informed consent waiver. In addition to this, all methods were carried out following relevant guidelines and regulations.

Consent for publication

Not applicable

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mamo, D.N., Yilma, T.M., Fekadie, M. et al. Machine learning to predict virological failure among HIV patients on antiretroviral therapy in the University of Gondar Comprehensive and Specialized Hospital, in Amhara Region, Ethiopia, 2022. BMC Med Inform Decis Mak 23, 75 (2023). https://doi.org/10.1186/s12911-023-02167-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12911-023-02167-7

Keywords