Skip to main content

Gut microbiome-based machine learning for diagnostic prediction of liver fibrosis and cirrhosis: a systematic review and meta-analysis

Abstract

Background

Invasive detection methods such as liver biopsy are currently the gold standard for diagnosing liver cirrhosis and can be used to determine the degree of liver fibrosis and cirrhosis. In contrast, non-invasive diagnostic methods, such as ultrasonography, elastography, and clinical prediction scores, can prevent patients from invasiveness-related discomfort and risks and are often chosen as alternative or supplementary diagnostic methods for liver fibrosis or cirrhosis. However, these non-invasive methods cannot specify the pathological grading and early diagnosis of the lesions. Recent studies have revealed that gut microbiome-based machine learning can be utilized as a non-invasive diagnostic technique for liver cirrhosis or fibrosis, but there is no evidence-based support. Therefore, this study conducted a systematic review and meta-analysis for the first time to investigate the accuracy of machine learning based on the gut microbiota in the prediction of liver fibrosis and cirrhosis.

Methods

A comprehensive and systematic search of publications published before April 2th, 2023 in PubMed, Cochrane Library, Embase, and Web of Science was conducted for relevant studies on the application of gut microbiome-based metagenomic sequencing modeling technology to the diagnostic prediction of liver cirrhosis or fibrosis. A bivariate mixed-effects model and Stata software 15.0 were adopted for the meta-analysis.

Results

Ten studies were included in the present study, involving 11 prediction trials and 838 participants, 403 of whom were fibrotic and cirrhotic patients. Meta-analysis showed the pooled sensitivity (SEN) = 0.81 [0.75, 0.85], specificity (SEP) = 0.85 [0.77, 0.91], positive likelihood ratio (PLR) = 5.5 [3.6, 8.7], negative likelihood ratio (NLR) = 0.23 [0.18, 0.29], diagnostic odds ratio (DOR) = 24 [14, 41], and area under curve (AUC) = 0.86 [0.83–0.89]. The results demonstrated that machine learning methods had excellent potential to analyze gut microbiome data and could effectively predict liver cirrhosis or fibrosis. Machine learning provides a powerful tool for non-invasive prediction and diagnosis of liver cirrhosis or liver fibrosis, with broad clinical application prospects. However, these results need to be interpreted with caution due to limited clinical data.

Conclusion

Gut microbiome-based machine learning can be utilized as a practical, non-invasive technique for the diagnostic prediction of liver cirrhosis or fibrosis. However, most of the included studies applied the random forest algorithm in modeling, so a diversified prediction system based on microorganisms is needed to improve the non-invasive detection of liver cirrhosis or fibrosis.

Peer Review reports

Introduction

Liver fibrosis is a pathologic change characterized by an excessive accumulation of liver fibrous tissues resulting from multiple pathogenic factors, including alcohol, medicine, and the hepatitis virus. The unceasing progression of liver fibrosis will cause liver cirrhosis [1], which is a dynamic progress [2]. Unlike liver fibrosis, liver cirrhosis is non-reversible and often causes a poor outcome in cirrhotic patients [3]. Most patients with liver cirrhosis are clinically asymptomatic before the decompensatory stage, so the disease goes undetected and cannot be promptly diagnosed [4]. Therefore, early identification of liver fibrosis is significant. Presently, the gold standard for diagnosing fibrosis is still liver biopsy. However, this invasive detection method may cause such complications as bleeding, biliary peritonitis, and pneumothorax during operation; its histologic assessment may be affected by high sampling-rate errors [5, 6]. Additionally, the liver biopsy must be performed in strict accordance with relevant operation specifications, and its implementation often depends on hospitals’ medical conditions and technological levels. Consequently, this approach cannot be used for large-scale disease screening or assessment in routine care. Although various imaging methods can, to a certain extent, show the grade of liver fibrosis and cirrhosis with relatively reliable accuracy [7], they cannot precisely identify early liver fibrosis [8, 9]. Furthermore, they have some shortages in diagnosis. For instance, magnetic resonance elastography (MRE) requires expensive equipment and specialized knowledge; it sets varying cut-off values of diagnostic data for liver fibrosis caused by different causes. Transient elastography (TE) is less accurate in identifying patients with such complications as ascites and narrow intercostal space [10]. As a result, more accurate non-invasive techniques are urgently needed to address the clinical demand for early detection and severity assessment of liver fibrosis and cirrhosis.

The gut microbiota, a new testing target, has been adopted to diagnose liver fibrosis or cirrhosis in recent years. Gut microbiota analysis, combined with other non-invasive tests, such as Fib-4 and the NAFLD fibrosis score, can detect and stage liver fibrosis or cirrhosis and identify patients at a high risk of progressing to the advanced stage. A previous study found that gut microbiome analysis may be a potential method for diagnosing liver-related diseases with high accuracy [11]. For example, patients with advanced liver fibrosis have fewer clostridium and ruminococci, while cirrhotic patients have more prevotella [12,13,14]. During the progression of liver fibrosis and cirrhosis, the content of escherichia coli varies before portal hypertension, suggesting that changes in gut microbial are helpful for diagnosing liver fibrosis [15]. In addition, the gut microbiome may play a key role in the disease progression of liver fibrosis or cirrhosis. Muegg et al. [13] found that the α diversity in the gut microbiota significantly declined with the progression of liver cirrhosis, indicating that the changes in the gut microbiota may be closely correlated with the severity of liver cirrhosis. Walker et al. [15] discovered that intrahepatic cholestasis during the progression of liver cirrhosis may be correlated with abundant Gram-negative bacteria, a kind of Salmonella.

Changes in the gut microbiota play a crucial role in developing liver fibrosis or cirrhosis [16]. Recently, the development of machine learning (ML) techniques based on gut microbiome has brought prospects for liver fibrosis and cirrhosis prediction. Common ML algorithms include random forest (RF), logistic regression (LR), decision trees (DT), and support vector machines (SVM). RF, compared to other algorithms, is more advantageous due to high accuracy, a lower risk of over-fitting, automatic selection of important features, and efficient processing of large-scale data sets, which can improve the accuracy and reliability of the diagnostic prediction and reduce the possibility of misdiagnosis [17, 18]. Traditional diagnostic methods for liver disease, such as liver puncture, are usually invasive, whereas ML-based intestinal flora analysis only requires patients’ stool samples and is noninvasive, reducing their discomfort and risks. In addition, it has the potential to identify early signs of liver cirrhosis or fibrosis, facilitate early intervention and treatment, and reduce the risk of disease progression and complications. Meanwhile, it can conduct personalized diagnoses based on each patient’s intestinal microbiome characteristics, helping to develop tailored treatment plans given individual differences [19]. Therefore, this study, for the first time, aims to assess the efficiency of the gut microbiome-based ML model in diagnosing liver cirrhosis and fibrosis, providing new insights into the accurate and non-invasive detection of liver fibrosis and cirrhosis.

Materials and methods

This meta-analysis was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses Statement. The study protocol was designed in the INPLASY - International Platform of Registered Systematic Review and Meta-analysis Protocols (INPLASY202250133).

Literature search

A systematic search of publications published before April 2th, 2023 in PubMed, Cochrane, Embase, and Web of Science was conducted to collect studies on the application of gut microbiome-based metagenomic sequencing modeling technology to diagnostic prediction of liver cirrhosis or fibrosis. Search terms mainly included Gastrointestinal Microbiome [Mesh], Liver Cirrhosis [Mesh], and Machine Learning [Mesh]. The detailed search strategy is presented in Table S1.

Literature inclusion and exclusion criteria

The inclusion criteria were as follows:

  1. (1)

    Study type: case-control study, cohort study, nest case-control study, and case-cohort study;

  2. (2)

    Studies that completely constructed a ML model based on intestinal microorganisms for the diagnosis of liver fibrosis or cirrhosis;

  3. (3)

    Studies without external validation were also included in our systematic review;

  4. (4)

    Studies reported in English.

Exclusion criteria were as follows:

  1. (1)

    Study type: meta-analysis, review, guide, and expert opinion;

  2. (2)

    Although a ML model for diagnosing liver fibrosis and cirrhosis was constructed in original studies, its modeling variables did not include intestinal microorganisms;

  3. (3)

    Studies that lack the following outcome indicators for the prediction accuracy of risk models: ROC, c-statistic, c-index, sensitivity, specificity, accuracy, recovery rate, precision rate, confusion matrix, diagnostic four-grid table, F1 score, and calibration curve;

  4. (4)

    Research on single-factor analysis of diagnostic accuracy.

Literature screening and data extraction

The retrieved studies were imported into the EndNote software. Duplicate publications were marked automatically and manually and then deleted. Titles and abstracts were checked to select potentially eligible studies. Their full texts were downloaded and reviewed to select eligible studies for our systematic review. A basic information spreadsheet was developed before data extraction. The extracted data mainly involved title, first author, publication date, study design, nation, diagnostic criteria, sample size, cohort, model types, external validation, number of types of intestinal microbiota, outcome measures, and information on participants such as mean age and gender. The literature screening and data extraction were completed independently by two researchers, and their results were cross-checked. Any disputes were adjudicated by a third researcher.

Quality assessment

Two researchers (Xiaopei Liu and Dan Liu) performed the quality assessment for the ten eligible articles using the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) tool [20]. The QUADAS-2 tool consists of four domains: index test, patient selection, flow and time, and reference standard. Each domain can be graded as an unclear (yellow), high (red), or low (green) risk bias. Likewise, the applicability is also graded as unclear, high, or low risk in the first three domains [21].

Statistical methods

In a systematic review based on ML, inconsistent modeling parameters in original studies are a primary source of heterogeneity. Therefore, the present study used the command of “Midas” (StataCorp LLC, College Station, TX) in Stata 15.0 to fit the bivariate mixed-effects model for assessing the sensitivity (SEN), positive likelihood ratio (PLR), specificity (SEP), diagnostic odds ratio (DOR), and negative likelihood ratio (NLR). Furthermore, we calculated estimates with 95% confidence intervals (95% CIs), drew the summary receiver operating characteristic curves (SROC), and calculated the area under curve (AUC) with 95% CI. Deek’s funnel plot was employed to detect publication bias, and Q statistic and I2 statistic were used for the heterogeneity test. An I2 > 50% indicated significant heterogeneity. A value of p less than 0.05 was considered statistically significant. A value of p more than or equal to 0.05 indicated no statistical difference.

Results

Literature retrieval results

Initially, 402 articles were identified in database searches, and duplicates were removed using the EndNote software. After the titles and abstracts screening, there were 26 articles left. Based on a full-text review, ten studies comprising 11 prediction trials were finally included [22,23,24,25,26,27,28,29,30,31]. The literature selection process is shown in Fig. 1.

Fig. 1
figure 1

Flowchart of the literature selection

Basic characteristics and quality assessment of included literature

This meta-analysis included ten articles involving 11 trials and 838 patients. The main ML algorithm was RF. Five trials conducted external validations, and all the included trials were considered high quality. The basic characteristics of included trials are presented in Tables 1 and 2. The quality assessment results are shown in Fig. 2.

Table 1 Baseline characteristics
Table 2 Baseline characteristics
Fig. 2
figure 2

Quality assessment of the included studies. Risk bias and applicability concerns graph (a). Risk bias and applicability concerns summary (b). The red, yellow, and green colors indicate high, unclear, and low risk, respectively

Results of meta-analysis

Overall statistics of liver fibrosis and cirrhosis

A bivariate random-effects model was employed to pool statistics. The results were: SEN = 0.81 (95% CI: 0.75–0.85), SPE = 0.85 (95% CI: 0.77–0.91), PLR = 5.55 (95% CI: 3.55–8.66), NLR = 0.23 (95% CI: 0.18–0.29) (Fig. 3a-c), DOR = 24.39 (95% CI:14.46–41.13) and SROC = 0.86 (95% CI: 0.83–0.89). The results suggested that the overall diagnostic accuracy was high (Fig. 3d).

Fig. 3
figure 3

Forest plot of SEN and SPE for the diagnosis of liver fibrosis and cirrhosis using gut microbiome-based ML (a); Forest plot of likelihood ratios for the diagnosis of liver fibrosis and cirrhosis gut microbiome-based ML (b); Forest plot of DOR for the diagnosis of liver fibrosis and cirrhosis using gut microbiome-based ML (c); Forest plot of SROC for the diagnosis of liver fibrosis and cirrhosis using gut microbiome-based ML (d). In figure (d), the full line represents the SROC curve; numerical circles represent the included prediction trials; the red rhombus indicates the point estimate of sensitivity/specificity; and the dotted line indicates the 95% confidence intervals (95% CI: 0.83–0.89)

Summary statistics for liver fibrosis and cirrhosis as well as liver biopsy diagnoses

A bivariate random effects model was adopted to pool the statistics. The results were as follows: (1) Liver fibrosis: SEN = 0.77 (95% CI: 0.69–0.82), SPE = 0.84 (95% CI: 0.71–0.92), and SROC = 0.80 (95% CI: 0.76–0.83); (2) Liver cirrhosis: SEN = 0.84 (95% CI: 0.77–0.891), SPE = 0.85 (95% CI: 0.74–0.92), and SROC = 0.91 (95% CI: 0.88–0.93); (3) Liver biopsy: SEN = 0.82(95% CI: 0.76–0.88), SPE = 0.87(95% CI: 0.76–0.93), and SROC = 0.89(95% CI: 0.86–0.92). (Figure S1a-d and Figure S2a, b). The results indicated that gut microbiome-based ML was more efficient in diagnosing liver cirrhosis than liver fibrosis. Excluding studies with non-liver biopsy diagnosis had no significant impact on the original results.

Heterogeneity test

The heterogeneity was considered significant [I2 = 91%, (95%CI: 83–100)]. As shown in Fig. 4a, two of the 11 included trials were located outside the box diagram, indicating that these two trials might be the primary sources of heterogeneity. To further examine whether different diagnostic criteria have an impact on heterogeneity, heterogeneity test was performed again after excluding non-liver biopsy studies. The results showed I2 = 93, 95% CI = [86–99], as shown in Supplementary Fig. S4.

Fig. 4
figure 4

Bivariate box plot for the diagnosis of liver fibrosis and cirrhosis using gut microbiome-based ML (a); Deek’s funnel plot for the diagnosis of liver fibrosis and cirrhosis gut microbiome-based ML (b); Fagan diagram for the diagnosis of liver fibrosis and cirrhosis using gut microbiome-based ML (c). In figure (b), numerical circles represent the included prediction trials; the dotted line represents the regression line. In figure (c), the green rhombus represents the prior probability; the full red line represents the probability of the diagnosis of liver fibrosis and cirrhosis; the gray dotted line represents the probability of having liver fibrosis or cirrhosis

Publication bias

According to Deek’s funnel plot (Fig. 4b), no significant publication bias was observed (p = 0.56). This suggested that our meta-analysis results were reliable.

Verification for diagnostic efficiency

The posterior probability was calculated according to the Fagan diagram (Fig. 4c). If the immediate probability was 20%, the probability of gut microbiome-based ML for diagnosing liver fibrosis and cirrhosis was 58%. If a negative result was reached, the probability of existing liver fibrosis or cirrhosis was 5%. The results indicated that gut microbiome-based ML could be remarkably accurate in diagnosing liver fibrosis or cirrhosis.

Discussion

Gut microbiota, also known as intestinal flora or intestinal microecosystem, refers to the microbial community in the intestines, including bacteria, fungi, viruses, and other microorganisms [32]. Researchers have become increasingly interested in the association between gut microorganisms and liver diseases. That is because the liver has the closest correlation with the gut and is easily exposed to many bacterial components and metabolites. Gut microbes interact and restrict each other to maintain the homeostatic balance of intestinal flora. When the homeostasis becomes unbalanced, the disordered flora can interact with the host immune system through the portal vein and bile secretion system, resulting in the occurrence and progression of hepatitis, fibrosis, and cirrhosis [32, 33]. The changes in the intestinal flora may provide novel insights into the exploration of new medical interventions. Several studies have proved that gut microbiota can be a diagnostic and therapeutic target for liver diseases [34, 35]. These findings suggest that gut microbiome-based ML models have enormous potential for the diagnosis of liver fibrosis and cirrhosis. Compared to non-invasive diagnostic methods such as ultrasonography, elastography and clinical predictive scoring, gut microbiome-based diagnostic techniques have some potential advantages in diagnosing liver fibrosis and cirrhosis. Especially, such techniques can provide early biomarkers to facilitate early diagnosis and intervention, whereas traditional methods may be not able to detect diseases until they progress to a later stage or symptoms appear [19].

Main findings

Ten studies were included in this systematic review, and one of them involved two prediction trials. Therefore, the present study involved 11 prediction trials and 838 patients, including 403 patients with liver cirrhosis or fibrosis and 435 patients with other diseases. The present study showed the gut microbiome-based ML method has high sensitivity, specificity, and accuracy in predicting liver fibrosis or cirrhosis (sensitivity: 0.81, specificity: 0.85, AUC: 0.86). In clinical practice, a high AUC (Area Under the Curve) value indicates better predictive performance of the model. However, it is essential to consider sensitivity and specificity along with it. An AUC of 0.86, a sensitivity of 0.81, and a specificity of 0.85 indicate excellent predictive capabilities, leading to improved clinical diagnostic performance. This suggests that machine learning methods based on gut microbiota show strong predictive performance for liver cirrhosis and fibrosis. It implies that a machine learning-based gut microbiome analysis, as a non-invasive diagnostic approach for liver cirrhosis and fibrosis, holds significant clinical applicability and can offer better decision-making support for healthcare professionals.

According to the verification results, in the case of liver fibrosis or cirrhosis diagnosis based on gut microbes, the probability of liver fibrosis or cirrhosis is 85%; otherwise, a negative result indicates a 19% chance of liver fibrosis or cirrhosis. The results reveal that the gut microbiome-based ML method is highly precise for diagnosing liver fibrosis or cirrhosis. The subgroup analysis by liver cirrhosis and fibrosis showed that the SEN, SPE, and SROC for liver fibrosis were 0.77, 0.87, and 0.80, respectively; the SEN, SPE, and SROC for liver cirrhosis were 0.84, 0.85, and 0.90, respectively. These results reveal that the gut microbiome-based ML method is more accurate in diagnosing liver cirrhosis than liver fibrosis. This finding is also confirmed by the research of Tien S. Dong et al. [23]. They compared patients with advanced liver fibrosis with those with mild liver fibrosis and found that patients with advanced fibrosis had obvious microbiota characteristics, and this result was not affected by the cause of liver disease. After adjusting for other covariates, this conclusion is established. The SEN, SPE, and SROC for liver biopsy were 0.82, 0.87, and 0.89, respectively. However, there was no significant difference between the diagnostic performance of liver biopsy and overall diagnostic performance, suggesting that excluding studies with non-liver biopsy diagnostic criteria had no significant impact on the diagnostic performance of gut microbiome-based machine learning methods. These results are visualized in a histogram (Figure S3), which can intuitively reflect their differences.

In contrast, non-invasive diagnostic methods, such as ultrasonography, elastography, and clinical predictive scoring systems, protect patients from invasiveness-related discomfort and risks and are often used as alternative or complementary tools for diagnosing liver fibrosis or cirrhosis. Elastography has become a preferred non-invasive imaging technique for the clinical assessment of liver fibrosis and cirrhosis. Liver stiffness measurement (LSM) provided by elastography is an alternative quantitative biomarker for fibrosis and cirrhosis burden in chronic liver disease. Elastography can be performed with ultrasound or MRI. TE (FibroScan, Echosens) is an ultrasound-based imaging technology most commonly used to assess fibrosis in patients in clinical practice. The AUROC value of TE for cirrhosis diagnosis (0.95–0.97) was higher than that for significant fibrosis (0.80) [36], which is consistent with the results of our subgroup analysis (Supplementary Fig. S1). Magnetic resonance (MR) elastography has been shown to accurately diagnose liver fibrosis and cirrhosis. MR elastography is similarly accurate in diagnosing significant fibrosis and cirrhosis. A meta-analysis of nearly 700 patients found that the mean AUCs of MR Elastography were 0.84, 0.88, 0.93, and 0.92 for stages 1–3 fibrosis as well as cirrhosis, respectively [37]. In summary, in addition to gut microbiome-based ML methods, other non-invasive diagnostic techniques are also accurate in the diagnosis of liver fibrosis and cirrhosis.

Although microflora is largely influenced by genetics [38], dynamic environment [39], seasons [40], diet [41], and smoking [42], the gut microbes are specific in predicting liver fibrosis and cirrhosis. Boursier et al. [43] discovered that patients with non-alcoholic steatohepatitis (NASH) or fibrosis had considerably higher levels of ruminococcus. This result was proved by Bajaj JS [44]. Oh et al. [25] found that a core set of gut microbiome species can detect cirrhosis in patients from geographically separated regions, free from the effects of disease etiology, environmental factors, and host genetics on the gut microbiome. Five of the included studies [23, 25, 26, 28, 30] contain six cohorts, and all of them conducted external validations of diagnostic efficiency. The AUCs of the six cohorts are 0.82, 0.89, 0.88, 0.81, 0.71, and 0.721, respectively. These cohorts comprise patients of different ethnic groups, environments, and etiologies. This indicates that gut microbiota signatures are robust in detecting liver fibrosis and cirrhosis in patients of different settings and etiologies. The gut microbiome-based ML method is universal and stable in diagnosing liver fibrosis and cirrhosis. Furthermore, a Deek’s funnel plot was drawn to evaluate the publication bias. The nearly symmetrical plot indicates no publication bias (P = 0.56 > 0.10), suggesting that our results are reliable.

Implications for clinicians, policymakers and other researchers

Although there are few studies on the application of gut microbiota in diagnosing liver cirrhosis or liver fibrosis, we still found that the gut microbiota has a strong association with the pathological grading of liver cirrhosis or liver fibrosis, indicating its great performance in diagnosing liver fibrosis and early liver cirrhosis. In addition to offering references for non-invasive diagnoses of liver cirrhosis and fibrosis, it can be combined with other non-invasive serum/plasma or imaging tests to stage liver fibrosis and cirrhosis clearly or to evaluate the effects of treatment dynamically. However, the clinical application of this technique requires clinicians equipped with extensive experience. Therefore, it is advised to strengthen operator training, improve the quantity and quality of original studies, including randomized controlled trials and multi-centered studies with large sample sizes and diversified ML algorithms, optimize predictive efficiency, and reduce the bias due to modeling with a single algorithm. In this way, research evidence would be more scientific, accurate, and of high quality, providing meaningful guidelines for clinical decision-making.

Research limitations

The application of the gut microbiome-based ML method in diagnosing liver fibrosis or cirrhosis is recommendable, but the present study still has some limitations.

  1. (1)

    The difference between operators affected the predictive results. Removing the examination results by inexperienced operators could avoid artificial influences caused by the batch processing effect in any single data set, which can improve accuracy and diagnostic performance. In an analysis of fecal NAFLD metagenomes based on three publicly available studies, Wang et al. [16] reported that the differences in metagenomic methods and study designs (e.g., sample collection and preservation, sequencing platforms, and DNA extraction methods) could impact the constitution of downstream sequential data. The three studies (Illumina HiSeq) used the same sequencing platforms, but their DNA extraction methods differed. As all three studies explained that their samples were extracted by fast freezing to − 80 °C, there are still technical variances due to human factors.

  2. (2)

    Differences in sequencing methods had an impact on the results. High-throughput sequencing of the culture-independent 16S rRNA gene is the primary research method. However, with the sequencing costs increasing and the data analysis methods maturing, metagenomic shotgun sequencing is expected to become an essential means of studying the gut microbiome due to its advantages of processing larger-scale and comprehensive information. Most included studies adopted the 16S ribosomal RNA sequencing method; one used shotgun metagenomic sequencing [24]; and two used whole-metagenome shotgun sequencing [23, 25]. Different sequencing methods also caused heterogeneity in research results, which was reported in the study by Bajaj JS et al. [44].

  3. (3)

    Undiversified models may also affect the research results. The main ML algorithm used in the included studies was RF. As a dynamic and complex ecosystem in the human body, the gut microbiota is affected by the interactions within microbial colonies as well as between microbial colonies and hosts. It is difficult for undiversified models to present such complicated interactions. Therefore, the multi-omics integration strategy will be a promising approach to understanding the gut microbiota composition and physiological functions.

  4. (4)

    The research results were also affected by sample size. The application of gut microbiota in diagnosing liver cirrhosis or liver fibrosis is still in the primary stage. At present, only few studies have been published on the gut microbiome-based ML method predicting liver fibrosis and cirrhosis. Thereby, there was a small sample size in the present study, and it is impossible to perform subgroup analysis according to microbial species. More studies with a larger sample size are needed, and we will supplement and update the analysis in a timely manner.

  5. (5)

    The impact of heterogeneity on research results

The heterogeneity test indicates the existence of heterogeneity (I2 = 63,95% CI = [16–100]). As shown in the bivariate box plot, 2 of the 10 included studies were located outside the box plot, suggesting that these two studies may be the main source of heterogeneity. After carefully reading these two studies, we found that their sample size and distribution were different, which may affect the performance and applicability of each model. Furthermore, the feature selection method, study type, and study population may all cause heterogeneity. In some countries, liver fibrosis or cirrhosis is not diagnosed by liver biopsy, for example, the three included studies by Caussy, Dong, and Chen. To delve into whether they have an impact on heterogeneity, we used the numerical method and a bivariate box plot to test heterogeneity after excluding the three studies that did not use liver biopsy to diagnose liver fibrosis or cirrhosis. The results showed I2 = 93, 95% CI = [86–99], suggesting that these three studies may not be the source of heterogeneity, and differences in diagnostic criteria have no significant impact on the study results.

  1. (6)

    Impact of external validation on results

In bioinformatics studies, external validation is an important step to ensure the reliability and applicability of research results, making such studies more scientific and practical. In our analysis, most included studies only used K- fold cross validation or random sampling methods to generate validation sets, but did not evaluate the generalization ability of their models or results in new data sets. This may limit the interpretation of the results. Therefore, more follow-up studies are needed to verify our findings.

Conclusion

The gut microbiome-based ML method can effectively predict liver cirrhosis and fibrosis. It is an optimal approach for non-invasive diagnosis of liver cirrhosis and fibrosis with a high clinical application value. Nevertheless, the present study merely included few studies and undiversified models. Therefore, developing a diversified prediction system based on microorganisms is necessary to enrich the efficient non-invasive detection methods for liver cirrhosis or fibrosis.

The introduction of diverse gut microbiome in machine learning will enhance model performance, robustness, and applicability, reduce potential bias in models, improve the interpretability of models, provide more information, and help clinicians make more comprehensive decisions.

Outlook

In summary, ML methods based on intestinal microorganisms have broad potential and prospects in the diagnosis of cirrhosis and liver fibrosis due to noninvasiveness, early diagnosis, personalized diagnosis and treatment, data integration, reduced medical costs, and interdisciplinary cooperation. However, challenges such as data quality, model interpretation, and clinical validation still need to be addressed in the future. Overall, with constant advances in technology and deepening research, ML will continue to play a key role in improving the diagnosis and treatment of liver diseases.

Availability of data and materials

Data sharing is not applicable to this article as no datasets were generated or analysed during the current study.

References

  1. Roehlen N, Crouchet E, Baumert TF. Liver fibrosis: mechanistic concepts and therapeutic perspectives. Cells. 2020;9(4):875.

  2. Aydın MM, Akçalı KC. Liver fibrosis. Turk J Gastroenterol. 2018;29(1):14–21.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Parola M, Pinzani M. Liver fibrosis: pathophysiology, pathogenetic targets and clinical issues. Mol Asp Med. 2019;65:37–55.

    Article  CAS  Google Scholar 

  4. Smith A, Baumgartner K, Bositis C. Cirrhosis: diagnosis and management. Am Fam Physician. 2019;100(12):759–70.

    PubMed  Google Scholar 

  5. The French METAVIR Cooperative Study Group. Intraobserver and interobserver variations in liver biopsy interpretation in patients with chronic hepatitis C. Hepatology. 1994;20(1 Pt 1):15–20.

    Article  Google Scholar 

  6. Barbois S, Arvieux C, Leroy V, Reche F, Stürm N, Borel AL. Benefit-risk of intraoperative liver biopsy during bariatric surgery: review and perspectives. Surg Obes Relat Dis. 2017;13(10):1780–6.

    Article  PubMed  Google Scholar 

  7. Ohtani N, Kawada N. Role of the gut–liver Axis in liver inflammation, fibrosis, and Cancer: a special focus on the gut microbiota relationship. Hepatol Commun. 2019;3(4):456–70.

  8. Piccinino F, Sagnelli E, Pasquale G, Giusti G. Complications following percutaneous liver biopsy. A multicentre retrospective study on 68,276 biopsies. J Hepatol. 1986;2(2):165–73.

    Article  CAS  PubMed  Google Scholar 

  9. Myers RP, Benhamou Y, Imbert-Bismut F, Thibault V, Bochet M, Charlotte F, Ratziu V, Bricaire F, Katlama C, Poynard T. Serum biochemical markers accurately predict liver fibrosis in HIV and hepatitis C virus co-infected patients. Aids. 2003;17(5):721–5.

    Article  CAS  PubMed  Google Scholar 

  10. Blond E, Disse E, Cuerq C, Drai J, Valette PJ, Laville M, Thivolet C, Simon C, Caussy C. EASL-EASD-EASO clinical practice guidelines for the management of non-alcoholic fatty liver disease in severely obese people: do they lead to over-referral? Diabetologia. 2017;60(7):1218–22.

    Article  CAS  PubMed  Google Scholar 

  11. Li R. Data mining and machine learning methods for dementia research. Methods Mol Biol. 2018;1750:363–70.

    Article  CAS  PubMed  Google Scholar 

  12. Li Z, Ni M, Yu H, Wang L, Zhou X, Chen T, Liu G, Gao Y. Gut microbiota and liver fibrosis: one potential biomarker for predicting liver fibrosis. Biomed Res Int. 2020;2020:3905130.

    PubMed  PubMed Central  Google Scholar 

  13. Ren Z, Li A, Jiang J, Zhou L, Yu Z, Lu H, Xie H, Chen X, Shao L, Zhang R, et al. Gut microbiome analysis as a tool towards targeted non-invasive biomarkers for early hepatocellular carcinoma. Gut. 2019;68(6):1014–23.

    Article  CAS  PubMed  Google Scholar 

  14. Zaneveld JR, McMinds R, Vega Thurber R. Stress and stability: applying the Anna Karenina principle to animal microbiomes. Nat Microbiol. 2017;2:17121.

    Article  CAS  PubMed  Google Scholar 

  15. Usami M, Miyoshi M, Yamashita H. Gut microbiota and host metabolism in liver cirrhosis. World J Gastroenterol. 2015;41:11597–608.

    Article  Google Scholar 

  16. Wang T, Guo XK, Xu H. Disentangling the progression of non-alcoholic fatty liver disease in the human gut microbiota. Front Microbiol. 2021;12:728823.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Greener JG, Kandathil SM, Moffat L, Jones DT. A guide to machine learning for biologists. Nat Rev Mol Cell Biol. 2022;23(1):40–55.

    Article  CAS  PubMed  Google Scholar 

  18. Johnson A, Cooper GF, Visweswaran S. A novel personalized random Forest algorithm for clinical outcome prediction. Stud Health Technol Inform. 2022;290:248–52.

    PubMed  Google Scholar 

  19. Plaza-Díaz J, Solis-Urra P, Aragón-Vela J, Rodríguez-Rodríguez F, Olivares-Arancibia J, Álvarez-Mercado AI. Insights into the impact of microbiota in the treatment of NAFLD/NASH and its potential as a biomarker for prognosis and diagnosis. Biomedicines. 2021;9(2):145.

  20. Whiting PF, Rutjes AW, Westwood ME, Mallett S, Deeks JJ, Reitsma JB, Leeflang MM, Sterne JA, Bossuyt PM, the QUADAS-2 Group. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011;155(8):529–36.

    Article  PubMed  Google Scholar 

  21. He J, Jiang Y, Liu L, Zuo Z, Zeng C. Circulating MicroRNAs as promising diagnostic biomarkers for patients with glioma: a Meta-analysis. Front Neurol. 2020;11:610163.

    Article  PubMed  Google Scholar 

  22. Caussy C, Tripathi A, Humphrey G, Bassirian S, Singh S, Faulkner C, Bettencourt R, Rizo E, Richards L, Xu ZZ, et al. A gut microbiome signature for cirrhosis due to nonalcoholic fatty liver disease. Nat Commun. 2019;10(1):1406.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Dong TS, Katzka W, Lagishetty V, Luu K, Hauer M, Pisegna J, Jacobs JP. A microbial signature identifies advanced fibrosis in patients with chronic liver disease mainly due to NAFLD. Sci Rep. 2020;10(1):2771.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Chen Z, Xie Y, Zhou F, Zhang B, Wu J, Yang L, Xu S, Stedtfeld R, Chen Q, Liu J, et al. Featured gut microbiomes associated with the progression of chronic hepatitis B disease. Front Microbiol. 2020;11:383.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Oh TG, Kim SM, Caussy C, Fu T, Guo J, Bassirian S, Singh S, Madamba EV, Bettencourt R, Richards L, et al. A universal gut-microbiome-derived signature predicts cirrhosis. Cell Metab. 2020;32(5):878–888.e876.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Loomba R, Seguritan V, Li W, Long T, Klitgord N, Bhatt A, Dulai PS, Caussy C, Bettencourt R, Highlander SK, et al. Gut microbiome-based metagenomic signature for non-invasive detection of advanced fibrosis in human nonalcoholic fatty liver disease. Cell Metab. 2019;30(3):607.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Lang S, Demir M, Martin A, Jiang L, Zhang X, Duan Y, Gao B, Wisplinghoff H, Kasper P, Roderburg C. Intestinal virome signature associated with severity of nonalcoholic fatty liver disease. Gastroenterology. 2020;159(5):1839–52.

    Article  CAS  PubMed  Google Scholar 

  28. Lang S, Farowski F, Martin A, Wisplinghoff H, Vehreschild MJ, Krawczyk M, Nowag A, Kretzschmar A, Scholz C, Kasper P. Prediction of advanced fibrosis in non-alcoholic fatty liver disease using gut microbiota-based approaches compared with simple non-invasive tools. Sci Rep. 2020;10(1):1–9.

    Article  Google Scholar 

  29. Lapidot Y, Am Ir A, Nosenko R, Uzan-Yulzari A, Ben-Ari Z. Alterations in the gut microbiome in the progression of cirrhosis to hepatocellular carcinoma. mSystems. 2020;5(3):e00153–20.

  30. Lee G, You HJ, Bajaj JS, Joo SK, Yu J, Park S, Kang H, Park JH, Kim JH, Lee DH, et al. Distinct signatures of gut microbiome and metabolites associated with significant fibrosis in non-obese NAFLD. Nat Commun. 2020;11(1):4982.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Schwimmer JB, Johnson JS, Angeles JE, Behling C, Belt PH, Borecki I, Bross C, Durelle J, Goyal NP, Hamilton G, et al. Microbiome signatures associated with Steatohepatitis and moderate to severe fibrosis in children with nonalcoholic fatty liver disease. Gastroenterology. 2019;157(4):1109–22.

    Article  CAS  PubMed  Google Scholar 

  32. Ursell LK, Metcalf JL, Wegener PL, Rob K. Defining the human microbiome. Nutr Rev. 2012;(suppl_1):S38–44.

  33. Clemente JC, Ursell LK, Parfrey LW, Knight R. The impact of the gut microbiota on human health: an integrative view. Cell. 2012;148(6):1258–70.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Falony G, Joossens M, Vieira-Silva S, Wang J, Darzi Y, Faust K, Kurilshikov A, Bonder MJ, Valles-Colomer M, Vandeputte D, et al. Population-level analysis of gut microbiome variation. Science. 2016;352(6285):560–4.

    Article  CAS  PubMed  Google Scholar 

  35. Qin J, Li R, Raes J, Arumugam M, Burgdorf KS, Manichanh C, Nielsen T, Pons N, Levenez F, Yamada T, et al. A human gut microbial gene catalogue established by metagenomic sequencing. Nature. 2010;464(7285):59–65.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Castera L. Noninvasive evaluation of nonalcoholic fatty liver disease. Semin Liver Dis. 2015;35(3):291–303.

    Article  PubMed  Google Scholar 

  37. Serai SD, Yin M, Wang H, Ehman RL, Podberesky DJ. Cross-vendor validation of liver magnetic resonance elastography. Abdom Imaging. 2015;40(4):789–94.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Goodrich JK, Waters JL, Poole AC, Sutter JL, Koren O, Blekhman R, Beaumont M, Van Treuren W, Knight R, Bell JT, et al. Human genetics shape the gut microbiome. Cell. 2014;159(4):789–99.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Donia MS, Cimermancic P, Schulze CJ, Wieland Brown LC, Martin J, Mitreva M, Clardy J, Linington RG, Fischbach MA. A systematic analysis of biosynthetic gene clusters in the human microbiome reveals a common family of antibiotics. Cell. 2014;158(6):1402–14.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Davenport ER, Mizrahi-Man O, Michelini K, Barreiro LB, Ober C, Gilad Y. Seasonal variation in human gut microbiome composition. PLoS One. 2014;9(3):e90731.

    Article  PubMed  PubMed Central  Google Scholar 

  41. Wu GD, Chen J, Hoffmann C, Bittinger K, Chen YY, Keilbaugh SA, Bewtra M, Knights D, Walters WA, Knight R, et al. Linking long-term dietary patterns with gut microbial enterotypes. Science. 2011;334(6052):105–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. Mason MR, Preshaw PM, Nagaraja HN, Dabdoub SM, Rahman A, Kumar PS. The subgingival microbiome of clinically healthy current and never smokers. ISME J. 2015;9(1):268–72.

    Article  PubMed  Google Scholar 

  43. Boursier J, Mueller O, Barret M, Machado M, Fizanne L, Araujo-Perez F, Guy CD, Seed PC, Rawls JF, David LA, et al. The severity of nonalcoholic fatty liver disease is associated with gut dysbiosis and shift in the metabolic function of the gut microbiota. Hepatology. 2016;63(3):764–75.

    Article  CAS  PubMed  Google Scholar 

  44. Bajaj JS, Heuman DM, Hylemon PB, Sanyal AJ, White MB, Monteith P, Noble NA, Unser AB, Daita K, Fisher AR, et al. Altered profile of human gut microbiome is associated with cirrhosis and its complications. J Hepatol. 2014;60(5):940–7.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

We would like to thank the researchers and study participants for their contributions.

Funding

This work was supported by the National Natural Science Foundation of China (81973754); Shaanxi University of Traditional Chinese Medicine Innovation Team Project (2019-QN06).

Author information

Authors and Affiliations

Authors

Contributions

Cong’e Tan and Wenzhe Feng design the study and served as a supervisor and director; Xiaopei Liu and Dan Liu obtained and analyzed the research data; Xiaopei Liu drafted the main manuscript; Xiaopei Liu and Dan Liu drafted and revised the key content of the article.

Corresponding author

Correspondence to Cong’e Tan.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, X., Liu, D., Tan, C. et al. Gut microbiome-based machine learning for diagnostic prediction of liver fibrosis and cirrhosis: a systematic review and meta-analysis. BMC Med Inform Decis Mak 23, 294 (2023). https://doi.org/10.1186/s12911-023-02402-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12911-023-02402-1

Keywords