- Research
- Open access
- Published:
Multiple machine-learning tools identifying prognostic biomarkers for acute Myeloid Leukemia
BMC Medical Informatics and Decision Making volume 24, Article number: 2 (2024)
Abstract
Background
Acute Myeloid Leukemia (AML) generally has a relatively low survival rate after treatment. There is an urgent need to find new biomarkers that may improve the survival prognosis of patients. Machine-learning tools are more and more widely used in the screening of biomarkers.
Methods
Least Absolute Shrinkage and Selection Operator (LASSO), Support Vector Machine-Recursive Feature Elimination (SVM-RFE), Random Forest (RF), eXtreme Gradient Boosting (XGBoost), lrFuncs, IdaProfile, caretFuncs, and nbFuncs models were used to screen key genes closely associated with AML. Then, based on the Cancer Genome Atlas (TCGA), pan-cancer analysis was performed to determine the correlation between important genes and AML or other cancers. Finally, the diagnostic value of important genes for AML was verified in different data sets.
Results
The survival analysis results of the training set showed 26 genes with survival differences. After the intersection of the results of each machine learning method, DNM1, MEIS1, and SUSD3 were selected as key genes for subsequent analysis. The results of the pan-cancer analysis showed that MEIS1 and DNM1 were significantly highly expressed in AML; MEIS1 and SUSD3 are potential risk factors for the prognosis of AML, and DNM1 is a potential protective factor. Three key genes were significantly associated with AML immune subtypes and multiple immune checkpoints in AML. The results of the verification analysis show that DNM1, MEIS1, and SUSD3 have potential diagnostic value for AML.
Conclusion
Multiple machine learning methods identified DNM1, MEIS1, and SUSD3 can be regarded as prognostic biomarkers for AML.
Backgrounds
Acute myeloid leukemia (AML) is a malignant bone marrow disease characterized by clonal expansion and differentiation arrest of bone marrow progenitor cells. Most AML cases still have no clear etiology [1]. AML is the most common acute leukemia in adults, and its survival time is short [2]. In recent years, with the rapid development of molecular targeted therapy and combined therapy, and the widespread application of these two therapies in clinical practice, the survival and prognosis of AML patients have been relatively prolonged and improved [3]. Intensive chemotherapy and gene stem cell transplantation are usually applied to a small number of young patients, and for most patients, the prognosis and survival rate are poor [4]. Although the treatment strategies for AML have been continuously adjusted and improved over the past few decades, the effect of these treatment strategies on the survival and prognosis of patients is still minimal [5]. Therefore, the identification of new and effective prognostic biomarkers is crucial for accurately predicting the prognosis of AML patients and for a deeper and more comprehensive understanding of the pathogenesis of AML.
With the advancement of gene sequencing technology, a series of gene databases have emerged, such as the Cancer Genome Atlas (TCGA) and the Gene Expression Omnibus (GEO). In addition, machine learning algorithms, as one of the main tools of data mining, are now widely used in the medical field. The algorithm establishes a risk model by learning the existing data of patients, which is used to predict the disease, diagnose the severity of the disease, and evaluate the prognosis of the disease [6, 7]. Its main types include the least absolute shrinkage and selection operator (LASSO), random forest graph (RF), support vector machine (SVM), decision tree, and other common algorithms. LASSO is the only property of the absolute value of the penalized regression coefficient [8]. The greater the penalty, the greater the shrinkage of the coefficient, and then remove the unimportant covariates [9, 10]. Support vector machine recursive feature elimination (SVM-RFE) is a supervised machine learning technique widely used in classification and regression. Its purpose is to classify data points by maximizing the margin between classes in high-dimensional space. The features are classified according to the accuracy value, and several features with higher accuracy are selected [11]. The RF algorithm is a method of training and predicting samples by constructing a decision tree. The features with high importance scores are obtained by calculating and sorting the importance scores of features [12]. These machine algorithms can learn and train from data to achieve accurate predictions of future events [13]. These algorithms are gradually being used in the prognosis of lung cancer, breast cancer, liver cancer, gastrointestinal cancer, and other malignant tumors, which has become a hot spot in clinical research [14,15,16,17].
Machine learning algorithms contain a variety of types, and each model has its scope of application. Different types, different volumes, and different characteristics of data have different prediction performance. Therefore, this study aims to screen genes related to AML prognosis based on multiple types of machine learning, and then take the intersection gene of each machine learning result as the key gene for subsequent research. Finally, pan-cancer analysis was performed on key genes to further clarify the correlation between key genes and the occurrence and development of AML or other cancers and then to clarify the diagnostic value of key genes for AML. This study will provide new ideas for the prognosis evaluation of AML patients, and then promote the efficient and accurate individualized treatment of AML.
Methods
Acquisition of data sets
Data on acute myeloid leukemia were obtained from the Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi). Due to the excessive amount of search data, we set some conditions to select candidate data sets. The specific screening criteria for the dataset used in this study were acute leukemia, United States, Homo sapiens, adults, and with a total sample size greater than 100 people. Finally, three data sets were selected for subsequent research (GSE63270, GSE15061, and GSE48558). The basic information of each data set is shown in Table 1. The gene set of GSE15061 was obtained for expression difference analysis (setting the threshold: |logFC| > 1.5, P < 0.05). The differential volcano map (http://sangerbox.com/tool.html) was drawn using the SangerBox website.
Screening the key genes related to the prognosis of AML based on multiple machine-learning methods
Kaplan-Meier (KM) survival analysis of differentially expressed genes in the training set was performed using the R-4.1.1 software package (“survival” and “survminer”).
Subsequently, R-4.1.1 different software packages were used to perform machine learning such as LASSO (R package: “glmnet”, “survival”, “survminer”), RF (R package: “randomForest”, “caret”, “varSelRF”), SVM-RFE (R package: “svm”, “caret”, and “randomForest”), and XGBoost (R package: “xgboost”, and “caret”) on genes related to AML prognosis in the training set. In this study, we using the different functions in caret package of the R software (lrFuncs, IdaProfile, caretFuncs, and nbFuncs) to screened the optimal genes. The specific operation process and screening criteria were carried out according to the official manual of R software (http://topepo.github.io/caret/recursive-feature-elimination.html#recursive-feature-elimination-via-caret). The parameters of the machine learning algorithms used in this study were set according to previous studies [18,19,20].
Finally, the key genes were obtained after the intersection of the genes screened by the above machine-learning methods for subsequent analysis.
Pan-cancer analysis of key genes
Using the UCSC XENA database (http://xenabroswer.net/hub), which integrates public data from multiple databases, downloaded data sets that have been uniformly standardized (including the Cancer Genome Atlas (TCGA), Genotype Tissue Expression (GTEx). We extracted the expression of key genes in 33 cancer types, immune subtypes, clinical information, and other data from the downloaded data set. Then SangerBox (http://sangerbox.com/tool.html) was used to analyze the expression of key genes in cancer, survival analysis, immune analysis, and so on.
The correlation matrix heat map was drawn by SangerBox to explore the expression differences at immune checkpoints. The box plot was drawn by the R package (“ggplot2”, “ggsignif”, “ggpubr”, and “RColorBrewer”) to analyze the correlation between key genes and different subtypes of immune cells.
Gene set enrichment analysis (GSEA) of key genes
To identify pathways associated with key genes, we performed GSEA enrichment analyses. GSEA does not require genetic screening, thereby preserving genes that are not significantly different in expression but are functionally important [21]. GSEA analysis was performed by Sangerbox online bioinformatics tool (http://sangerbox.com/tool.html) based on AML mRNA data in the training set (GSE15061), which will identify the signaling pathways that potentially be related to the key genes screened by the machine learning algorithm.
Validation of diagnostic value of key genes
To verify the diagnostic value of key genes for acute myeloid leukemia, GSE63270 and GSE48558 data sets were used for verification. The “barplot” software package in R was used to verify the differential expression of key genes in AML patients, and the ROC (receiver operating characteristic curve) curve was drawn to evaluate the diagnostic value of key genes.
Results
Survival analysis of differentially expressed genes
The difference expression analysis of the training set GSE15061 was performed, and the threshold was set: |logFC| > 1.5, P < 0.05. The results showed that (Fig. 1) there were 171 differentially expressed genes, including 151 down-regulated genes and 20 up-regulated genes. Subsequent survival analysis of differentially expressed genes showed that a total of 26 differentially expressed genes had prognostic differences in the training set (Supplemental Fig. 1).
Screening key genes related to AML prognosis based on multiple machine learning
Firstly, LASSO regression analysis was used to screen 26 genes with prognostic value, and 10-fold cross-validation was performed. According to LASSO regression machine learning (Fig. 2A–B), a total of 20 genes were screened. Then, we use the RF algorithm (parameter settings: ntree = 2000, mtry = 6) to obtain the importance of input variables and screen out the top five genes (Fig. 2C–D). The SVM-RFE algorithm was used to remove the last few feature genes in the weight ranking of the training set in one round, and 22 genes were screened (Fig. 3A). According to the XGBoost model (Fig. 3B), 19 genes were screened and showed good discrimination, with an AUC of 0.964 (Fig. 3C). Finally, the recursive feature elimination in the ‘caret’ package was used to construct different models. The results showed that lrFuncs screened 15 genes (Fig. 4A), IdaProfile screened 26 genes (Fig. 4B), caretFuncs screened 12 genes (Fig. 4C), and nbFuncs screened 24 genes (Fig. 4D).
The important genes screened by each machine learning algorithm have been obtained (Table 2). And the intersection genes of important genes screened by these machine-learning algorithms are also summarized in Table 2. Figure 5 is the Upset diagram, which illustrating shared genes in important genes screened by different machine learning algorithm. The results showed that DNM1, MEIS1, and SUSD3 can be regarded as key genes for subsequent studies.
Pan-cancer analysis for key genes associated with AML prognosis
For the three key genes screened by machine learning tools, single gene pan-cancer analysis was performed respectively. The results showed that DNM1, MEIS1, and SUSD3 were differentially expressed between various cancers and normal tissues. MEIS1 and DNM1 are highly expressed in AML (Fig. 6A and B), while SUSD3 is not significantly different in AML (Fig. 6C).
To study the relationship between the expression levels of DNM1, MEIS1, and SUSD3 and the prognosis of 33 kinds of cancer, we carried out a survival correlation analysis. Cox proportional hazard model analysis showed that MEIS1 was not only associated with poor prognosis of AML (p = 8.0e-3, HR = 1.15), but also associated with poor prognosis of GBML, LGG, KIRP, and THCA (Fig. 7A). At the same time, MEIS1 was significantly correlated with the prognosis of HNSC, ACC, and KIRC. In addition to the poor prognosis of AML (p = 6.6e-3, HR = 1.25), SUSD3 was also significantly associated with the poor prognosis of GBML, LGG, and ACC (Fig. 7B). At the same time, SUSD3 was significantly associated with a better prognosis of various types of cancer (SKCM, SKCM-M, BRCA, KIPAN, MESO, SARC, LUAD). DNM1 was significantly associated with a better prognosis of AML (p = 0.04, HR = 0.88) and PAAD (p = 0.02, HR = 0.79) (Fig. 7C). In addition, DNM1 was also significantly associated with poor prognosis in a variety of types (THCA、ACC、LIHC、MESO、COADREAD、BLCA、COAD).
To explore the relationship between the expression levels of key genes and AML immune subtypes, we analyzed the correlation between key genes and AML immune subtypes based on the TCGA data set. The results showed that DNM1 (p < 0.001), MEIS1 (p < 0.001), and SUSD3 (p < 0.001) were significantly associated with AML C1-C6 immune subtypes (Fig. 8A). Based on the TCGA database, the correlation between key gene expression levels and immune checkpoints was explored. The results showed that the expression levels of DNM1, MEIS1, and SUSD3 were associated with many cancer immune checkpoints (Fig. 8B–D). Specifically, the expression level of MEIS1 was not correlated with the AML immune checkpoint (Fig. 8B). Figure 8C shows that the expression level of DNM1 is significantly correlated with the two immune checkpoints of AML (HAVCR2 and PDCD1LG2). Figure 8D showed that the SUSD3 expression level was significantly correlated with the five immune checkpoints of AML (CD274, CTLA4, LAG3, PDCD1, and TIGIT).
Gene set enrichment analysis (GSEA) of key genes
GSEA analysis showed that genes related to DNM1 in the training set were mainly enriched in aminoacyl tRNA biosynthesis, acute myeloid leukemia and other pathways (Fig. 9A). Genes related to MEIS1 were mainly enriched in pathways such as cell cycle, P53 signaling pathway, o glycan biosynthesis, etc. (Fig. 9B). Genes associated with SUND3 are mainly enriched in tryptophan metabolism, NOD-like receptor signaling pathway, chemokine signaling pathway etc. (Fig. 9C).
Validation of diagnostic value of key genes
To further explore the role of key genes as AML biomarkers, we selected two data sets to verify their diagnostic value. The results showed that in the GSE48588 (Fig. 10A–B) and GSE63270 (Fig. 10C–D) datasets, the expression levels of DNM1 and MEIS1 in AML were significantly higher than those in the control group, while the expression levels of SUSD3 in AML were significantly lower than those in the control group. ROC results suggested that the expression differences of DNM1, MEIS1, and SUSD3 have potential diagnostic value for AML.
Discussion
The prognosis of AML is poor. Young and elderly patients have a high risk of recurrence of chemotherapy resistance, and alternative and targeted drugs are needed to improve their survival rate [22]. At present, the treatment of AML mainly includes chemotherapy and molecular targeted therapy, such as FMS-like tyrosine kinase 3 (FLT3) inhibitors, IDH [isocitrate dehydrogenase (NADP+)] inhibitors, and monoclonal antibodies [23]. Despite many treatments, the prognosis of AML is still poor. High-throughput genomic screening methods and computer-aided techniques can be used to predict biomarkers related to disease occurrence and assist in the design of new targeted drugs [24]. Therefore, screening biomarkers related to the prognosis of AML patients through machine learning methods will provide a valuable reference for individualized targeted therapy and prognosis prediction of AML in clinical practice.
In this study, 26 genes with survival differences were screened from the differentially expressed genes in the training set. To further screen out the key genes closely related to the prognosis of AML, this study screened key genes based on a variety of machine learning. The results showed that the machine learning method used in this study identified DNM1, MEIS1, and SUSD3 as key genes significantly associated with AML prognosis based on different screening criteria. To further clarify the correlation between key genes and the occurrence and development of AML and various cancers, this study also performed a single-gene pan-cancer analysis of three key genes based on the TCGA database. The expression of MEIS1 and DNM1 in AML and normal controls was significantly different. MEIS1, SUSD3, and DNM1 were significantly associated with the prognosis of AML patients. Three key genes were significantly associated with AML immune subtypes, and DNM1 and SUSD3 were significantly associated with multiple immune checkpoints of AML. In addition to the strong association between the three key genes and AML, this study also found evidence that they are closely related to the prognosis and immunity of various cancers. More importantly, we also selected two validation datasets to verify the diagnostic value of key genes for AML, and the results showed that DNM1, MEIS1, and SUSD3 had good diagnostic values for AML.
The myeloid tropism leukemia virus integration site 1 (MEIS1) gene is located on 1p13-14 of human chromosome 2 and is widely expressed in various tissues including blood, liver, and brain [25]. MEIS1 is related to the differentiation of leukemia stem cells and the proliferation of leukemia cells [26]. Studies have shown that MEIS1 is often up-regulated in AML patients and can participate in disease progression through a variety of mechanisms [27, 28]. Thorsteinsdottir, U. et al. highlighted the role of Meis1 in regulating human AML cell maintenance and survival in vitro knockdown experiments [29]. Similar to the above study is that our study has also found evidence that MEIS1 expression is associated with AML prognosis, immunity, etc., which further proves that MEIS1 may be a biomarker for predicting AML prognosis.
Dynamins 1 (DNM1) is a member of the GTP-binding protein family. DNM1 is highly expressed in the nervous system of the human body and can regulate nerve activity [30, 31]. Therefore, DNM1 is often reported to play a role in nervous system diseases [32, 33]. However, in addition to neurological diseases, more and more studies have shown that DNM1 plays a role in the development of many cancers [34,35,36]. Previous studies have found that high expression of DNM1 is an independent prognostic biomarker for poor OS in patients with hepatocellular carcinoma [36]. DNM1 is overexpressed in many lung cancers, enhances the growth, migration, and invasion of cancer cells, and reduces the survival rate of lung cancer patients. Activated DNM1 selectively regulates tumor necrosis factor-related apoptosis-inducing ligand (TRAIL-R2) -mediated endocytosis, allowing cancer cells to escape death [37]. Based on the above, DNM1 may be used as a biomarker to predict the prognosis of patients with multiple cancers. In this study, we found for the first time evidence that DNM1 is potentially related to the prognosis of AML patients, further indicating that DNM1 plays a potential role in the occurrence and development of AML, and the specific mechanism of action is worthy of further discussion.
At present, there are few reports on Sushi domain-containing protein 3 (SUSD3), mainly focusing on the mechanism of SUSD3 in the occurrence and development of breast cancer. SUSD3 has extracellular, transmembrane, and cytoplasmic domains. It is highly expressed in breast cancer and estrogen-sensitive tissues such as the liver, breast, myometrium, endometrium, and ovary. Experiments have shown that SUSD3 has a higher level of expression in estrogen receptor (ER) -positive breast cancer cells, and estrogen treatment can further increase its expression [38]. SUSD3 has been reported as one of the potential biomarkers for the prognosis of breast cancer [39, 40]. This study found for the first time that SUSD3 is potentially related to the prognosis of AML and is expected to be a prognostic marker for AML patients.
In addition, we explored potential pathways associated with genes closely related to key genes in the training set through GSEA. Many pathways were found to be potentially related to the development of AML. Previous study has reported that the dysfunction of the typical metabolomics pathway Aminoacyl − tRNA biosynthesis indicates that mitochondrial dysfunction, which leads to a decrease in the detoxification ability of reactive oxygen species produced by AML chemotherapy and radiotherapy [41]. P53 plays a key role in normal and leukemia hematopoiesis and is the core of the complex network of AML-related signaling pathways [42]. NLRP3 inflammasome, a major factor in NOD-like receptor signaling pathway, promotes the progression of AML in an IL-1β-dependent manner. Targeting NLRP3 inflammasome may provide a new therapeutic option for AML [43]. Based on the above, it can be seen that the potential pathways related to DNM1, MEIS1 and SUSD3 participate in the occurrence and development of AML. We hypothesized that DNM1, MEIS1, and SUSD3 are closely related to the prognosis of AML, which may be mediated by the above pathways. However, the above is only speculation, and further functional verification experiments are needed to explore the mechanism of these three key genes in the occurrence and development of AML.
Based on a variety of machine learning, this study has explored three new biomarkers for AML prognosis, which provides a new idea for the clinical development of individualized targeted therapy and prognosis prediction of AML. However, this study still has some shortcomings. First, all the analyses in this study are based on retrospective data in public databases, and large-scale prospective studies and additional functional verification experiments are needed to confirm our findings. Secondly, it is necessary to further explore the specific mechanism of the three key genes in AML and their influence on the prognosis of AML in future research, so as to better explore the molecular mechanism involved in tumorigenesis and AML development.
Conclusion
This study found that DNM1, MEIS1, and SUSD3 were abnormally expressed in AML and were potentially related to its prognosis. They are expected to become new biomarkers and potential therapeutic targets for predicting the prognosis of AML patients. This study will provide a new theoretical basis for the basic research of AML.
Data availability
The datasets generated and/or analyzed during the current study are available in the [GEO] repository, [GSE63270, GSE15061, and GSE48558].
References
Shallis RM, Wang R, Davidoff A, Ma X, Zeidan AM. Epidemiology of acute Myeloid Leukemia: recent progress and enduring challenges. Blood Rev. 2019;36:70–87.
Allemani C, Matsuda T, Di Carlo V, Harewood R, Matz M, Nikšić M, et al. Global surveillance of trends in cancer survival 2000-14 (CONCORD-3): analysis of individual records for 37 513 025 patients diagnosed with one of 18 cancers from 322 population-based registries in 71 countries. Lancet (London England). 2018;391(10125):1023–75.
Totiger TM, Ghoshal A, Zabroski J, Sondhi A, Bucha S, Jahn J et al. Targeted Therapy Development in Acute Myeloid Leukemia. Biomedicines. 2023;11(2).
Vakiti A, Mewawalla P. Acute Myeloid Leukemia. StatPearls. Treasure Island (FL) ineligible companies. Disclosure: Prerna Mewawalla declares no relevant financial relationships with ineligible companies.: StatPearls Publishing Copyright © 2023. StatPearls Publishing LLC.; 2023.
Shimony S, Stahl M, Stone RM. Acute Myeloid Leukemia: 2023 update on diagnosis, risk-stratification, and management. Am J Hematol. 2023;98(3):502–26.
Handelman GS, Kok HK, Chandra RV, Razavi AH, Lee MJ, Asadi H. eDoctor: machine learning and the future of medicine. J Intern Med. 2018;284(6):603–19.
Komuro J, Kusumoto D, Hashimoto H, Yuasa S. Machine learning in cardiology: clinical application and basic research. J Cardiol. 2023;82(2):128–33.
McEligot AJ, Poynor V, Sharma R, Panangadan A. Logistic LASSO regression for dietary intakes and Breast Cancer. Nutrients. 2020;12(9).
Zhao Y, Ogden RT, Reiss PT. Wavelet-based LASSO in functional linear regression. Journal of computational and graphical statistics: a joint publication of American Statistical Association, Institute of Mathematical Statistics. Interface Foundation of North America. 2012;21(3):600–17.
Harezlak J, Coull BA, Laird NM, Magari SR, Christiani DC. Penalized solutions to functional regression problems. Comput Stat Data Anal. 2007;51(10):4911–25.
Yang Y, Yi X, Cai Y, Zhang Y, Xu Z. Immune-Associated Gene signatures and subtypes to predict the progression of atherosclerotic plaques based on machine learning. Front Pharmacol. 2022;13:865624.
Lai B, Lai Y, Zhang Y, Zhou M, OuYang G. Survival prediction in acute Myeloid Leukemia using gene expression profiling. BMC Med Inf Decis Mak. 2022;22(1):57.
Verma AA, Murray J, Greiner R, Cohen JP, Shojania KG, Ghassemi M, et al. Implementing machine learning in medicine. CMAJ: Can Med Association J = J de l’Association medicale canadienne. 2021;193(34):E1351–e7.
Lynch CM, Abdollahi B, Fuqua JD, de Carlo AR, Bartholomai JA, Balgemann RN, et al. Prediction of Lung cancer patient survival via supervised machine learning classification techniques. Int J Med Informatics. 2017;108:1–8.
Zhou CM, Xue Q, Wang Y, Tong J, Ji M, Yang JJ. Machine learning to predict the cancer-specific mortality of patients with primary non-metastatic invasive Breast cancer. Surg Today. 2021;51(5):756–63.
Ji GW, Fan Y, Sun DW, Wu MY, Wang K, Li XC, et al. Machine learning to Improve Prognosis Prediction of Early Hepatocellular Carcinoma after Surgical Resection. J Hepatocellular Carcinoma. 2021;8:913–23.
Christopherson KM, Das P, Berlind C, Lindsay WD, Ahern C, Smith BD, et al. A machine learning Model Approach to Risk-Stratify patients with gastrointestinal Cancer for hospitalization and mortality outcomes. Int J Radiat Oncol Biol Phys. 2021;111(1):135–42.
Qiu H, Luo L, Su Z, Zhou L, Wang L, Chen Y. Machine learning approaches to predict peak demand days of cardiovascular admissions considering environmental exposure. BMC Med Inf Decis Mak. 2020;20(1):83.
Kang J, Choi YJ, Kim IK, Lee HS, Kim H, Baik SH, et al. LASSO-Based machine learning algorithm for prediction of Lymph Node Metastasis in T1 Colorectal Cancer. Cancer Res Treat. 2021;53(3):773–83.
Mahmoudian M, Venäläinen MS, Klén R, Elo LL. Stable iterative variable selection. Bioinf (Oxford England). 2021;37(24):4810–7.
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005;102(43):15545–50.
Huang Y, Zhang Z, Sui M, Li Y, Hu Y, Zhang H, et al. A novel stemness classification in acute Myeloid Leukemia by the stemness index and the identification of cancer stem cell-related biomarkers. Front Immunol. 2023;14:1202825.
Kantarjian H, Kadia T, DiNardo C, Daver N, Borthakur G, Jabbour E, et al. Acute Myeloid Leukemia: current progress and future directions. Blood cancer Journal. 2021;11(2):41.
Kumar N, Narayan Das N, Gupta D, Gupta K, Bindra J. Efficient automated Disease diagnosis using machine learning models. J Healthc Eng. 2021;2021:9983652.
Aksoz M, Turan RD, Albayrak E, Kocabas F. Emerging roles of Meis1 in Cardiac Regeneration, Stem cells and Cancer. Curr Drug Targets. 2018;19(2):181–90.
Kocabas F, Zheng J, Thet S, Copeland NG, Jenkins NA, DeBerardinis RJ, et al. Meis1 regulates the metabolic phenotype and oxidant defense of hematopoietic stem cells. Blood. 2012;120(25):4963–72.
Collins CT, Hess JL. Deregulation of the HOXA9/MEIS1 axis in acute Leukemia. Curr Opin Hematol. 2016;23(4):354–61.
Chen CW, Armstrong SA. Targeting DOT1L and HOX gene expression in MLL-rearranged Leukemia and beyond. Exp Hematol. 2015;43(8):673–84.
Thorsteinsdottir U, Kroon E, Jerome L, Blasi F, Sauvageau G. Defining roles for HOX and MEIS1 genes in induction of acute Myeloid Leukemia. Mol Cell Biol. 2001;21(1):224–34.
Raimondi A, Ferguson SM, Lou X, Armbruster M, Paradise S, Giovedi S, et al. Overlapping role of dynamin isoforms in synaptic vesicle endocytosis. Neuron. 2011;70(6):1100–14.
Ferguson SM, Brasnjo G, Hayashi M, Wölfel M, Collesi C, Giovedi S, et al. A selective activity-dependent requirement for dynamin 1 in synaptic vesicle endocytosis. Sci (New York NY). 2007;316(5824):570–4.
Sahly AN, Krochmalnek E, St-Onge J, Srour M, Myers KA. Severe DNM1 encephalopathy with dysmyelination due to recurrent splice site pathogenic variant. Hum Genet. 2020;139(12):1575–8.
Brereton E, Fassi E, Araujo GC, Dodd J, Telegrafi A, Pathak SJ, et al. Mutations in the PH Domain of DNM1 are associated with a nonepileptic phenotype characterized by developmental delay and neurobehavioral abnormalities. Mol Genet Genom Med. 2018;6(2):294–300.
Yamada H, Takeda T, Michiue H, Abe T, Takei K. Actin bundling by dynamin 2 and cortactin is implicated in cell migration by stabilizing filopodia in human non-small cell lung carcinoma cells. Int J Oncol. 2016;49(3):877–86.
Raja SA, Shah STA, Tariq A, Bibi N, Sughra K, Yousuf A, et al. Caveolin-1 and dynamin-2 overexpression is associated with the progression of Bladder cancer. Oncol Lett. 2019;18(1):219–26.
Tian M, Yang X, Li Y, Guo S. The expression of Dynamin 1, 2, and 3 in Human Hepatocellular Carcinoma and patient prognosis. Med Sci Monitor: Int Med J Experimental Clin Res. 2020;26:e923359.
Reis CR, Chen PH, Bendris N, Schmid SL. TRAIL-death receptor endocytosis and apoptosis are selectively regulated by dynamin-1 activation. Proc Natl Acad Sci USA. 2017;114(3):504–9.
Moy I, Todorović V, Dubash AD, Coon JS, Parker JB, Buranapramest M, et al. Estrogen-dependent sushi domain containing 3 regulates cytoskeleton organization and migration in Breast cancer cells. Oncogene. 2015;34(3):323–33.
Zhao S, Chen SS, Gu Y, Jiang EZ, Yu ZH. Expression and clinical significance of Sushi Domain- Containing protein 3 (SUSD3) and insulin-like growth Factor-I receptor (IGF-IR) in Breast Cancer. Asian Pac J cancer Prevention: APJCP. 2015;16(18):8633–6.
Lu N, Guan X, Bao W, Fan Z, Zhang J. Breast cancer combined prognostic model based on lactate metabolism genes. Medicine. 2022;101(51):e32485.
Cano KE, Li L, Bhatia S, Bhatia R, Forman SJ, Chen Y. NMR-based metabolomic analysis of the molecular pathogenesis of therapy-related myelodysplasia/acute Myeloid Leukemia. J Proteome Res. 2011;10(6):2873–81.
Prokocimer M, Molchadsky A, Rotter V. Dysfunctional diversity of p53 proteins in adult acute Myeloid Leukemia: projections on diagnostic workup and therapy. Blood. 2017;130(6):699–712.
Zhong C, Wang R, Hua M, Zhang C, Han F, Xu M, et al. NLRP3 Inflammasome promotes the progression of Acute Myeloid Leukemia via IL-1β pathway. Front Immunol. 2021;12:661939.
Acknowledgements
We thank all authors for their contributions and support.
Funding
This study was supported by the Kunming medical university joint special applied basic research [202001AY070001-111]; the First People’s Hospital of Yunnan Province Clinical Medical Center open project by the Yunnan Province Clinical Research Center for Hematologic Disease and the Yunnan Province Clinical Center for Hematologic Disease [2021LCZXXF-XY12, 2022LCZXKF-XY04, and 2023YJZX-XY02]; Chinese Academy of Medical Sciences (CAMS) Innovation Fund for Medical Sciences(CIFMS) [2016-I2M-3-024]; Kunming University of Science and Technology medical joint special project [KUST-KH2022028Y].
Author information
Authors and Affiliations
Contributions
Conceptualization, Y.C. and C.Z.; methodology, X.Y. and Y.W.; software, Q.L. and W.C.; data collection, R.D.; writing, review, and editing, Y.C. All authors have read and approved the manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not Applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Cheng, Y., Yang, X., Wang, Y. et al. Multiple machine-learning tools identifying prognostic biomarkers for acute Myeloid Leukemia. BMC Med Inform Decis Mak 24, 2 (2024). https://doi.org/10.1186/s12911-023-02408-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12911-023-02408-9