A novel signature model based on mitochondrial-related genes for predicting survival of colon adenocarcinoma
BMC Medical Informatics and Decision Making volume 22, Article number: 277 (2022)
Colon cancer is the foremost reason of cancer-related mortality worldwide. Colon adenocarcinoma constitutes 90% of colon cancer, and most patients with colon adenocarcinoma (COAD) are identified until advanced stage. With the emergence of an increasing number of novel pathogenic mechanisms and treatments, the role of mitochondria in the development of cancer, has been studied and reported with increasing frequency.
We systematically analyzed the effect of mitochondria-related genes in COAD utilizing RNA sequencing dataset from The Cancer Genome Atlas database and 1613 mitochondrial function-related genes from MitoMiner database. Our approach consisted of differentially expressed gene, gene set enrichment analysis, gene ontology terminology, Kyoto Encyclopedia of Genes and Genomes, independent prognostic analysis, univariate and multivariate analysis, Kaplan–Meier survival analysis, immune microenvironment correlation analysis, and Cox regression analysis.
Consequently, 8 genes were identified to construct 8 mitochondrial-related gene model by applying Cox regression analysis, CDC25C, KCNJ11, NOL3, P4HA1, QSOX2, Trap1, DNAJC28, and ATCAY. Meanwhile, we assessed the connection between this model and clinical parameters or immune microenvironment. Risk score was an independent predictor for COAD patients’ survival with an AUC of 0.687, 0.752 and 0.762 at 1-, 3- and 5-year in nomogram, respectively. The group with the highest risk score had the lowest survival rate and the worst clinical stages. Additionally, its predictive capacity was validated in GSE39582 cohort.
In summary, we established a prognostic pattern of mitochondrial-related genes, which can predict overall survival in COAD, which may enable a more optimized approach for the clinical treatment and scientific study of COAD. This gene signature model has the potential to improve prognosis and treatment for COAD patients in the future, and to be widely implemented in clinical settings. The utilization of this mitochondrial-related gene signature model may be benefit in the treatments and medical decision-making of COAD.
According to the 2018 Global Cancer Data Report, colorectal cancer (CRC), including colon adenocarcinoma (COAD), is now among the top three cancers in terms of morbidity and ranks second with respect to mortality [1, 2]. Currently, the prognosis for COAD is largely determined by clinicopathological characteristics and the stage of the tumor [3, 4]. The primary COAD treatments include surgery, radiotherapy, and chemotherapy. 5-Fluorouracil (5-FU) and folinic acid (leucovorin), which are combined with oxaliplatin (FOLFOX) or irinotecan, represent one of the highest standards among these therapies (FOLFIRI). Although effective early screening improved recurrence, and additional treatment options have contributed to a decline in the incidence and mortality of COAD, many patients continue to be diagnosed at an advanced stage. In recent years, the average age of onset has decreased, and the 5-year survival rate of patients with distant metastases is under 10% . Since early symptoms of COAD are not readily apparent, most patients have already entered advanced stage when diagnosed. Over 50% of COAD patients are diagnosed in their advanced stages . For the development of effective treatment strategies, it is pivotal to conduct additional investigation on carcinogenesis of COAD to probe new and promising biomarkers. During tumor cell formation, the metabolism is reprogrammed to rapidly facilitate cancer cell growth. The central role of mitochondria in this process is critical. Pan-cancer mitochondrial-gene analysis displayed that mitochondrion genomic alterations and nuclear mitochondrial were closely associated with 38 tumor types . Some researches had showed the roles of mitochondrial genes and its relationship with the survival status of cancer patients. But the efficacy of mitochondrial-related genes in evaluating COAD patients’ prognosis lacks depth researches . Thus, our research aims to investigate whether transcriptomic profiling of mitochondrial genes is connected to the prognosis of COAD patients.
Mitochondria are unique organelles that carry extranuclear genetic material, and they are associated with a variety of metabolic diseases, degenerative diseases, age-related human diseases, and cancer [8, 9]. Mitochondria is evidenced to exert a significant part in the carcinogenesis and progression of COAD through retrograde regulation of the nucleus . Furthermore, reactive oxygen species (ROS) produced in mitochondria can promote the proliferation and migration of tumor cells . Accordingly, research into mitochondria is extensively acknowledged in a variety of fields. More recently, it has been demonstrated that mitochondria from non-tumor cell lines inhibit tumor formation in the same nuclear context, including inhibition of apoptosis, proliferation, anoxic survival, drug resistance, colony formation, and invasion, as well as enhanced tumor cell response to therapy . In addition, the bidirectional communication between mitochondria and the nucleus facilitates retrograde regulation of the nucleus . During the formation of tumor cells, metabolism is reprogrammed to facilitate rapid proliferation of cancer cells. Mitochondria play an indispensably pivotal role in this process. Mitochondrial gene analysis of pan-cancer revealed that nuclear mitochondrial genomic alterations were closely associated among 38 tumor types . There are many studies to explore the functions of mitochondrial-related genes in cancer and how they are connected to prognosis. However, research on the role and effectiveness of mitochondrial-related genes in predicting the prognosis of COAD is insufficient . Mitochondria could become innovative target for anti-cancer drugs, and the role of mitochondria-related genes in cancer prognosis prediction may become a novel and potential diagnostic model. Therefore, the object of this research is to excavate whether transcriptomic profiling of mitochondrial genes correlates with the prognosis and survival of COAD patients.
Recent evidence indicates that the combination of microarray technology and bioinformatics tools can effectively identify new targets concerning diagnosis, and prognosis of cancer [14, 15]. Therefore, bioinformatics is a feasible tool for filtering DEG and questing key genes [11, 12]. Using RNA sequencing data from TCGA database in conjunction with bioinformatics and statistical methods, we aimed to systematically discuss the effect of mitochondria-related genes in COAD. Initially, we analyzed 1613 mitochondrial-related genes and corresponding clinical data from TCGA of COAD patients. Then, we identified 249 mitochondrial-related genes with differential expression in COAD patients. Secondly, we developed a novel 8-gene signature prognosis model by employing Cox regression analysis to screen 8 significant genes that influence the survival of COAD patients. Thirdly, we validated the prognostic efficacy of the established mitochondrial-related gene pattern utilizing validation dataset GSE39582. On the basis of this signature model, a nomogram and good AUC curves were developed to demonstrate the predictability and stability of the model. Finally, the functional signaling pathways, immune checkpoints and immune cell fraction in the tumor microenvironment, and clinical parameters between high- and low-risk groups were further investigated and analyzed. The novel 8 mitochondria-related gene pattern to assess prognosis in COAD provided essential bioinformatics evidence to advance understanding of the complex mechanisms of the COAD progression and to optimize prognosis and improve survival of COAD patients.
Data origin and collection
RNA-seq transcriptome data of 480 COAD and 41 normal samples, along with corresponding clinical parameters of COAD patients, were downloaded from TCGA-COAD cohort (https://portal.gdc.cancer.gov/). GEO dataset GSE39582 (https://www.ncbi.nlm.nih.gov/geo/) comprising 566 colorectal cancer patients was defined as an independent validation dataset. Patients without survival information were excluded. Mitochondrion-related genes were downloaded from the MitoMiner database , which collected human genes encoding proteins associating with mitochondria and affecting their form and function. The most recent update to MitoMiner is version 4.0, which includes 1613 mitochondrial-related genes.
Differential expression of mitochondrion-related genes was analyzed by “limma” package (R v3.6)  with the cut-off P < 0.05 and abs(logFC) > 1. The expression of mitochondrion-related DEGs was compared between tumor and normal tissue utilizing heatmaps and volcano diagrams.
Functional enrichment analysis
Using the JAVA program gsea-3.0.jar, the GSEA was carried out on the gene ontology gene set of MSigDB to illustrate differences between normal tissue samples and COAD samples [18,19,20]. The algorithm of random sampling consisted of 1,000 permutations. Employing a false discovery rate (FDR) < 0.05, an enrichment between two types was identified. The "clusterProfiler" , "ggplot2" , and "GOplot" R packages  were utilized to perform GO and KEGG analyses n tumor tissue versus normal tissue.
Model construction based on differential mitochondrial-related genes
To unearth mitochondrion-related DEGs value, a univariate Cox analysis of OS was performed. We performed boxplot diagrams to visualize the expression of prognostic-related genes. The multivariate Cox analysis was leveraged to establish a prognostic pattern to minimize the hazard of overfitting. Normalized expression of each gene and their regression coefficients were utilized to compute risk scores. The formula was as follows: score = ESUM (expression of each gene × homologous coefficient). Riskscore = 0.53*CDC25C + 0.64*NOL3 + 0.601*QSOX2 + 0.281*KCNJ112.44*DNAJC28 + 1.294*ATCAY-0.604*TRAP1 + 0.436*P4HA1. Patients were stratified into high-risk and low-risk groups based on median risk score. For survival analysis, the optimal cut-off expression value was resolved by the "surv cutpoint" of "survminer" R package . Based on expression of each prognostic-related gene, Kaplan–Meier curves were utilized to juxtapose OS between two subgroups. To compare the correlation between clinicopathological variables and risk score, univariate and multivariate Cox analyses were performed. Independent cohort GSE39582 was retrieved to validate the model.
Construction of nomogram, ROC curves clinical features, and immune status for COAD
The "survivalROC" R package  was used to assess predictive worth of the gene pattern using time-dependent receiver operating characteristic (ROC) curve analyses. RMS package  was utilized to generate nomograms that incorporated clinically significant characteristics and risk scores. The relationship between clinicopathological variables and risk score was assessed by student's t-test. Visual data representations were produced using R package “beeswarm” . The correlation of immune checkpoints and immune cell infiltration fraction with risk score was also calculated by spearman correlation.
All statistical analyses were conducted by R package (v. 3.6.3). The Kaplan–Meier analysis with log-rank test was used to determine the significance of the difference in survival rates among risk groups. P values were adjusted utilizing Benjamini–Hochberg method. P < 0.05 was regarded significant.
Flow chart of overall design
521 COAD patients from the TCGA-COAD cohort were enrolled totally, including both tumor (n = 480) and normal samples (n = 41) (Fig. 1A). Upon downloading the RNA expression data for COAD patients from TCGA, the GSEA enrichment analysis was used to identify various mitochondrial-related pathways. These pathways enriched in mitochondrion-related metabolism prompted us to investigate the connection between mitochondrial metabolism and COAD pathogenesis further. We obtained the mitochondrion-related gene set (n = 1613) from MitoMiner database in a previous study by Anthony C Smith. The mitochondrion-related gene set was intersected with DEGs from TCGA-COAD datasets to obtain "differentially expressed mitochondrion-related genes" (n = 249). Next, we analyzed each gene using univariate/multivariate Cox regression, ultimately selecting eight genes for establishing the signature model. By using, univariate/multivariate Cox regression, correlation analysis, and Kaplan–Meier analysis, the relationship among the 8 gene signature model, clinical characteristics, immune checkpoint, and significance of survival were explored further. Finally, the nomogram graph and area under (AUC)/ receiver operating characteristic (ROC) curve were constructed to validate the efficacy. GSE39582 cohort also was applied to validate accuracy of the signature model.
Identification of differential metabolic gene sets between COAD tumors and normal tissue
Although it has been reported that COAD tumorigenesis exhibits a unique relationship with mitochondrial metabolic processes, the associated metabolic significance remains unknown. GSEA evidenced that in nine associated metabolic pathways (NOM P < 0.05) (Fig. 2), the gene sets were significantly enriched. These pathways included mitochondrial gene expression (NES = 1.7721, NOM P = 0.03), mitochondrial genome maintenance (NES = 1.6818, NOM P = 0.0153), mitochondrial RNA metabolism (NES = 1.8746, NOM P = 0.0123), mitochondrial RNA processing (NES = 1.8651, NOM P < 0.001), mitochondrial translation (NES = 1.7320, NOM P = 0.0412), positive regulation of mitochondrial translation (NES = 1.8722, NOM P < 0.001), positive regulation of mitochondrial outer membrane permeabilization involved in apoptotic signaling pathway (NES = 1.6025, NOM P = 0.0281), protein import into mitochondrial matrix (NES = 1.7508, NOM P = 0.0117), and regulation of mitochondrial gene expression (NES = 1.9865, NOM P < 0.001) (Table 1).
Functional analyses of differentially expressed mitochondrion-related genes in TCGA
We further investigated the relationship between COAD and mitochondrial metabolism as GSEA revealed the gene sets were significantly enriched in nine associated metabolic pathways. We identified 249 mitochondrion-related DEGs by intersecting the mitochondrion-related gene set with the TCGA-COAD DEG datasets (Additional file 3: Table S1). These mitochondrial-related DEGs appeared on volcano and heat maps (Fig. 3A–B). To clarify the biological implication connected with these mitochondrial-related DEGs, GO and KEGG analyses were performed on DEGs. Expectedly, DEGs were enriched in mitochondrial metabolism, including transport processes and fatty acid metabolism (P < 0.05) (Fig. 3C–F). Moreover, the DEGs were considerably enriched in a number of other biological processes, such as thermogenesis, the peroxisome proliferator-activated receptor (PPAR) signaling pathway, and apoptosis in multiple species.
The establishment of a mitochondrion-related prognostic model
To identify genes significantly associated with prognosis, a univariate Cox analysis was applied. 18 mitochondrion-related were initially identified as prognostic genes (Fig. 4A–B). Figure 4A displays18 mitochondrion-related differential genes in COAD and normal tissues. Figure 4B depicts the results of the univariate Cox analysis. Next, a prognostic pattern according to multivariate Cox analysis was developed (Additional file 4: Table S2). As displayed in Fig. 4C and 5A, a risk score was computed, as detailed in Materials and Methods section. A high-risk group and a low-risk group were divided according to median risk score (P < 0.001). High-risk patients often die earlier than low-risk patients (Fig. 5B). As for this scatter plot, every point just represents a patient. Tumors are heterogeneous, and each clinical patient is also specific. Analyzing this issue from a clinical point of view, patients assigned to a high-risk group do not absolutely have a worse prognosis, whereas patients assigned to a low-risk group do not necessarily have a longer survival time. Our model efficacy is decided by the final proportions and probability.
Eight genes were involved in the mitochondrion-related pattern. Kaplan–Meier plots revealed that eight genes were identified as independent prognostic signatures (P < 0.05) (Fig. 6A–F), involving CDC25C [P = 0.001, Hazard Ratio (HR) = 0.5], KCNJ11 (P = 0.004, HR = 1.91), NOL3 (P = 0.004, HR = 1.96), P4HA1 (P = 0.008, HR = 1.73), QSOX2 (P = 0.017, HR = 1.65), and TRAP1 (P = 0.002, HR = 0.52), and two genes with P value > 0.05, DNAJC28 (P = 0.051, HR = 0.66) and ATCAY (P = 0.206, HR = 1.3) (Additional file 1: Fig. S1), involving DNAJC28 (P = 0.051, HR = 0.66) and ATCAY (P = 0.206, HR = 1.3). A heat map depicts the expression of eight mitochondrion-related signatures in COAD (Fig. 5C).
Equipped with the available variables, univariate and multiple Cox analyses were performed to clarify whether risk score was independent for OS. In univariate Cox analyses, the risk score was strongly associated to OS in TCGA-COAD cohort (P < 0.001, HR = 1.112) (Fig. 7A). These included variables are important parameters used in the clinical treatment of colon adenocarcinoma to measure its disease progression, grading and staging. Age and gender can determine the pathogenic factors of the patient, while T, N, M, stage can indicate the severity of the disease. By comparing the risk score with these typical variables, we can show that it has a relatively better predictive value for prognosis. In a multivariate Cox analysis, after adjusting for other confounding factors, the risk score remained independent for OS (P < 0.001, HR = 1.109) (Fig. 7B). After multiple Cox analysis combined with clinical stage and risk score and the development of a prognostic prediction pattern, the nomogram was conducted. This was used to confirm the model’s risk score as a prognostic factor to assess the predicted probability of OS at 1-, 3- and 5- years (Fig. 7C). It has been demonstrated that the model is effective in predicting OS at 1-, 3-, and 5- years (Risk score-AUC: 1-year -: 0.687, 3-year: 0.752, 5-year: 0.762) (Fig. 7D). In addition, we conformed risk score and clinical representative characteristics into the same ROC curve to compare their 1-, 3-, and 5-year prediction efficacy (Fig. 7E–G).
Additionally, the prediction efficacy was validated in GSE39582 (Fig. 8A–C). The ROC curve was also validated in GSE39582 dataset, and 1-year risk score-AUC value: 0.757, 3-year risk score-AUC value: 0.714, 5-year risk score-AUC value: 0.691 (Fig. 8D). With respect to these data, Additional file 1: Fig. S2 displays the Decision Curve Analysis for this risk score. We developed two models; both of which included risk score and excluded risk score. It is evident that the risk score model appears to offer more advantages.
Connection between risk score and clinicopathological characteristics
Utilizing clinical information from the TCGA-COAD cohort, the current research probed the connection between risk score and prognostic factors. Results revealed a significant connection between higher risk scores and higher tumor (P = 4.116e-04), node (P = 0.022), and stage (P = 0.017) levels, as well as with tumors (P = 0.065) (Fig. 9). Other important clinical characteristics were not significantly interrelated with gender (P = 0.360), M stage (P = 0.107), histological type (P = 0.613), carcinoembryonic antigen (CEA) level (P = 0.313), lymphatic (P = 0.658) or perineural invasion (P = 0.450), which have been each reported to be correlated with COAD prognosis (Fig. S3).
Connection between risk score and immune status in tumor microenvironment
Immunotherapy is an emerging treatment for COAD. Increasingly related target mechanism research and clinical trials are in progress. Immunotherapies that inhibit immune checkpoints and target specific immune cell are common and effective immunotherapies in clinical practice. To discuss the fesible connection between our risk score model and immune cell infiltration and immune checkpoints, we first performed correlation analysis between 22 immune cell infiltration fraction and our risk score. Results revealed that CD4 memory resting T cells (P = 0.007) and CD4 memory activated T cells (P = 0.0059) were significantly higher in low-risk group (Fig. 10A–B). But the macrophages M0 cells (P < 0.001) and NK resting cells (P = 0.034) were significantly higher in high-risk group (Fig. 10C–D). In addition, we performed the correlation analysis between expression of six representative immune checkpoints and risk score. The expression level of CD274 (P = 0.032), HAVCR2 (P < 0.001) (Fig. 10E–F) and PDCD1LG2 (P = 0.22) (Fig. S4A) was higher in high-risk group than low-risk group. However, the expression of CTLA4 (P = 0.048), IDO1 (P = 0.017) (Fig. 10G–H) and PDCD1 (P = 0.085) (Fig. S4B) was lower in high- risk group. Different relationships among immune checkpoints may reflect the non-negligible tumor heterogeneity, and they may serve as a reference for future immune checkpoint inhibition treatment.
As the world's population ages, the incident rate of COAD is increasing globally. Genetic and epigenetic alteration, smoking and alcohol consumption, dietary factors, and inflammatory bowel disease are all contributory factors to the development of COAD. Current COAD treatment consists primarily of surgical resection and chemotherapy, but their ineffectiveness exemplifies the need for novel approaches . Due to a lack of early diagnostic tools, the majority of patients are diagnosed at advanced stages of disease. As a result, many patients miss the optimal window for curative surgical treatment [28, 29]. Although numerous studies have addressed the diagnosis and treatment of COAD in the past, no meaningful breakthroughs have been made. Resultantly, establishing a reliable model for early diagnosis and prognosis prediction in COAD is paramount. The use of such a model could accurately and promptly assess the outcomes of treatment and offer recommendations for additional treatment . Chen et al. and Zuo et al. [31, 32] published the COAD prognostic model of transcriptome characteristic genes, which describes the construction of prognostic models for patients. Nonetheless, the role of mitochondria-related genes in COAD has yet to be explored.
As an indispensable intracellular organelle of eukaryotes, mitochondrial function plays a crucial role in many cellular processes . Mitochondria serve as a metabolic hub to regulate the metabolic process and provide energy for cell growth, differentiation, and apoptosis. It has been proven that mitochondrial dysfunction affects the occurrence and development of cancer . Some biological processes related to cancer, including tumor formation, development, invasion, metastasis, and drug resistance, are dependent on mitochondria [35, 36]. Since the metabolic process in tumors is frequently changed, mitochondrial-related genes have been investigated as a potential cancer therapy target in a number of recent studies [37, 38]. Differential expression of mitochondria-related genes is associated with occurrence and metastasis of breast cancer , as well as the invasive phenotype of osteosarcoma , according to studies. During tumor initiation and metastasis, the metabolism is reprogrammed, and this reprogramming is largely dependent on mitochondria .
Furthermore, there are a number of studies focus on mechanisms of mitochondrial-related genes and designing corresponding drugs and inhibitors for COAD treatments. Growing tumors will quickly exceed the size that diffusion provides an adequate supply of oxygen, leading to tumor hypoxia and transition to glycolysis. This switch is caused by an important factor, hypoxia-inducible transcription factor 1α (HIF-1α), which determines metabolic fate of COAD. Downregulation of MPC1 and MPC2 has been reported in COAD, which is associated with poor prognosis . Moreover, a series of researches found the role of mitochondrial oxidative phosphorylation (OXPHOS) in COAD. Increased mitochondrial DNA copy number in COAD is connected to higher proliferation and lower apoptosis by mitochondrial OXPHOS . With the in-depth understanding of these metabolic processes and mitochondrial genes, recent studies to target cancer metabolism focus on the mitochondrial TCA and OXPHOS to block the aerobic glycolysis in tumor cells. A large number of drugs are under study to target mitochondria and mitochondrial function, such as metformin, 3-bromopyruvate or 2-deoxyglucose [44, 45]. Notably, most studies have explored a single mitochondrial-related gene or an associated signaling pathway in tumor formation, invasion, metastasis, and its relationship with the prognosis of cancer. In present study, the complicated biological process of mitochondria has been paid attention, and the utilization of mitochondria-related gene sets will be more reliable and can effectively judge the survival and prognosis of COAD.
Furthermore, a number of studies concentrate on the mechanisms of mitochondrial-related genes and the development of drugs and inhibitors for COAD treatments. Tumors will quickly outgrow the size at which diffusion can provide an adequate supply of oxygen, leading to tumor hypoxia and the transition to glycolysis. Hypoxia-inducible transcription factor 1α (HIF-1α) is responsible for this switch, resulting in up-regulation of several genes to avoid hypoxic stress and activate pyruvate dehydrogenase kinase (PDK) to inhibit mitochondrial metabolism [46, 47]. In COAD, HIF1α expression is associated with cancer-specific death, recurrence, vascular invasion and chemoresistance . In addition, the mitochondrial pyruvate carrier (MPC), consisting of MPC1 and MPC2 subunits, becomes another pivotal factor in determining the metabolic fate of COAD . Since MPC is responsible for mitochondrial pyruvate uptake, it causes oxidation in the tricarboxylic acid (TCA) cycle subsequently. MPC1 and MPC2 deletion or downregulation has been reported in COAD, which is associated with poor prognosis . In addition, variety of studies have identified the effect of mitochondrial oxidative phosphorylation (OXPHOS) in COAD. Increased mitochondrial DNA copy number in COAD correlates with increased proliferation and apoptosis inhibition by mitochondrial OXPHOS . With the in-depth understanding of these metabolic processes and mitochondrial genes, recent studies targeting cancer metabolism have been focusing on the mitochondrial TCA and OXPHOS to block the aerobic glycolysis in tumor cells. Numerous drugs targeting mitochondria and mitochondrial function are being investigated, such as 3-bromopyruvate, metformin or 2-deoxyglucose [44, 45]. Notably, the majority of studies have centered on a single mitochondrial-related gene or signaling pathway in tumor formation, progression and its association with cancer survival. The complex biological process of mitochondria has been considered in our work, and the utilization of nuclear mitochondria-related gene sets will be more reliable and can effectively estimate the survival status of COAD.
Many prognostic patterns according to mitochondrial-related genes have been clarified for certain cancers, including bladder, prostate, liver, and lung cancer [52,53,54,55]. However, no research has been reported in COAD. At the beginning of our research, we conducted a gene signature prediction model according to mitochondrial-related genes in COAD. Several bioinformatics instruments were used to analyze COAD sample transcriptome sequencing data. We discovered that 88 genes were up-regulated and 99 were down-regulated in COAD tissue samples compared to normal tissue by leveraging the human mitochondria-related gene library MitoMiner V4.0 . The identified DEGs are closely correlated with mitochondrial dysfunction and metabolic processes during the development of COAD. GO enrichment analysis revealed genetic variations in nine biological pathways related to cancer, including ROS generation, nucleic acids, amino acid metabolism, and dicarboxylic acid metabolism . These biological processes are consistent with the characteristics of tumor cells and are primarily associated with unrestricted cell proliferation , indicating that mitochondria-related genes is closely connected to the carcinogenesis of COAD. Differential expression of mitochondria-related genes primarily affects fatty acid metabolism and amino acid metabolism pathways, which are closely related to the metabolic adaptation of tumors and the metastasis of COAD , as indicated by the KEGG pathway maps. These metabolic changes are essential for tumor growth in an unfavorable tumor microenvironment and for the development and maintenance of cancer cell metastasis . Numerous studies have recently proposed the "lipolytic phenotype" of cancer; fatty acid metabolism is also reprogrammed in cancer-related immune cells , contributing to immune suppression and promoting the tumor microenvironment, making it a potential target of immunotherapy .
In our present study, differentially expressed mitochondria-related genes and prognostic correlation analysis were utilized to develop a prediction model for eight key genes. We discovered that the prediction model was able to effectively stratify patients based on survival, with high-risk group exhibiting worse OS than low-risk group. ROC and independent prognosis analysis suggested that the predictive pattern could be applied as an independent risk factor for patient prognosis and had a high predictive value for patient prognosis. Clinical staging, TNM staging, and histological grading continue to be the most frequently used tools for prognostic prediction and treatment strategies in COAD patients  at present. However, the heterogeneity of COAD makes it challenging to improve the treatment efficacy of COAD and make decisions for doctors regarding the therapy of COAD patients . A prognostic nomogram was developed in the present study, which had the advantage of overcoming COAD heterogeneity, and may lead to inaccurate prognosis prediction in COAD patients. In contrast, OS had greater AUC values at 1, 3, and 5 years, indicating that the newly constructed nomogram was credible. Through gene correlation analysis, it was discovered that CDC25C and P4HA1 may be key genes to target in COAD patients [63,64,65,66], as they are associated with the metabolism, cell cycle, and progression of tumors. Therefore, CDC25C and P4HA1 have the potential to serve as biomarkers for COAD patients and contribute to the decision-making process regarding colon cancer treatment. The novel genes including KCNJ11 NOL3, P4HA1, and QSOX2 were overlooked in COAD in the past; the correlation among these genes and COAD prognosis has been inadequately defined and requires further investigation. Our findings demonstrate the pioneering prognostic value of our model and offer a novel pathogenesis and prognostic mechanism for COAD.
Our study still has certain limitations. Even though we validated our signature model based on public GEO datasets, it is worth further validating with prospective clinical samples and local cohort data in the future. Additionally, although our study demonstrated a potential association between risk scores and tumor microenvironment or clinical characteristics that may influence clinical management decisions in patients with COAD, the validation of immune checkpoint inhibitors and patient-targeted therapies requires further research. The potential regulatory mechanisms in vivo or in vitro also need further research to explore in depth.
This study represents the first effort to discover polygenic markers of mitochondrial-related genes assess potential function of these genes during the carcinogenesis of COAD patients. In addition, a robust risk score tool based on the expression profile of mitochondrial-related genes was developed to prompt COAD patients’ prognosis. Furthermore, the prognostic nomogram and mitochondrial-related gene signature were shown to have clinical applicability. In addition, the analysis of clinical and histopathological features, which bodes well for patient-specific treatment and medical decision-making in the future.
Availability of data and materials
Publicly available datasets were analyzed in this study. The datasets TCGA-COAD and corresponding clinical patient information analyzed for this study can be found in the TCGA Knowledge Base (https://portal.gdc.cancer.gov/repository, accessed on 23 September 2022). Expression profile of GSE39582 in the manuscript was downloaded from the Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/geo/, accessed on 23 September 2022).
The cancer genome atlas
Differentially expressed genes
Receiver operating characteristic
False discovery rate
Kyoto encyclopedia of genes and genomes
Gene set enrichment analysis
Area under the curve
Siegel RL, Miller KD, Goding Sauer A, Fedewa SA, Butterly LF, Anderson JC, et al. Colorectal cancer statistics, 2020. CA Cancer J Clin. 2020;70:145–64.
Weitz J, Koch M, Debus J, Höhler T, Galle PR, Büchler MW. Colorectal cancer. Lancet. 2005;365:153–65.
Dienstmann R, Villacampa G, Sveen A, Mason MJ, Niedzwiecki D, Nesbakken A, et al. Relative contribution of clinicopathological variables, genomic markers, transcriptomic subtyping and microenvironment features for outcome prediction in stage II/III colorectal cancer. Ann Oncol. 2019;30:1622–9.
Bosch LJW, Carvalho B, Fijneman RJA, Jimenez CR, Pinedo HM, van Engeland M, et al. Molecular tests for colorectal cancer screening. Clin Colorectal Cancer. 2011;10:8–23.
Erickson LA. Adenocarcinoma of the colon and microsatellite instability. Mayo Clin Proc. 2018;93:669–70.
Yuan Y, Ju YS, Kim Y, Li J, Wang Y, Yoon CJ, et al. Author Correction: Comprehensive molecular characterization of mitochondrial genomes in human cancers. Nat Genet. 2020;52(3):342–52.
Lawrence MS, Stojanov P, Polak P, Kryukov GV, Cibulskis K, Sivachenko A, et al. Mutational heterogeneity in cancer and the search for new cancer-associated genes. Nature. 2013;499:214–8.
Pustylnikov S, Costabile F, Beghi S, Facciabene A. Targeting mitochondria in cancer: current concepts and immunotherapy approaches. Transl Res. 2018;202:35–51.
Burke PJ. Mitochondria, bioenergetics and apoptosis in cancer. Trends Cancer. 2017;3:857–70.
Vyas S, Zaganjor E, Haigis MC. Mitochondria and cancer. Cell. 2016;166:555–66.
Han Y, Kim B, Cho U, Park IS, Kim SI, Dhanasekaran DN, et al. Mitochondrial fission causes cisplatin resistance under hypoxic conditions via ROS in ovarian cancer cells. Oncogene. 2019;38:7089–105.
Kaipparettu BA, Ma Y, Park JH, Lee T-L, Zhang Y, Yotnda P, et al. Correction: crosstalk from non-cancerous mitochondria can inhibit tumor properties of metastatic cells by suppressing oncogenic pathways. PLoS ONE. 2019;14: e0221671.
Cardamone MD, Tanasa B, Cederquist CT, Huang J, Mahdaviani K, Li W, et al. Mitochondrial retrograde signaling in mammals is mediated by the transcriptional cofactor GPS2 via direct mitochondria-to-nucleus translocation. Mol Cell. 2018;69:757–772.e7.
Kok VC, Yu CC. Cancer-derived exosomes: their role in cancer biology and biomarker development. Int J Nanomedicine. 2020;15:8019–36.
Wu L, Qu X. Cancer biomarker detection: recent achievements and challenges. Chem Soc Rev. 2015;44(10):2963–97.
Smith AC, Robinson AJ. MitoMiner v4.0: an updated database of mitochondrial localization evidence, phenotypes and diseases. Nucleic Acids Res. 2018;47:D1225-1228.
Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, Smyth GK. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43(7):e47–e47.
Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28:27–30.
Kanehisa M. Toward understanding the origin and evolution of cellular organisms. Protein Sci. 2019;28:1947–51.
Kanehisa M, Furumichi M, Sato Y, Ishiguro-Watanabe M, Tanabe M. KEGG: integrating viruses and cellular organisms. Nucleic Acids Res. 2021;49:D545–51.
Yu G, Wang LG, Han Y, et al. clusterProfiler: an R package for comparing biological themes among gene clusters. Omics J Integr Biol. 2012;16(5):284–7.
Wickham H. ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York. 2016; ISBN 978-3-319-24277-4. https://ggplot2.tidyverse.org.
Walter W, Sanchez-Cabo F, Ricote M. GOplot: an R package for visually combining expression data with functional analysis. Bioinformatics. 2015;31(17):2912–4.
Thrumurthy SG, Thrumurthy SSD, Gilbert CE, Ross P, Haji A. Colorectal adenocarcinoma: risks, prevention and diagnosis. BMJ. 2016;354: i3590.
Arena S, Corti G, Durinikova E, Montone M, Reilly NM, Russo M, et al. A subset of colorectal cancers with cross-sensitivity to olaparib and oxaliplatin. Clin Cancer Res. 2020;26:1372–84.
Seiwert N, Heylmann D, Hasselwander S, Fahrer J. Mechanism of colorectal carcinogenesis triggered by heme iron from red meat. Biochim Biophys Acta Rev Cancer. 2020;1873: 188334.
Luo C, Cen S, Ding G, Wu W. Mucinous colorectal adenocarcinoma: clinical pathology and treatment options. Cancer Commun (Lond). 2019;39:13.
Chen L, Lu D, Sun K, Xu Y, Hu P, Li X, et al. Identification of biomarkers associated with diagnosis and prognosis of colorectal cancer patients based on integrated bioinformatics analysis. Gene. 2019;692:119–25.
Zuo S, Dai G, Ren X. Identification of a 6-gene signature predicting prognosis for colorectal cancer. Cancer Cell Int. 2019;19:6.
Akbari M, Kirkwood TBL, Bohr VA. Mitochondria in the signaling pathways that control longevity and health span. Ageing Res Rev. 2019;54: 100940.
Koch RE, Josefson CC, Hill GE. Mitochondrial function, ornamentation, and immunocompetence. Biol Rev Camb Philos Soc. 2017;92:1459–74.
Kroemer G, Pouyssegur J. Tumor cell metabolism: cancer’s Achilles’ heel. Cancer Cell. 2008;13(6):472–82.
Weinberg SE, Chandel NS. Targeting mitochondria metabolism for cancer therapy. Nat Chem Biol. 2015;11(1):9–15.
Porporato PE, Filigheddu N, Pedro JMB, Kroemer G, Galluzzi L. Mitochondrial metabolism and cancer. Cell Res. 2018;28(3):265–80.
Peiris-Pagès M, Martinez-Outschoorn UE, Pestell RG, Sotgia F, Lisanti MP. Cancer stem cell metabolism. Breast Cancer Res. 2016;18(1):55.
Yan L-R, Wang A, Lv Z, Yuan Y, Xu Q. Mitochondria-related core genes and TF-miRNA-hub mrDEGs network in breast cancer. Biosci Rep. 2021;41(1):BSR20203481.
Buondonno I, Gazzano E, Jean SR, Audrito V, Kopecka J, Fanelli M, et al. Mitochondria-targeted doxorubicin: a new therapeutic strategy against doxorubicin-resistant osteosarcoma. Mol Cancer Ther. 2016;15:2640–52.
Yang Y, Karakhanova S, Hartwig W, D’Haese JG, Philippov PP, Werner J, et al. Mitochondria and mitochondrial ROS in cancer: novel targets for anticancer therapy. J Cell Physiol. 2016;231:2570–81.
Schell JC, Olson KA, Jiang L, Hawkins AJ, Van Vranken JG, Xie J, et al. A role for the mitochondrial pyruvate carrier as a repressor of the Warburg effect and colon cancer cell growth. Mol Cell. 2014;56:400–13.
Feng S, Xiong L, Ji Z, Cheng W, Yang H. Correlation between increased copy number of mitochondrial DNA and clinicopathological stage in colorectal cancer. Oncol Lett. 2011;2:899–903.
Jones NP, Schulze A. Targeting cancer metabolism-Aiming at a tumour’s sweet-spot. Drug Discov. 2012;17:232–41.
Luengo A, Gui DY, Heiden MGV. Targeting metabolism for cancer therapy. Cell Chem Biol. 2017;24:1161–80.
Marín-Hernández A, Gallardo-Pérez JC, Ralph SJ, Rodríguez-Enríquez S, Moreno-Sánchez R. HIF-1α modulates energy metabolism in cancer cells by inducing over-expression of specific glycolytic isoforms. Mini Rev Med Chem. 2009;9(9):1084–101.
Papandreou I, Cairns RA, Fontana L, Lim AL, Denko NC. HIF-1 mediates adaptation to hypoxia by actively downregulating mitochondrial oxygen consumption. Cell Metab. 2006;3(3):187–97.
Rasheed S, Harris AL, Tekkis PP, Turley H, Silver A, McDonald PJ, et al. Hypoxia-inducible factor-1alpha and -2alpha are expressed in most rectal cancers but only hypoxia-inducible factor-1alpha is associated with prognosis. Br J Cancer. 2009;100(10):1666–73.
Bricker DK, Taylor EB, Schell JC, Orsak T, Boutron A, Chen YC, et al. A mitochondrial pyruvate carrier required for pyruvate uptake in yeast, Drosophila, and humans. Science. 2012;337(6090):96–100.
Schell JC, Olson KA, Jiang L, Hawkins AJ, Van Vranken JG, Xie J, et al. A role for the mitochondrial pyruvate carrier as a repressor of the Warburg effect and colon cancer cell growth. Mol Cell. 2014;56(3):400–13.
Feng S, Xiong L, Ji Z, Cheng W, Yang H. Correlation between increased copy number of mitochondrial DNA and clinicopathological stage in colorectal cancer. Oncol Lett. 2011;2(5):899–903.
Li YP, Liu GX, Wu ZL, Tu PH, Wei G, Yuan M, et al. A novel mitochondrial-related gene signature for the tumor immune microenvironment evaluation and prognosis prediction in lung adenocarcinoma. J Immunol Res. 2022;2022:5366185.
Wang Y, Song F, Zhang X, Yang C. Mitochondrial-related transcriptome feature correlates with prognosis, vascular invasion, tumor microenvironment, and treatment response in hepatocellular carcinoma. Oxid Med Cell Longev. 2022;2022:1592905.
Jiang X, Xia Y, Meng H, Liu Y, Cui J, Huang H, et al. Identification of a nuclear mitochondrial-related multi-genes signature to predict the prognosis of bladder cancer. Front Oncol. 2021;11:1.
Zhang X, Dong W, Zhang J, Liu W, Yin J, Shi D, et al. A novel mitochondrial-related nuclear gene signature predicts overall survival of lung adenocarcinoma patients. Front Cell Dev Biol. 2021;9:1.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: tool for the unification of biology. Gene Ontol Consort Nat Genet. 2000;25:25–9.
Hanahan D, Weinberg RA. Hallmarks of cancer: the next generation. Cell. 2011;144:646–74.
Bergers G, Fendt S-M. The metabolism of cancer cells during metastasis. Nat Rev Cancer. 2021;21:162–80.
Lyssiotis CA, Kimmelman AC. Metabolic Interactions in the Tumor Microenvironment. Trends Cell Biol. 2017;27:863–75.
Caro P, Kishan AU, Norberg E, Stanley IA, Chapuy B, Ficarro SB, et al. Metabolic signatures uncover distinct targets in molecular subsets of diffuse large B cell lymphoma. Cancer Cell. 2012;22:547–60.
Ma Y, Temkin SM, Hawkridge AM, Guo C, Wang W, Wang X-Y, et al. Fatty acid oxidation: an emerging facet of metabolic transformation in cancer. Cancer Lett. 2018;435:92–100.
Jordan B, Meeks J. T1 bladder cancer: current considerations for diagnosis and management. Nat Rev Urol. 2019;16(1):23–34.
Wang W, Liu H, Wang S, Hao X, Li L. A diterpenoid derivative 15-oxospiramilactone inhibits Wnt/β-catenin signaling and colon cancer cell tumorigenesis. Cell Res. 2011;21:730–40.
Wu C, Lyu J, Yang EJ, Liu Y, Zhang B, Shim JS. Targeting AURKA-CDC25C axis to induce synthetic lethality in ARID1A-deficient colorectal cancer cells. Nat Commun. 2018;9:3212.
Zhu J, Wang S, Bai H, Wang K, Hao J, Zhang J, et al. Identification of five glycolysis-related gene signature and risk score model for colorectal cancer. Front Oncol. 2021;11: 588811.
Cao XP, Cao Y, Li WJ, Zhang HH, Zhu ZM. P4HA1/HIF1α feedback loop drives the glycolytic and malignant phenotypes of pancreatic cancer. Biochem Biophys Res Commun. 2019;516:606–12.
We would like to thank all the workers involved in TCGA database system design and maintenance. Thank these workers for providing public shared sequencing database to researchers to further analysis.
This work was supported by the grants from the National Natural Science Foundation of China (No. 81701570) and 345 Talent Project of Shengjing hospital of China Medical University.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Kaplan-Meier plots of another two prognostic mitochondrion-related genes signature (P > 0.05). Figure S2: Decision Curve Analysis for the risk score model. Figure S3: Correlation analysis Association between risk score and clinicopathological characteristics (P > 0.05). Figure S4: Correlation analysis between risk score and immune checkpoint expression of PDCD1 and PDCD1LG2. (PDF 2274 kb)
Raw data. (ZIP 320499 kb)
249 mitochondrion-related DEGs in TCGA-COAD. (XLSX 30 kb)
The coefficient, 95% CI and P value of each candidate gene in mitochondrial-related gene signature model by univariate and multivariate Cox regression. (XLSX 11 kb)
About this article
Cite this article
Gao, H., Xing, F. A novel signature model based on mitochondrial-related genes for predicting survival of colon adenocarcinoma. BMC Med Inform Decis Mak 22, 277 (2022). https://doi.org/10.1186/s12911-022-02020-3