 Technical advance
 Open Access
 Published:
Predicting COVID19 disease progression and patient outcomes based on temporal deep learning
BMC Medical Informatics and Decision Making volume 21, Article number: 45 (2021)
Abstract
Background
The coronavirus disease 2019 (COVID19) pandemic has caused health concerns worldwide since December 2019. From the beginning of infection, patients will progress through different symptom stages, such as fever, dyspnea or even death. Identifying disease progression and predicting patient outcome at an early stage helps target treatment and resource allocation. However, there is no clear COVID19 stage definition, and few studies have addressed characterizing COVID19 progression, making the need for this study evident.
Methods
We proposed a temporal deep learning method, based on a timeaware long shortterm memory (TLSTM) neural network and used an online open dataset, including blood samples of 485 patients from Wuhan, China, to train the model. Our method can grasp the dynamic relations in irregularly sampled time series, which is ignored by existing works. Specifically, our method predicted the outcome of COVID19 patients by considering both the biomarkers and the irregular time intervals. Then, we used the patient representations, extracted from TLSTM units, to subtype the patient stages and describe the disease progression of COVID19.
Results
Using our method, the accuracy of the outcome of prediction results was more than 90% at 12 days and 98, 95 and 93% at 3, 6, and 9 days, respectively. Most importantly, we found 4 stages of COVID19 progression with different patient statuses and mortality risks. We ranked 40 biomarkers related to disease and gave the reference values of them for each stage. Top 5 is Lymph, LDH, hsCRP, Indirect Bilirubin, Creatinine. Besides, we have found 3 complications  myocardial injury, liver function injury and renal function injury. Predicting which of the 4 stages the patient is currently in can help doctors better assess and cure the patient.
Conclusions
To combat the COVID19 epidemic, this paper aims to help clinicians better assess and treat infected patients, provide relevant researchers with potential disease progression patterns, and enable more effective use of medical resources. Our method predicted patient outcomes with high accuracy and identified a fourstage disease progression. We hope that the obtained results and patterns will aid in fighting the disease.
Background
Coronavirus disease 2019 (COVID19) outbreaks have caused health concerns worldwide since December 2019; the disease was declared a pandemic by the World Health Organization (WHO) on 11 March 2020 [1]. Over seven million cases of COVID19 have been reported worldwide, including more than 400,000 deaths (as of 15 June 2020) [2]. Even though the disease has been controlled in certain countries, the WHO director warns the pandemic is still ‘Speeding Up’ [3]. Because of its sudden onset, many hospitals are still facing medical resource shortages. For example, news in [4] reported a lack of medical resources in New Delhi. In [5], Arizona has experienced recordhigh hospital capacity as coronavirus cases climb. A reasonable allocation of resources according to patient condition is needed.
The solution to this problem involves determining the stages of disease progression by subtyping and predicting the outcome of COVID19 patients. Then, targeted treatment and medical resource allocation can be carried out for patients in different stages. Recent studies [6,7,8,9,10,11] have used statistical methods to analyze COVID19 progress by inpatient symptoms. However, different statistical results were obtained by considering different patient groups and different symptoms. At present, there is no clear division of the stages of COVID19 progression.
Longitudinal disease analysis is the key to understanding disease progression, designing prognoses and developing early diagnostic tools. The time dynamics of disease can provide more information than static symptom observation [12]. Considering the complex patient states, the amount of interventions and the realtime requirement, the datadriven machine learning approaches by learning from electronic health records are the desiderata to help clinicians [13].
Many existing works have used machine learning methods for COVID19 prediction tasks. We have summarized them in Table 1. For example, in most method of [27] and in [1, 14,15,16,17,18,19], authors used nondeep learning methods, such as kNN, LR, Cox, SVM and DT to classify CT/Xray images and predict the outcomes of COVID19 patients. However, in terms of prediction accuracy, nondeep learning is not as good as deep learning methods. Deep learning methods can train the parameters with complex nonlinearity to learn the data structures and have achieved stateoftheart in many medical prediction tasks [28,29,30]. Thus, many current works apply deep learning methods for COVID19 prediction tasks [17, 19,20,21,22,23,24,25,26]. However, these methods either use the simple multilayer perceptron for predicting or use the convolutional structures for image classification. Both the above methods ignored the temporal development of patient’s status. In the realworld patient records, except for the basic information, vital signs, test values and diagnoses are both time series, especially for the blood samples of COVID19 patients, the data we used in this paper.
Recently, a deep learning method, recurrent neural network (RNN) [31] can efficiently model temporal sequences. It uses recursion in the direction of sequence evolution to learning the relations among past, present and future. But the basic RNN has the longterm dependency problems [32]. Meanwhile, RNN only process uniformly distributed longitudinal data while COVID19 patient blood samples are distributed nonuniformly with irregular time intervals between observations. Thus, a method that can model this irregular time series of COVID19 patients is needed.
In this paper, we retrospectively analyzed the blood samples of 485 patients from the region of Wuhan, China. The medical records collected with standard case report forms, including epidemiological, demographic, clinical, laboratory and mortality outcome information, from an online open dataset under an MIT license. We applied a temporal deep learning method Timeaware Long Shortterm Unit (TLSTM) to model the irregular time series of COVID19 patients. TLSTM can predict the mortality with more than 98% accuracy before 3 days. Meanwhile, we have discovered four stages of COVID19 patients. According to the different stages, we gave the analysis of the patient’s state and found the related biomarkers and complications.
Methods
In this section, we first introduce the COVID19 dataset and the data preprocessing process. Then, we describe the methods for mortality prediction and disease progression in detail.
Dataset description
Blood index values can reflect a COVID19 patient’s physical condition [10]. COVID19 patients’ blood samples were collected between 10 January and 18 February 2020 at Tongji Hospital of Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China [33]. The dataset contains 80 characteristics from 375 patients with 6120 records as a training set and 110 patients with 757 records as a test set. A case of sample is shown in Fig. 1. It draws lines of the time series of LHD, lymph and hsCRP of a 70yearold female patient during hospitalization. We can see the time intervals between two observations are irregular, which could be a few minutes or even days.
The detailed statistical information of demographic and 74 clinical laboratory test features is listed Table 2. For example, in the dataset, the average age of patients is 58.83, the survival rate is 53.6% and the ratio of male to female is about 1.5:1. We also list the range and mean value of each feature. In Fig. 1, we display the distributions of some features (age, gender, LHD, lymph and hsCRP) of survival class (0) and death class (1).
This COVID19 blood test data is publicly available at https://github.com/HAIRLAB/Pre_Surv_COVID_19.
Dataset preprocessing
First, we attempted to find a suitable time measurement granularity. In the raw dataset, the lengths of sequences are unequal and different sampling times result in missing data, with an 85% missing rate on average. The missing rate is expressed in Eq. 1. N_{missing} means the number of time points with missing data in one time series. N_{all} means the number of time points in that time series. The presence of vacancies has a large impact on data quality, resulting in unstable predictions and other unpredictable effects [34]. We used 3 days as the basic sampling interval, reducing the average mr below 30%. The time series length of raw data, the average missing rate and the missing rate for each feature are shown in Fig. 1.
Meanwhile, for feature selection, using all 74 laboratory test features is unrealistic. To address the high missing rate, repeated features and collection difficulties, we considered three key features: lactic dehydrogenase (LDH), lymphocytes (lymph) and highsensitivity Creactive protein (hsCRP). These features contain specific research biomarkers of COVID19 patients [33] and can be easily collected in any hospital. Considering that only three features may not achieve high prediction accuracy, we also select 40 features (listed in Table 7) with missing rate less than 30% for comparative experiment.
TLSTM
Recurrent neural networks (RNNs) [31] (the first structure in Fig. 2) are deep network architectures designed to model temporal sequences. They take sequence data as input, recursion occurs in the direction of sequence evolution, and all units are chained together. In basic RNN (the second structure in Fig. 2), the current state h_{t} is affected by the previous state h_{t − 1} and the current input x_{t} and is described as h_{t} = σ(Wx_{t} + Uh_{t − 1} + b), where σ is an activation function, and W, U and b are learnable parameters. Long ShortTerm Memory (LSTM) [32] (the third structure in Fig. 2) is a variant of RNN that is adept at solving longterm dependency problems. A standard LSTM unit consists of a forget gate f_{t}, an input gate i_{t}, memory cells C_{t}, \(\overset{\sim }{C_t}\) and an output gate o_{t}.
However, RNNs only process uniformly distributed longitudinal data by assuming that the sequences have an equal distribution of time differences. COVID19 patient blood samples are distributed nonuniformly. For example, the time gap between two sequential records could be hours or days. Timeaware Long ShortTerm Memory (TLSTM) [35] (the fourth structure in Fig. 2) incorporates the elapsed time information into LSTM. It applies a memory discount to capture the irregular temporal dynamics. TLSTM can be formulated as:
In Eq. 2, based on the basic LSTM, TLSTM possesses some new designs. \({C}_{t1}^S\) component learns the shortterm memory of sequence by learnable network parameters. \({C}_{t1}^T\) is the longterm memory calculated from the former memory cell C_{t − 1} with getting rid of \({C}_{t1}^S\). \({C}_{t1}^S\) is adjusted to the discounted shortterm memory \({\hat{C}}_{t1}^S\) by the elapsed time function g(Δ_{t}). The previous memory \({C}_{t1}^{\ast }\) is changed to the complement subspace of \({C}_{t1}^T\) combined with \({\hat{C}}_{t1}^S\).
We use a log calculation for the elapsed time function. Δ_{t} describes the time gap between two records at two sequential time steps t and t − 1. T_{t} is the actual time at time step t.
Analysis strategy
We first describe the two tasks in this study and then introduce the specific methods. The whole method process is shown in Fig. 3.
Task 1 (Outcome prediction ) A set of labeled patient data is represented as \(\mathcal{D}=\left\{\left({x}_i,{c}_i\right)\in \left(X,C\right)i=1,\dots, n\right\}\). X is a time series set of patients, where \({x}_i=\left\{{x}_i^tt=1,\dots, {t}_{onset}\right\}\) represents a patient’s records over t time steps; specifically, \({x}_i^t\) is multivariate data, and each dimension is a clinical record represented by a numeric vector. C ∈ {0, 1} is the outcome, where class 0 means death and class 1 means survival. The outcome prediction task aims to predict patient outcomes by the prediction function f : X → C
Task 2 (Temporal patient subtyping / Disease progression mining) The goal is to find patient groups G = {g_{i} i = 0, …, m} with similar feature representation \(R=\left\{{r}_i^ti=0,\dots, n;t=0,\dots, {t}_{onset}\right\}\). \({r}_i^t\) is the representation of clinical record \({x}_i^t\) at time t. Then, the patient groups G distributed over time are used to analyze the stages of disease progression
In COVID19 patient outcome prediction task, TLSTM is used to handle patient record sequences and then make the prediction. The process is displayed in the proposed method of Fig. 2, in the lower gray area.
For a patient i, the input of TLSTM at time step t is a threedimensional feature vector \({x}_i^t=\left[{v}_{LDH},{v}_{lymphocytes},{v}_{hs CRP},\right]\) with time gap Δ_{t}. The output is the state representation s_{i} at the last time step. We apply this outcome prediction task as a binary classification task, with two classes: death and survival.
The crossentropy [36] is mainly used to measure the difference between two probability distributions. We expect our predicted distribution of patient outcomes to be closer to the true distribution. Thus, we use the crossentropy loss function in Eq. 4. Besides, when using sigmoid active function, this loss can avoid the reduced learning rate causing by traditional mean square error loss when gradient decreases.
p(x) is the prior probability (true label vector) and q(x) is the prediction probability (predicted results vector). Correspondingly, \(\hat{C}\) is the real class of input data, and C represents the prediction class.
In COVID19 patient disease progression task, temporal patient subtyping can uncover the dynamic characteristics of diseases by significantly nuanced subtyping, which leads to the potential stages of disease progression. We addressed the issue by building a time stage reference and providing a lowdimensional representation of each subject, encoding his or her position with respect to this reference.
The method structure is displayed in the upper gray area of proposed method in Fig. 2. It has 4 steps: 1) Acquisition of patient representation r^{t}. We used the hidden state h_{t}, extracted from every TLSTM unit, as the patient’s representation r^{t} at time step t. 2) Dimension reduction of r^{t}. For better demonstration, we used the tdistributed Stochastic Neighbor Embedding (tSNE) [37] method to reduce these highdimensional vectors r^{t} into two dimensions. 3) Obtaining the patient group set G. As prior information about the patient groups was not available, we acquired patient groups by applying an unsupervised clustering method, the DensityBased Spatial Clustering of Applications with Noise (DBSCAN) [38], on r^{t}. 4) Analysis of G and stages of disease progression. The mortality rate MR, and the average time distance TD were calculated as the analysis criteria.
Equation 5 expresses the mortality rate. N_{death} is the number of patients with the death outcome and N_{patient} is the total number of patients. Eq. 6 expresses the average time distance. T_{t} means the current prediction time and \({T}_{t_{onset}}\) means the time of outcome. ∣g_{k}∣ is the number of patients in group g_{k}.
Evaluation metrics
The prediction results were evaluated by assessing the area under the curve of the Receiver Operating Characteristic (AUCROC). The ROC is a curve of the True Positive Rate (TPR) and the False Positive Rate (FPR). TN, TP, FP and FN represent true positives, true negatives, false positives and false negatives, respectively.
The patient groups obtained by unsupervised clustering were evaluated by the CalinskiHarabaz Index (CH), which measures the covariance of data within a class and between classes. A larger CH value indicates a better clustering performance. In Eq. 9, m is the number of data and k is the number of groups. B_{k} and W_{k} respectively represent the covariance matrices between groups and within groups.
When we get the stages of COVID19 patients, we used KullbackLeibler Divergence (KL divergence) to analyze patient characteristics through each laboratory test feature. KL divergence can measure the degree of difference between two probability distributions. For each feature, we first establish the Gaussian distribution \(\mathcal{N}\left(\mu, {\sigma}^2\right)\) with expected value μ and variance σ^{2} at each stage. Then, we calculate the average KL divergence of the distribution of adjacent stages. If the average KL divergence of a feature is large, it more likely is a biomarker to distinguish different stages. The basic KL divergence of distribution p(X) and q(X) and the KL divergence of two univariate Gaussian distributions are in Eq. 10 and 11.
For measure and evaluate each feature, we use the average KL divergence (Average KL) between neighbor stages g_{i}, g_{i + 1}. m is the number of groups.
Results
We used the records of 375 patients as a training set; the ratio of the training set to the verification set was 0.8:0.2. The records of 110 patients made up the test set. This experiment was conducted on 5fold crossvalidation. The code implementation is publicly available at https://github.com/scxhhh/COVID19.
Baselines
We use the related works summarized in Table 1 as comparison methods. Related works are divided into nondeep learning methods and deep learning methods. We use Cox [19], kNN [16], SVM [17], DT [1], BPNN [20], PNN [21], RNN, LSTM and TLSTM for COVID19 mortality prediction. TLSTM is our method.
Outcome prediction results
Table 3 shows the results of COVID19 mortality prediction using baselines. The AUCROC is evaluated at 0, 3, 6, 9, 12, 15, and 18 days early. Here, the results are obtained when the patient’s representations are 64 dimensional. The results indicate that our method TLSTM performed better than all of baselines no matter how early before the onset times of patients. More precisely, using TLSTM, the outcome prediction accuracy is above 90% at 12 days early and is approximately 97% accurate when predicting 3 days before the disease outcome. More detailed results of train, validation and test sets using TLSTM are listed in Table 4.
The first four figures in Fig. 3 are the visualizes of prediction results. The first two figures are the AUCROC of prediction results of baselines and TLSTM in different earliness. The third figure is the changes of prediction accuracy and crossentropy loss when training the model. The fourth figure represents the relation of patient representation dimension and AUCROC of prediction using TLSTM. Too few dimensions lead to incomplete feature learning, while too many dimensions lead to redundant calculations and easy overfitting. Considering result accuracy, computational complexity and ease of representation use in the following task, we decided to use 64 dimensional vectors to represent patients.
Based on prediction results, we found: 1) Deep learning approaches (TLSTM, RNN, PNN and BPNN) has higher COVID19 outcome prediction accuracy than nondeep learning approaches (Cox, kNN, SVM and DT) as they have completed the highly nonlinear feature transformation by neural junction structures. 2) RNNbased models (TLSTM and RNN) performance better on time series data as they contain state connections for reproducing time delays and output feedback connections for forming a loop. 3) Timeaware model (TLSTM) has the best performance as it can model the time series with irregular time intervals, which is a prominent feature of COVID19 blood sample dataset.
Further, we also select 40 features (listed in Table 7) as the input of TLSTM for comparative experiment. The results in Table 5 indicate that learning a large number of patient characteristics does not necessarily contribute to COVID19 patient mortality prediction task. The three biomarkers, LDH, lymph and hsCRP can make the results better. The AUCROC of using 3 features is 3% higher than using 40 features on average. This is because too many features will introduce redundant and irrelevant dependencies leading by redundant features.
Disease progression results
By implementing the four steps of disease progression mining, we obtained the 4 stages in both the death class (critical) and the survival class (general), shown in Fig. 4.
For better visualization, we reduced the dimension of the patient’s representation vector, the fifth figure in Fig. 3 is the dimension reduction effect. We chose 2 dimensions due to low representation loss and clear observation. Besides, the DBSCAN clustering effect evaluated by the CH index is shown in the sixth and seventh figures in Fig. 3. Different clustering effects can be obtained by changing the cluster radius parameter ε. The best CH index values for the death class and the survival class are 680.07 and 44.24, respectively.
In this case, both classes have four groups. Four stages of COVID19 patients are shown in Fig. 4. For each stage, we calculate the mortality rate MR and the average time distance TD. For the death class, MR increases over stages and is 100% at stage 4. For the survival class, MR decreases over stages and is 0% in stage 4. TD in both classes decreases, meaning that the 4 stages are distributed over time. Meanwhile, as the CH index of the survival class is higher than that of the death class, the survival class stages are relatively loosely distributed.
In Fig. 4, the first clustering is obtained by using biomarkers directly and shows that reasonable stages could not be found. In the first clustering, no stage is clustered in the death class and the 2 stages in the survival class have similar mortality rates and no time difference, as the shade of blue indicates. However, using our method, different stages have obvious differences, such as the data point color deepening with the stages. Meanwhile, as shown in the two insets, the class boundary is clearer based on our method.
The division of stages contains the potential characteristics of COVID19. Here, we present three findings. First, at the time of initial diagnosis, the COVID19 infected patients’ physical conditions are similar, regardless of final survival or death. In Fig. 4, the distance between stage 1 for the death class and the survival class is small, and the two even overlap. This indicates that outcome prediction is likely not accurate at the time of infection. Second, the physical condition of patients who eventually die changes less than that of those who eventually survive. We conclude this from CH index values, where the CH value of the survival class is larger than that for the death class. Third, mortality rate varies by stage. For example, if the patient is classified into the death class and is at stage 1, there is still hope of survival, as shown by the green triangle sample in Fig. 4. However, if the patient is in stage 3 or 4, he or she is very likely to die. Based on estimating the current stage of a patient, doctors will be given a reference, which can help them assess a patient’s current situation. Based on that, doctors can carry out targeted treatment and reasonable resource allocation more easily. Thus, the ultimate goal of this study, helping improve the quality of medical care, can be achieved.
Meanwhile, we calculated the mean values of 40 laboratory test features in each stage, the feature values vary with stages. Table 6 lists 10 of these features  Lymph, LDH, hsCRP, Indirect Bilirubin, Creatinine, INR, Serum Sodium, eGFR, Serum Chlorine and Albumin. The changes of values through 4 stages are visualized in Fig. 5. Under different classes, the trends of features are different.
Further, we calculated the average KL divergence between adjoint stages of each features in 40 clinical laboratory tests data. We ranked the average KL values. The higher the ranking, the better the biomarkers can be used to distinguish different stages. By ranking 40 biomarkers according to the degree of correlation with COVID19 (Table 7), we have found the biomarkers which are more relevant to COVID19. The top 10 are: Lymph, LDH, hsCRP, Indirect Bilirubin, Creatinine, INR, Serum Sodium, eGFR, Serum Chlorine and Albumin. For each marker, we gave its reference value in each stage, shown in Table 6. Different markers have unique trends in different stages.
Combining the correlation analysis with the reference value analysis, we found that the critical COVID19 patients are usually accompanied by low values of lymph, eGFR, albumin and Serum Sodium, high values of LDH, hsCRP, indirect bilirubin, creatinine and INR. For example, in the critical stage 4, the average lymph (%) is just 4 and the average LDH (U/l) is up to 499. Besides, there are three major complications of COVID19 patients  myocardial injury, liver function injury and renal function injury. We got the conclusions separately through the value of 1) LDH, 2) albumin and indirect bilirubin, 3) serum sodium, serum chlorine and creatinine in different stages.
Discussion
In recent years, deep learning (DL) technology has been widely used because of its superior performance in various medical applications [28, 29], such as medical image recognition [39] and medication recommendations [40]. In the past year, the spread of COVID19 has had a peripheral effect on the global economy and health. Therefore, we expect to combine DL methods to study and fight COVID19.
The states of COVID19 patients in hospital are dynamic time sequence processes. In addition to the basic information of patients, the vital signs, diagnoses and other lab tests are all time series. Existing many works [14,15,16,17,18,19,20,21,22,23,24,25,26,27, 41, 42] have achieved good results for COVID19 prediction tasks. But they paid little attention to analyze and model the characteristics of COVID19 patients’ time series. Dynamic time series modeling can grasp the relationship between historical observations and current observations, and learn the potential development mode of sequence, which is conducive to more accurate prediction and representation. Besides, we have found that the time series of COVID19 patients is irregularly sampled  Different time intervals exist in adjacent observations. Every possible test is not regularly measured during an admission. When a certain symptom worsens, corresponding variables are examined more frequently; when the symptom disappears, the corresponding variables are no longer examined. These time intervals will add a time sparsity factor when the intervals between observations are large [13]. Therefore, it is necessary not only to deal with time series, but also to deal with irregular time series according to the characteristics of COVID19 patients. In this paper, we use timeaware LSTM model solved this problem.
Deep learning methods have outstanding performance in prediction tasks. If a doctor predicts survival or death only by observing the biomarkers and using a threshold, the accuracy is at or below 80% for early predictions. However, the clinical reference value of inaccurate results is very low [43, 44]. The DL method has better performance, and the timeaware aspect enables higher accuracy, as shown in Table 3.
However, there are some concerns about the use of DL methods in the highrisk tasks of healthcare.
First, it may be risky to apply predictive methods directly to clinical practice [45]. DL methods may be assistive tools for doctors but not used to make decisions directly. It is challenging for doctors to make optimal decisions, a datadriven and highaccuracy prediction method could help. In this paper, we can predict patient outcomes with higher accuracy than baselines. The method can effectively predict whether the infected patient will die or survive 12 days prior to disease outcome with over 90% accuracy. The prediction accuracies at 3, 6, and 9days prior are 98, 95 and 93%, respectively.
Second, the DL method is the blackbox models which are troubled by poor interpretability [46, 47], but clinical settings prefer interpretable models. For example, finding the appropriate predictionrelated biomarkers is important. Currently, certain studies have identified suitable predictive biomarkers, such as the 3 biomarkers in [33], which are regarded to have a significant impact on patient mortality. For interpretability, our method identified four disease stages distributed over time. This interesting finding cannot be distinguished simply by the value of biomarkers, as shown as the comparison of two clustering results in Fig. 4. The discovered stages are closely related to mortality and time of illness and can help analyze the status of infected patients. This shows that the DL method can explore new patterns in multidimensional space that cannot be demonstrated by a simple variable value [48]. We also ranked 40 biomarkers according to the degree of correlation with COVID19 progression, which can provide interpretable results to help doctors better understand the model.
This study has three basic contributions. 1) we can predict patient outcomes with higher accuracy than all baselines. 2) We identified four stages of COVID19 progression. The stages are closely related to mortality and time of illness and can help analyze the status of infected patients. 3) We give the ranking of 40 biomarkers according to the degree of correlation with COVID19. Based on this, we found three major complications of COVID19 patients  myocardial injury, liver function injury and renal function injury.
Further, there is room for further improvement. First, because of the data limitations, our method may face risk of bias, because datadriven methods are easily influenced by different source of data. For example, the results may vary when using different datasets [45]. Second, our current interpretation is based on results, such as the degree of association between biomarkers and disease. We hope to give more explanations about the complex DL blackbox model, such as telling more specific effect of each part of the model on the result. Meanwhile, we hope to enlighten the relevant researchers to further study these 4 stages and present more clinical explanations. In particular, we expect to be able to give specific treatments for different stages. Targeted treatment is significant for both patient rehabilitation and the reasonable allocation of medical resources.
Conclusions
The sudden outbreak and epidemic of COVID19 has led to worldwide suffering and shortages of medical resources. In this paper, we propose TLSTM to predict patient outcomes with high accuracy  98, 95 and 93% at 3, 6, and 9 days, which will enable reasonable allocation of medical resources. TLSTM can effectively model the irregular sampled time series in blood test samples of COVID19 patients and predict more accurately than existing baselines. Meanwhile, we identified four COVID19 stages. We ranked 40 biomarkers according to correlations to the outcomes of patients, gave the reference values of top 10 biomarkers for each stage. The top 10 biomarkers are: Lymph, LDH, hsCRP, Indirect Bilirubin, Creatinine, INR, Serum Sodium, eGFR, Serum Chlorine and Albumin. We also found 3 complications of COVID19, which are myocardial injury, liver function injury and renal function injury. By analyzing patients’ life conditions at different stages, doctors can choose specific, targeted treatments. Future work will focus more on the study of pathological characteristics of different stages. Aiming at four stages, targeted treatments are expected to be designed. Meanwhile, more real clinical data are expected to be available for model validation and the model will be used to mine the inherent hidden features of other diseases.
Availability of data and materials
The code implementation is publicly available at https://github.com/scxhhh/COVID19. The data is from an online open dataset https://github.com/HAIRLAB/Pre_Surv_COVID_19 under an MIT license (https://doi.org/10.5281/zenodo.3758806).
Abbreviations
 COVID19:

Corona Virus Disease 2019
 WHO:

World Health Organization
 LDH:

Lactic Dehydrogenase
 hsCRP:

Highsensitivity Creactive Protein
 lymph:

Lymphocytes
 DL:

Deep Learning
 RNN:

Recurrent Neural Network
 LSTM:

Long ShortTerm Unit
 TLSTM:

Timeaware Long ShortTerm Memory
 PNN:

Probabilistic Neural Network
 RBFNN:

Radial Basis Function Neural Network
 GRNN:

Generalized Regression Neural Network
 BPNN:

Back Propagation Neuron Network
 DT:

Decision Tree
 RF:

Random Forest
 XGBoost:

eXtreme Gradient Boosting
 SVM:

Support Vector Machines
 Cox:

Cox’s Proportional Hazards Regression
 LR:

Linear Regression
 NB:

Naive Bayes
 LDA:

Linear Discriminant Analysis
 tSNE:

tdistributed Stochastic Neighbor Embedding
 DBSCAN:

DensityBased Spatial Clustering of Applications with Noise
 MR:

Mortality Rate
 TD:

Average Time Distance
 AUCROC:

The Area Under the Curve of the Receiver Operating Characteristic
 CH:

CalinskiHarabaz Index
 KL divergence:

KullbackLeibler Divergence
References
World Health Organization. Coronavirus Disease 2019 (COVID19) Situation Report 68, 28 March 2020. https://www.who.int/docs/defaultsource/coronaviruse/situationreports/20200328sitrep68covid19.pdf.
World Health Organization. Coronavirus Disease 2019 (COVID19) Situation Report 147, 15 June 2020. https://www.who.int/docs/defaultsource/coronaviruse/situationreports/20200615covid19sitrep147.pdf?sfvrsn=2497a605_2
Emily Czachor. WHO Director Warns COVID19 Pandemic is ‘Speeding Up,’ Here for ‘Long Haul’. Newsweek, News. 6/29/2020. https://www.newsweek.com/whodirectorwarnscovid19pandemicspeedingherelonghaul1514169.
Sébastien Farcis. Coronavirus: worries and worries about bed shortage in New Delhi. Liberation, Reportage. 6/15/2020. https://www.liberation.fr/planete/2020/06/15/coronavirusanewdelhiinquietudeetdesarroifacealapenuriedelits_1791300.
Katherine Fung. Arizona Hits RecordHigh Hospital Capacity as Coronavirus Cases Climb. Newsweek, News. 6/29/2020. https://www.newsweek.com/arizonahitsrecordhighhospitalcapacitycoronaviruscasesclimb1511578.
Huang C, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395:497–506.
Chen N, et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study. Lancet. 2020;395:507–13.
Yang X, et al. Clinical course and outcomes of critically ill patients with SARS CoV2 pneumonia in Wuhan, China: a singlecentered, retrospective, observational study. Lancet Resp Med. 2020;8:475–81.
Zhou F, Yu T, Du R, et al. Clinical course and risk factors for mortality of adult inpatients with COVID19 in Wuhan, China: a retrospective cohort study. Lancet. 2020;395:10229.
Wang D, Hu B, Hu C, Zhu F, et al. Clinical characteristics of 138 hospitalized patients with 2019 novel coronavirus–infected pneumonia in Wuhan, China. JAMA. 2020;323(11):1061.
Yang X, Yu Y, Xu J, Shu H, Xia J, Liu H, et al. Clinical course and outcomes of critically Ill patients with SARSCoV2 pneumonia in Wuhan, China: a singlecentered, retrospective, observational study. Lancet Respir Med. 2020;8(5):1–7.
Spadon G, Hong S, Brandoli B, Matwin S, RodriguesJr JF, Sun J. Pay Attention to Evolution: Time Series Forecasting with Deep GraphEvolution Learning. arXiv preprint arXiv. 2020;2008:12833.
Sun C, Hong S, Song M, Li H. A review of deep learning methods for irregularly sampled medical time series data. arXiv. 2020;2010:12493.
Wang C, Deng R, Gou L, Fu Z, Zhang X, Shao F, et al. Preliminary study to identify severe from moderate cases of COVID19 using NLR&RDWSD combination parameter. 2020. medRxiv 2020.04.09.20058594.
Farid AA, Selim GI, Khater HAA. A novel approach of CT images feature analysis and prediction to screen for corona virus disease (COVID19). Int J Sci Eng Res. 2020;11(3):1–9.
Kumar R, Arora R, Bansal V, Sahayasheela VJ, Buckchash H, et al. Accurate prediction of COVID19 using chest Xray images through deep feature learning model with SMOTE and machine learning classifiers. 2020. medRxiv 2020.04.13.20063461.
Batista AFM, Miraglia JL, Donato THR, Chiavegatto Filho ADP. COVID19 diagnosis prediction in emergency care patients: a machine learning approach. 2020. medRxiv 2020.04.04.20052092.
Li K, et al. The clinical and chest CT features associated with severe and critical COVID19 pneumonia. Investig Radiol. 2020;55(6):327.
Liang W, Yao J, Chen A, et al. Early triage of critically ill COVID19 patients using deep learning. Nat Commun. 2020;11:3543.
Sujath R, Chatterjee JM, Hassanien AE. A machine learning forecasting model for COVID19 pandemic in India. Stoch Env Res Risk Assess. 2020;34(7):959–72.
Dhamodharavadhani S, Rathipriya R, Chatterjee JM. COVID19 mortality rate prediction for India using statistical neural network models. Front Public Health. 2020;8:441.
Panwar H, Gupta PK, Siddiqui MK, et al. Application of deep learning for fast detection of COVID19 in Xrays using nCOVnet. Chaos, Solitons Fractals. 2020;138:109944.
Harsh P, Gupta PK, Siddiqui MK, MoralesMenendez R, Bhardwaj P, Singh V. A deep learning and gradCAM based color visualization approach for fast detection of COVID19 cases using chest Xray and CTScan images. Chaos, Solitons Fractals. 2020;140:110190.
Babukarthik RG, Adiga VAK, Sambasivam G, Chandramohan D, Amudhavel J. Prediction of COVID19 Using Genetic Deep Learning Convolutional Neural Network (GDCNN). IEEE Access. 2020;8:177647–66.
Wang L, Wong A. COVIDNet: A tailored deep convolutional neural network design for detection of COVID19 cases from chest radiography images. arXiv. 2020;2003:09871.
Li L, Qin L, Xu Z, Yin Y, Wang X, Kong B, et al. Using artificial intelligence to detect COVID19 and communityacquired pneumonia based on pulmonary CT: evaluation of the diagnostic accuracy. Radiology. 2020;296(2):E65–71.
Radanliev P, Roure DD, Walton R. Data mining and analysis of scientific research data records on covid 19 mortality, immunity, and vaccine development  in the first wave of the Covid19 pandemic. Diabetes Metab Syndr. 2020;14(5):1121.
Adam G, Rampášek L, Safikhani Z, et al. Machine learning approaches to drug response prediction: challenges and recent progress. Precis Onc. 2020;4:19.
Jalali A, Lonsdale H, Do N, et al. Deep learning for improved risk prediction in surgical outcomes. Sci Rep. 2020;10:9289.
Siuly S, Zhang Y. Medical big data: neurological diseases diagnosis through medical data analysis. Data Sci Eng. 2016;1:54–64.
Williams RJ, Zipser D. A learning algorithm for continually running fully recurrent neural networks. Neural Comput. 1998;1(2):270–80.
Hochreiter S, Schmidhuber J. Long shortterm memory. Neural Computation. 1997;9(8):1735–80.
Yan L, Zhang HT, Goncalves J, et al. An interpretable mortality prediction model for COVID19 patients. Nat Mach Intell. 2020;2:283–8.
Chu X, Ilyas IF, Krishnan S, Wang J. Data cleaning: Overview and emerging challenges, in Proc. Int. Conf. Manage. Data SIGMOD Conf., San Francisco, CA, USA, Jun./Jul; 2016. p. 2201–6.
Baytas I M, Xiao C, Zhang X, et al. Patient Subtyping via TimeAware LSTM Networks. the 23rd ACM SIGKDD International Conference. ACM, 2017.
Corduneanu C. Integral Equations and Applications; 1991. https://doi.org/10.1017/CBO9780511569395.
Laurens VDM, Hinton G. Visualizing Data using tSNE. J Mach Learn Res. 2008;9(2605):2579–605.
Ester M, Kriegel HP, Sander J, Xiaowei X. A densitybased algorithm for discovering clusters in large spatial databases with noise. KDD. 1996:226–31.
Wang S, Wang S, Zhang S, Fan F, He G. Research on recognition of medical image detection based on neural network. IEEE Access. 2020;8:94947–55.
Shang J, Xiao C, Ma T, Li H, Sun J. GAMENet: graph augmented memory networks for recommending medication combination. AAAI. 2019:1126–33.
Tang Z, et al. Severity assessment of coronavirus disease 2019 (COVID19) using quantitative features from chest CT images. 2020. arXiv:2003:11988.
Mohamadou Y, Halidou A, Kapen PT. A review of mathematical modeling, artificial intelligence and datasets used in the study, prediction and management of COVID19. Appl Intell. 2020;50(11):3913–25.
Wang Z, He Z, Shah M, Zhang T, Fan D, Zhang W. Networkbased multitask learning models for biomarker selection and cancer outcome prediction. Bioinform. 2020;36(6):1814–22.
Liu L, Li H, Hu Z, Shi H, Wang Z, Tang J, Zhang M. Learning hierarchical representations of electronic health records for clinical outcome prediction. AMIA Annu Symp Proc. 2020;2019:597–606.
Wynants L, Calster BV, Bonten MMJ, et al. Prediction models for diagnosis and prognosis of covid19 infection: systematic review and critical appraisal. BMJ (online). 2020;369:m1328.
Molnar C. Interpretable machine learning: a guide for making black box models explainable; 2019.
Ito T, Tsubouchi K, Sakaji H, et al. Contextual sentiment neural network for document sentiment analysis. Data Sci. Eng. 2020;5:180–92.
Gibney HMJ. Analysis of meal patterns with the use of supervised data mining techniques—artificial neural networks and decision trees. Am J Clin Nutr. 2008;88(6):1632–42.
Acknowledgments
This paper is dedicated to those who want to fight COVID19.
Funding
This work was supported by the Scientific Research Foundation for the Returned Overseas Chinese Scholars, State Education Ministry and UKRI’s Global Challenge Research Fund (ES/P011055/1). This work was also supported by the National Key Research and Development Program of China (No. 2020YFB2103402). The funders had no role in study design, data collection, analysis, the writing of the manuscript, or the decision to submit this article for publication.
Author information
Authors and Affiliations
Contributions
C.S. and S.H. conceptualized the idea. S.H., Z.W. and H.L. initialized, conceived and supervised the project. C. S, S.H. and M.S. collected data and implemented the experiments. C. S, S. H, M.S. and Z.W. drafted the manuscript. All authors provided a critical review of the manuscript and approved the final draft for publication. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
The original study was approved by the Ethics Committee of Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology (Yan Li, et al. “An interpretable mortality prediction model for COVID19 patients.” Nature Machine Intelligence). In the current study, the data used is from that study as an online open dataset under an MIT license (https://doi.org/10.5281/zenodo.3758806).
Consent for publication
Not applicable.
Competing interests
No financial competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Sun, C., Hong, S., Song, M. et al. Predicting COVID19 disease progression and patient outcomes based on temporal deep learning. BMC Med Inform Decis Mak 21, 45 (2021). https://doi.org/10.1186/s12911020013599
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12911020013599
Keywords
 COVID19
 Disease progression
 Outcome early prediction
 Irregularly sampled time series
 Timeaware long shortterm memory