 Research
 Open Access
 Published:
Kinematics approach with neural networks for early detection of sepsis (KANNEDS)
BMC Medical Informatics and Decision Making volume 21, Article number: 163 (2021)
Abstract
Background
Sepsis is a severe illness that affects millions of people worldwide, and its early detection is critical for effective treatment outcomes. In recent years, researchers have used models to classify positive patients or identify the probability for sepsis using vital signs and other timeseries variables as input.
Methods
In our study, we analyzed patients’ conditions by their kinematics position, velocity, and acceleration, in a sixdimensional space defined by six vital signs. The patient is affected by the disease after a period if the position gets “near” to a calculated sepsis position in space. We imputed these kinematics features as explanatory variables of long shortterm memory (LSTM), convolutional neural network (CNN) and linear neural network (LNN) and compared the prediction accuracies with only the vital signs as input. The dataset used contained information of approximately 4800 patients, each with 48 hourly registers.
Results
We demonstrated that the kinematics features models had an improved performance compared with vital signs models. The kinematics features model of LSTM achieved the best accuracy, 0.803, which was nine points higher than the vital signs model. Although with lesser accuracies, the kinematics features models of the CNN and LNN showed better performances than vital signs models.
Conclusion
Applying our novel approach for early detection of sepsis using neural networks will prove to be an invaluable, more accurate method than considering only simple vital signs as input variables. We expect that other researchers with similar objectives can use the model presented in this innovative approach to improve their results.
Background
Sepsis is a fatal organic dysfunction caused by a patient’s deregulated response to infection, and septic shock is a subset of sepsis where circulatory and cellular/metabolic dysfunction occurs with a higher risk of mortality [1, 2]. Sepsis is considered as the disease with the highest rate of death from infection [3]. Overall, the occurrence of sepsis and septic shock treated in a hospital is 437 and 270 cases per 100,000 personyears, respectively, with a total mortality rate of 26% [3, 4]. Nevertheless, according to Rhodes et al. [5], similar to most other diseases, early identification and management of the patient in the initial hours significantly improve the results of treatment.
Currently, several hospitals use the sepsis clinical score, called sequential organ failure assessment (SOFA), to define if a patient is diagnosed with sepsis. The score was recommended by the Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis3) [1]. The authors also proposed quickSOFA, or qSOFA, as the rapid score bedside criteria to facilitate the identification of patients with suspected infection and high risk of death [6]. The sepsis3 criteria require monitoring patients’ vital signs (VS), such as heart rate, body temperature, blood pressure, and laboratory test results, among other variables, and calculating a sepsis score based on these values. Therefore, monitoring a patients’ variables is essential for the diagnosis and treatment as well as for medical analyses in studies that improve the medicine [7, 8].
In recent years, several studies have used VS data and other variables to predict sepsis and septic shock. Henry et al. [9] analyzed data from patients in ICUs and developed an algorithm based on survival analysis and supervised learning techniques to predict septic shock.
Manaktala and Claypool [10] proposed a system to detect sepsis and to provide decision support to medical staff during patient monitoring through a mobile app. The authors employed demographic, VS, medications, laboratory results, and nursing notes data to perform the detection and the decision support.
Horng et al. [11] showed—through tokenization algorithms, bigram detection, and text denial, in addition to machine learning—the possibilities of using data sourced from the text of patients’ medical registers, demographic data, and VS to identify patients with suspected infection and sepsis.
Mao et al. [12] developed a sepsis prediction algorithm based on a machine learning technique called gradient tree boosting, using six VS commonly available in ICUs.
Bock et al. [13] presented a new type of shapelet, a technique for identifying subsequences of timeseries data that are statistically most significant for prediction. The authors showed shapelet patterns using VS as indicators of the severity of future sepsis.
Kamaleswaran et al. [14] compared the performances of some machine learning techniques to predict the onset of severe sepsis in children using VS as input.
Mohammed et al. [15] used five VS collected every minute and a support vector machine classifier to detect sepsis approximately 17 h before its onset.
Perng et al. [16] proposed a convolutional neural network (CNN) to predict sepsisrelated mortality. They used 53 selected clinical variables and showed that the accuracy of the CNN model was higher than other machine learning methods and qSOFA.
Li et al. [17], Lin et al. [18], and Lauritsen et al. [19] proposed new model architectures composed especially of CNN and longshort term memory (LSTM) neural networks for predicting sepsis. The authors proved that their approach was more efficient for the early detection.
Lake et al. [20] studied 4096 intervals (25 min) of respiratory rates of neonatal ICU patients using approximate entropy [21] and sample entropy [22]. They verified that entropy values were lower right before the sepsis onset.
Ahmad et al. [23] analyzed the continuous heart rate of patients hospitalized for bone marrow transplantation and observed a 25% reduction in sample entropy before the diagnosis of sepsis.
Drewry et al. [24] proposed a temperature curve analysis that could find an abnormal pattern before sepsis onset, using the maximum, minimum, and variations of the ICU patients' temperature continuous timeseries.
Additionally, Shashikumar et al. [25] analyzed the dynamics of continuous blood pressure and heart rate timeseries to predict sepsis in ICU patients. They found that entropybased measures of the two VS dynamics are independent predictors of sepsis.
Even though other important works have dealt with sepsis prediction in the past decade, we can consider those mentioned above a representative sample of the scientific community’s goals in this regard. These goals are, in most cases, to implement efficient models based on statistics, machine learning, and deep learning, to detect the most precise sepsis onset, to achieve the highest prediction accuracy as early as possible, to investigate the dynamics of measured features (variables) and to use the smallest number of easily collected features.
The above prediction models analyze the behavior of timeseries variables until they classify the patient as positive for sepsis or identify a threshold probability for sepsis as early as possible. Although these studies have achieved commendable results, we propose a novel approach to accomplish some goals of the scientific community. The innovation is that we consider, at each timestamp, a “distance” measurement between the values of the patient’s variables and a proposed sepsis point. We developed a sepsis prediction model that considers not only the patient’s timeseries VS behavior but also their kinematics over time, including position, velocity, and acceleration.
Therefore, we assumed that a patient can be represented by a point moving in an ndimensional space, where n is the number of VS. We defined sepsis point as a single target point in this vital sign space. The position of the sepsis point is estimated from the patients’ VS values at their sepsis onset. Then, at each instant of time, we computed the relative distance between the patient and the sepsis point positions, as well as their relative velocity and acceleration. We named these variables as kinematics features (KF). When the KF were included as input variables of sepsis prediction neural networks (KF model), we verified that accuracies were higher compared to including only VS as input (VS model). In this work, we used a database where the sepsis3 criteria were applied to define the sepsis onset of patients who developed the disease in the ICU.
We expect that other researchers with similar objectives can use this innovative approach to enhance their early detection models. To promote the reproducibility of our work and contribute to its scientific expansion, we published sample data and essential source code in [26].
Methods
Kinematics analysis of patient’s variables
Patient’s position, velocity, and acceleration
In our approach, we represented a patient as a point in an ndimensional space, and the patient’s \(n\) VS values at each instant of time are the coordinates that define the patient’s position vector [27, 28].
Thus, the VS values \({x}_{1 }, {x}_{2 },\dots ,{x}_{n}\) at time \(t\) are the values of the components of the position vector \({\overrightarrow{r}}_{{A}_{i}}\) of a patient \({A}_{i}\) at time \(t\), that is:
where \(n=1, 2,..., N\) VS or clinical variables.
We also calculated the velocity \({\overrightarrow{v}}_{{A}_{i}}\) and acceleration \({\overrightarrow{a}}_{{A}_{i}}\) vectors at each instant of time \(t\) as follows:
Sepsis position, velocity, and acceleration
Defining sepsis and identifying its onset is a challenging task, and the criteria or methodology adopted can vary. There are several criteria available like SIRS criteria [29], sepsis3 criteria [5, 30], Angus methodology [31], and Martin methodology [32], among others. The database adopted in this work uses the sepsis3 criteria to set the sepsis onset; however, the criteria used by the database are not relevant to our approach but just the indication of sepsis onset by some criterion.
Each patient has a sepsis onset with their correspondent values of VS variables. In a simple and immediate configuration, the sepsis position point or just sepsis position would be a unique and static point in the ndimensional space defined by some simple statistical calculation, common for all patients. In a more sophisticated configuration, the sepsis position could be a moving point according to some statistical and/or medical rule of position changing, also applied to all patients as a common sepsis point path. Another issue is that the patients can be grouped by characteristics like gender, age range, comorbidity, among others. Here, the definition of the sepsis position would be made for each group.
In this work, we decided to nongroup the patients and use a statistic median to calculate a unique static sepsis position for all patients, as an initial configuration to test our approach, because our goal is to verify the better accuracy between the VS and KF models.
Therefore, considering that \({t}_{s}\) represents the sepsis onset time for the positive patients, we established that the sepsis position vector for all patients were defined by the median values of each VS on the sepsis onset. That is, \({\overrightarrow{r}}_{{B}_{i}}(t)\)= \({\overrightarrow{r}}_{B}(t) ={\overrightarrow{r}}_{B}\) will be given by
where \(I\) is the number of positive patients.
Once the vector \({\overrightarrow{r}}_{B}\) has been identified, it will be used to determine the relative position of patients to the sepsis position, at each instant, for positive and negative patients.
Movement of patient \({A}_{i}\) relative to sepsis point \(B\)
As the components of \({\overrightarrow{r}}_{B}\) are composed by the median of VS values at \({t}_{s}\), it is not probable that patient \({A}_{i}\) will reach this exact sepsis position. Thus, after some time, a positive patient \({A}_{i}\) reaches some position (given by \({\overrightarrow{r}}_{{A}_{i}}\)) “near” the sepsis position given by \({\overrightarrow{r}}_{B}\), while a negative patient “never” gets “near” this position. There is no special need to calculate in advance “how near” or how the patients’ positions spread around the sepsis position point because the learning process of the prediction model will consider it internally. Therefore, we focused on the relative position vectors \({\overrightarrow{r}}_{{A}_{i}/B}\) of the patients relative to the sepsis position at each instant \(t\):
Figure 1 shows an example of vectors \({\overrightarrow{r}}_{{A}_{i}}\), \({\overrightarrow{r}}_{B}\), and \({\overrightarrow{r}}_{{A}_{i}/B}\), considering only three VS (3D space). The schematic illustrates the movements of a patient (red) that reached a position “near” the sepsis position point and another patient (green) that did not.
The distance between patient position point and sepsis position point at each instant \(t\) is the intensity of position vectors \({\overrightarrow{\mathrm{r}}}_{{\mathrm{A}}_{\mathrm{i}}/\mathrm{B}}\) at each instant \(\mathrm{t}\), \(\left{\overrightarrow{r}}_{{A}_{i}/B}(t)\right\), and it is calculated using the Euclidian Distance formula:
The velocity vector of patient \({A}_{i}\) relative to the sepsis position vector \(B\) is denoted by \({\overrightarrow{v}}_{{A}_{i}/B}\):
Similarly, the acceleration vector is denoted by \({\overrightarrow{a}}_{{A}_{i}/B}\):
Projection of vectors \({\overrightarrow{v}}_{{A}_{i}/B}\) and \({\overrightarrow{a}}_{{A}_{i}/B}\) in direction of the vector \({\overrightarrow{r}}_{{A}_{i}/B}\)
We considered that the relative movement between a patient and the sepsis point must be “rectilinearized,” that is, it must be viewed at each instant as a potential rectilinear movement of the patient directly to the sepsis point. Using the projection of the vectors \({\overrightarrow{v}}_{{A}_{i}/B}\) and \({\overrightarrow{a}}_{{A}_{i}/B}\) in the direction of the vector \({\overrightarrow{r}}_{{A}_{i}/B}\) is a way to express this “tendency” of patient \({A}_{i}\) to move straight through the sepsis point \(B\).
These projections comprise calculating their scalar products with the unit relative position vector \({\overrightarrow{e}}_{i}\), which has the same direction and sense of the relative position vector \({\overrightarrow{r}}_{{A}_{i}/B}\):
The projection of the velocity vector, \({\mathrm{Proj}}_{{\overrightarrow{e}}_{i}(t)}^{{\overrightarrow{v}}_{{A}_{i}/B}(t)},\) is a scalar that can be positive, null, or negative. Its value, at every instant, either maintains, increases, or decreases the distance between the patient \({A}_{i}\) and the sepsis point \(B\). Likewise, the projection of the acceleration vector, \({\mathrm{Proj}}_{{\overrightarrow{e}}_{i}(t)}^{{\overrightarrow{a}}_{{A}_{i}/B}(t)},\) is a scalar value that, at every instant, signifies the changes in the rate that patient \({A}_{i}\) and sepsis point \(B\) are distancing from or approaching each other.
Sepsis early detection and kinematics features
In our study, we propose to calculate the patients’\({\overrightarrow{e}}_{i}\), \({\mathrm{Proj}}_{{\overrightarrow{e}}_{i}(t)}^{{\overrightarrow{v}}_{{A}_{i}/B}(t)}\) and \({\mathrm{Proj}}_{{\overrightarrow{e}}_{i}(t)}^{{\overrightarrow{a}}_{{A}_{i}/B}(t)}\) at each instant, and the hypothesis we seek to verify is whether accuracy is gained by including these KF as the input of neural network predictors/classifiers, instead of using VS alone.
If the KF show that a patient, as an ndimensional space point, is approaching the sepsis position with a certain velocity and acceleration during the monitoring time, it could show more precisely that the patient will be diagnosed with sepsis in the future.
Hence, we consider a general prediction/classification mathematical function:
where \({a}_{j}\) (\(j=1, 2,\dots ,\mathrm{ n}\)) are the input parameters and \({c}_{j}\) (\(j=1, 2, \dots ,\mathrm{ n}\)) are the coefficients to achieve the best accuracy.
Our goal is to compare the resultant of this function using the KF as input parameters with the resultant considering only VS as the input parameters. If we demonstrate that KF increases the accuracy of early detection of sepsis, we will have contributed to our intended objectives.
Data source and preprocessing
As a data source, we used the clinical multivariate timeseries database published in the Early Prediction of Sepsis From Clinical Data: The PhysioNet/Computing in Cardiology Challenge 2019 [33,34,35]. This database has 40 hourly clinical variables, including VS, laboratory tests, and static patient descriptions, collected from United States hospitals—namely Beth Israel Deaconess Medical Center and Emory University Hospital with respective institutional approval. The authors of the database used the sepsis3 clinical criteria [5] to define the sepsis onset and included a variable to show the instant it occurs. Only the positive patients have this sign, thus we relied on it to define a precise sepsis position vector and separate the positive and the negative patients. According to what we posed in the kinematics analysis section, we assumed that the last instant of a positive patient series is the sepsis onset time, therefore all the registers after this time were rejected.
The dataset posted publicly for download has 40,336 patients, divided into test sets A and B, both with the 40 hourly clinical variables. From these clinical variables, we selected six common VS monitored in hospitals (Table 1).
The authors of the database challenged contestants to develop algorithms for the early prediction of sepsis and published the results on the corresponding website [35], but the authors plan to publish a paper about the results as soon as possible [36]. The bestranked result achieved 82.8% accuracy on test set A and 88.8% on test set B using all 40 clinical variables. Because we used only six common VS, it is important to have a point of reference.
Because the authors of the database intentionally preserved the missing data and erroneous values, we preprocessed the selected data before setting the sepsis position and calculating the KF. First, it was necessary to define a minimum number of registers, which we could deal with. Once in this work we considered patients who developed sepsis in the ICU, any number of registers we chose would be compatible with the database used because it is sourced from ICU patients. We chose to eliminate patients with fewer than 36 registers, which correspond to three 12h medical shifts of measured data. Additionally, if a patient had at least one of the six VS with all Not a Number (NaN) values, we eliminated that patient. Figure 2 shows the exclusion flowchart where the number of patients reduced from 40,336 to 15,515.
After the exclusion process, we found VS with absurd values, probably because of sensor reading or typing errors. Here, we set the operating limits for each variable based on [11], and we replaced those values with NaNs. Figure 3 illustrates this step and the subsequent preprocessing steps.
The number of registers varied from patient to patient. Some of them reached hundreds of hours before the instant of sepsis detection whereas others reached sepsis position in a few hours. However, because we intended to use the data as input for neural networks, we established that all the patients should have the same number of registers. Therefore, we arbitrated that the last 48 h (four medical shifts) were "adequate and sufficient" to analyze the data behavior and to determine the risk of sepsis occurrence. For patients who had less than 48 registers, with or without NaNs in their time series, we filled the "previously missing data" with NaNs.
To tackle the NaNs, we used the mean value of each VS of all patients at each hour and proceeded with the imputation of values accordingly. After this process, we linearly interpolated the remaining missing values after verifying that just 1 or 2 h remained with NaN in some VS.
After the imputation, the VS curves showed a ruggedness even bigger. Although the necessity of smoothing the curves was not imperative, we preferred to do this using the moving average of the 3 last values from hour minus 2 to the present hour.
Then, to address the problem of having multiple variable units, we normalized the VS values based on their maximum values. Thus, we fixed the minimum as zero and applied the formula, \({xNorm}_{j}= {x}_{j}/{x}_{j}Max\).
After all these preprocessing procedures, we could calculate the sepsis position components and the KF using the 1521 positive and 13,994 negative patients. With the six VS along with their KF, each patient had 48 registers. Tables 2 and 3 present part of the sample database available in [26], where the six columns in Table 2 are the normalized VS, and the eight columns in Table 3 are the KF, that is, the six components of the unit vector \({\overrightarrow{e}}_{i}\) followed by the scalars \({\mathrm{Proj}}_{{\overrightarrow{e}}_{i}(t)}^{{\overrightarrow{v}}_{{A}_{i}/B}(t)}\) and \({\mathrm{Proj}}_{{\overrightarrow{e}}_{i}(t)}^{{\overrightarrow{a}}_{{A}_{i}/B}(t)}\).
Moreover, we randomly selected 3316 negative patients from the total 13,994 to balance the number of positive and negative patients according to the processing capacity of the technological infrastructure used. Thus, the final database contained one CSV file with 1521 positive and another CSV file with 3316 negative patients. We labeled the positives with 1 and the negatives with 0 for inputting to the prediction models. Table 4 shows unnormalized values of the basic statistics of the final database.
Preprocessing the data is a necessary step in the statistical analysis that can artificially affect the outcome of modeling studies. However, as our approach aims to compare the accuracy of the VS and KF models, all effects from any preprocessing will not compromise the result achieved because they will equally improve or degrade the quality of both models.
Neural networks for early detection of sepsis
In our research, we calculated KF values using six patients’ VS values at each instant of time. Then, we used these KF values as the input of some neural networks (NN) for the early detection of sepsis.
We decided to use LSTM NN to test our hypothesis because of its “remember” or “forget” features (as it receives inputs sequentially along with the training). These features improve the LSTM’s capacity to discover dependencies in timeseries data [37].
LSTM is a variation of a recurrent neural network (RNN) proposed by Hochreiter et al. [38]. The authors solved two known problems of traditional RNNs when dealing with timeseries data: (1) the backward propagation error over time becomes extremely high or extremely low, and (2) the progress of the backpropagated error highly depends on the NN weight values. These can cause the weights to vary abnormally, which can increase the time for the learning process significantly [39].
Therefore, we defined an LSTM NN model with the input formed by the VS, and another LSTM model with KF as input. The output layer is always binary (0–negative for sepsis and 1–positive for sepsis). Then, we crossvalidated the main LSTM parameters to find those that resulted in better accuracy and lower standard deviation (STD). Thus, we arrived at a model with 128 nodes, 20% dropout, a batch size of 64, and a sigmoid output activation function that was compiled with a binary crossentropy and Adagrad optimizer [40].
Even though the LSTM NN is sufficient to show if the KF model had a positive impact on accuracy, we compared its results with other models. Hence, we also used a CNN, a linear NN (LNN), and the nonNN classifiers, logistic regression, and decision tree. All of them have the same input and output as the LSTM NN.
Although CNNs are usually applied to visual recognition and text tasks, some researchers have proposed them to classify timeseries data [41,42,43]. Thus, after using the crossvalidation process to find the best parameters, we created a CNN model with 64 filters and a kernel size of 3 and used a rectified linear unit (ReLU) as the activation function [44]. The CNN was also compiled with binary crossentropy and the Adagrad optimizer.
The LNN and the nonNN classifiers do not have characteristics tailored to deal with timeseries data. Therefore, we used them only as references, keeping their default parameters.
Results
Rather than using traditional prediction models, we applied classification models that could discriminate if a set of timeseries data represents a positive or negative patient for sepsis at a certain number of hours before sepsis onset (HBS). HBS is the number of hours before the patient's last timestamp; therefore, it represents the earliness of the prediction and affects the number of rows in the model input. In our work, we used 6 HBS; thus, we used only the first 42 rows out of the total of 48 rows available. Hence, when we used VS, the input model had 6 columns and 42 rows, and when we used KF, the input model had 8 columns and 42 rows. In both cases, when the model classifies patients as positive or as negative with certain accuracy, it is predicting sepsis six hours before its onset with the accuracy achieved.
Because the model with KF had a higherdimensional input, we had to use some modelselection criteria for a fair comparison of the models. After searching for criteria suitable for neural network models, we used the mean accuracies of kfold crossvalidation and the standard deviation. We used the same sepsis position to calculate the KF for the entire database; thus, the data in each fold of the crossvalidation process had the same reference.
Figure 4 shows the results of the crossvalidation process for each model. Inside the bars are the mean accuracies values with KF or with VS as input, that is, the prediction accuracies at 6 HBS.
As required in our hypothesis, we delineated the differences between the accuracies of the KF and VS models: the LSTM model using KF as input achieved a significant difference of 9 compared with the model using VS as input. The CNN models achieved lower accuracies, but the results were similar because using KF as input increased the accuracy by 6 points. As expected, when we tried to classify the patients using an LNN model, the accuracies were particularly lower than the LSTM model, but there was only a slight difference between the KF and VS results. The logistic regression and decision tree classifiers showed virtually no differences between the KF and VS models; moreover, the decision tree reached the worse accuracies of all.
Table 5 presents a comparative summary of the results of each classifier ordered by KF accuracy. The accuracies shown by the LSTM and CNN classifiers were higher. Unlike the other classifiers, they have characteristics that are essential for classifying the data we had constructed: the KF is timedependent, has an onset (sepsis position), and each register has values that are relative to the onset.
Figure 5 shows the calibration curves of the models. As expected, decision tree models are the less calibrated ones.
Our hypothesis became more distinct with the LSTM and CNN classifiers because the differences between the mean accuracies with and without KF as input (Mean Diff. column) were at least three times higher than the mean differences of the other classifiers. As a result, the other classifiers could not achieve our goal, especially if we consider the STDs.
Discussion
In our approach, we showed a novel way to handle the input parameters of NNbased sepsis classification models to improve their accuracies. Instead of inputting directly the patients’ timeseries variables, as is commonly proposed, we treated them as points with velocity and acceleration relative to a sepsis reference and generated a new type of input: the patients’ timeseries kinematic features.
KF models have proven to be more accurate than the VS models, because of the higher capability of KF to distinguish positive patients from negative patients, as the distance from the sepsis point is considered at every instant and the speed with which the patients reach the sepsis point.
In the approach we posit, any set of timeseries variables can be used as inputs, although in this work, we selected only six VS commonly monitored in hospitals: heart rate (HR), pulse oximetry (O_{2}Sat), temperature (Temp), systolic blood pressure (SBP), diastolic blood pressure (DBP), respiration rate (Resp). Therefore, this work can be easily reproduced and expanded by incorporating data from additional and different databases.
In this work, we also assumed that the last 48 h of VS data were a sufficient window size to prove our hypothesis. However, future works can verify if different window sizes affect the results.
Likewise, the criteria for selecting the VS variables can be improved by fitting probabilistic models applied to their timeseries values or by any new technique for selecting the most statistically significant timeseries variable for prediction, such as the one proposed by [13]. The entropy of the VS dynamics [20, 23,24,25] can also be useful in this matter.
In addition, other missing data imputation, interpolation, and normalization procedures certainly would contribute to the preprocessing stage and overall results. It would also be useful to verify how other options of preprocessing methods affect the models’ accuracies with KF and VS as input.
The proposed kinematics approach can also be applied in realtime vital sign monitoring. Here, the position of the sepsis point must be previously defined using the existing data of positive patients, and each new positive patient datum can be used to recalculate a new sepsis position. In this manner, all KF can be calculated in realtime and imputed in the NN model at every instant.
Although the assumption of a static median for the sepsis position point is a simplified way used as baseline to test our hypothesis, there are different ways of defining the sepsis position point. It can be a moving point set according to some statistical method of changing variable values or according to some clinical rules relative to the variables (Fig. 6). Additionally, it can be set combined with clustering techniques to find a moving sepsis position point for each group of patients, depending on their features, such as previous diseases, gender, age range, ethnic group. Therefore, other ways to define sepsis position point will be an enhancement for this work and further validation of the approach proposed here.
Among the six components of the sepsis position vector, only two of them are included as variables in the qSOFA score, which, together with the SOFA score, is part of the sepsis3 criteria to set the sepsis onset. In this work, the “unnormalized” values of components SBP and Resp achieved 90.91 and 21.65 respectively, which mean the attendance of 2point of the qSOFA score (\(SBP \le 100, Resp \ge 22\)), which is sufficient to define a positive condition for sepsis. However, this is not true all the time because the positiveness for sepsis could come from the third qSOFA score variable. In any case, our approach does not depend on this verification, as different variables can be chosen independently of the sepsis criteria variables.
We identified the KF model of the LSTM classifier achieved the highest accuracy. To compare its performance with other works that used VS as input for early detection of sepsis, we present Table 6 that shows their HBS and area under the receiver operating characteristic (AUROC) values. The KF model reached a similar AUROC but a better HBS value, excluding the “outlier” result achieved by Mohammed et al. [15].
Despite proving our hypothesis using ordinary classifiers, we pondered the possibility of improving the KF model performance using new neural network architectures, based on CNN and LSTM [16,17,18,19], or new methodologies/algorithms [9, 12].
Furthermore, we think that it is essential to compare our results with the results obtained using other databases to reveal any bias, even though it is difficult to locate public databases that identify the disease onset.
Conclusion
In recent years, researchers have proposed various models for predicting patient illnesses, seeking to use fewer variables while maximizing accuracy and speed of prediction. Applying our new and innovative kinematics approach for early detection of sepsis using NN classifiers will prove to be a valuable and more accurate approach than considering only simple VS input variables, showing its significant potential in the development of this scientific knowledge.
Availability of data and materials
The sample data and essential source code are available at http://www.github.com/marciofreire/KANNEDS.
Abbreviations
 CNN:

Convolutional neural network
 HBS:

Hours before onset
 KANNEDS:

Kinematics approach with neural networks for early detection of sepsis
 KF:

Kinematic feature
 LSTM:

Long shortterm memory
 NaN:

Not a Number
 NN:

Neural network
 STD:

Standard deviation
 VS:

Vital sign
References
Singer M, Deutschman CS, Seymour CW, ShankarHari M, Annane D, Bauer M, et al. The Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis3). JAMA. 2016;315(8):801.
Seymour CW, Liu VX, Iwashyna TJ, Brunkhorst FM, Rea TD, Scherag A, et al. Assessment of clinical criteria for sepsis for the third international consensus definitions for sepsis and septic shock (sepsis3). JAMA J Am Med Assoc. 2016;315(8):762–74.
Fleischmann C, Scherag A, Adhikari NKJ, Hartog CS, Tsaganos T, Schlattmann P, et al. Assessment of global incidence and mortality of hospitaltreated sepsis current estimates and limitations. Am J Respir Crit Care Med. 2016;193(3):259–72.
GiamarellosBourboulis EJ, Tsaganos T, Tsangaris I, Lada M, Routsi C, Sinapidis D, et al. Validation of the new Sepsis3 definitions: proposal for improvement in early risk identification. Clin Microbiol Infect. 2017;23(2):104–9.
Rhodes A, Evans LE, Alhazzani W, Levy MM, Antonelli M, Ferrer R, et al. Surviving sepsis campaign: international guidelines for management of sepsis and septic Shock. Crit Care Med. 2016;2017:486–552.
Song JU, Sin CK, Park HK, Shim SR, Lee J. Performance of the quick Sequential (sepsisrelated) Organ Failure Assessment score as a prognostic tool in infected patients outside the intensive care unit: a systematic review and metaanalysis. Crit Care. 2018;22(1):1–13.
Rousselot J, Decotignie JD. Wireless communication systems for continuous multiparameter health monitoring. In: IEEE International conference on ultrawideband. Vancouver: IEEE; 2009. p. 480–4.
Cruz MF, Cavalcante CAMT, Sá Barretto ST. Using OPC and HL7 standards to incorporate an industrial Big Data historian in a health IT environment. J Med Syst. 2018;42(7):11.
Henry KE, Hager DN, Pronovost PJ, Saria S. A targeted realtime early warning score (TREWScore) for septic shock. Sci Transl Med. 2015;7(299):122.
Manaktala S, Claypool SR. Evaluating the impact of a computerized surveillance algorithm and decision support system on sepsis mortality. J Am Med Inform Assoc. 2017;24(1):88–95.
Horng S, Sontag DA, Halpern Y, Jernite Y, Shapiro NI, Nathanson LA. Creating an automated trigger for sepsis clinical decision support at emergency department triage using machine learning. PLoS ONE. 2017;12(4):e0174708.
Mao Q, Jay M, Hoffman JL, Calvert J, Barton C, Shimabukuro D, et al. Multicentre validation of a sepsis prediction algorithm using only vital sign data in the emergency department, general ward and ICU. BMJ Open. 2018;8(1):e017833.
Bock C, Gumbsch T, Moor M, Rieck B, Roqueiro D, Borgwardt K. Association mapping in biomedical time series via statistically significant shapelet mining. Bioinformatics. 2018;34(13):I438–46.
Kamaleswaran R, Akbilgic O, Hallman MA, West AN, Davis RL, Shah SH. Applying artificial intelligence to identify physiomarkers predicting severe sepsis in the PICU. Pediatr Crit Care Med. 2018;19(10):E495503.
Mohammed A, Van Wyk F, Chinthala LK, Khojandi A, Davis RL, Coopersmith CM, et al. Temporal differential expression of physiomarkers predicts sepsis in critically ill adults. Shock. 2020 Sep 28; Publish Ah. Available from https://doi.org/10.1097/SHK.0000000000001670.
Perng JW, Kao IH, Kung CT, Hung SC, Lai YH, Su CM. Mortality prediction of septic patients in the emergency department based on machine learning. J Clin Med. 2019 Nov 7 [cited 2021 Jan 27];8(11):1906. Available from https://www.mdpi.com/20770383/8/11/1906.
Li X, André Ng G, Schlindwein F. Convolutional and recurrent neural networks for early detection of sepsis using hourly physiological data from patients in intensive care unit. In: 2019 Computing in Cardiology Conference (CinC). Computing in Cardiology; 2019.
Lin C, Zhangy Y, Ivy J, Capan M, Arnold R, Huddleston JM, et al. Early diagnosis and prediction of sepsis shock by combining static and dynamic information using convolutionalLSTM. In: Proceedings—2018 IEEE international conference on healthcare informatics, ICHI 2018. Institute of Electrical and Electronics Engineers Inc.; 2018. p. 219–28.
Lauritsen SM, Kalør ME, Kongsgaard EL, Lauritsen KM, Jørgensen MJ, Lange J, et al. Early detection of sepsis utilizing deep learning on electronic health record event sequences. Artif Intell Med. 2020;1(104):101820.
Lake DE, Richman JS, Griffin MP, Moorman JR. Sample entropy analysis of neonatal heart rate variability. Am J Physiol Integr Comp Physiol. 2002;283(3):R789–97.
Pincus SM, Gladstone IM, Ehrenkranz RA. A regularity statistic for medical data analysis. J Clin Monit. 1991 Oct [cited 2021 Jan 27];7(4):335–45. Available from https://doi.org/10.1007/BF01619355.
Richman JS, Moorman JR. Physiological timeseries analysis using approximate entropy and sample entropy. Am J Physiol Circ Physiol. 2000;278(6):H2039–49.
Ahmad S, Ramsay T, Huebsch L, Flanagan S, McDiarmid S, Batkin I, et al. Continuous multiparameter heart rate variability analysis heralds onset of sepsis in adults. PLoS ONE. 2009;4(8):e6642.
Drewry AM, Fuller BM, Bailey TC, Hotchkiss RS. Body temperature patterns as a predictor of hospitalacquired sepsis in afebrile adult intensive care unit patients: a casecontrol study. Vol. 17, Critical Care. 2013 [cited 2021 Jan 27]. Available from http://ccforum.com/content/17/5/R200.
Shashikumar SP, Stanley MD, Sadiq I, Li Q, Holder A, Clifford GD, et al. Early sepsis detection in critical care patients using multiscale blood pressure and heart rate dynamics. J Electrocardiol. 2017;50(6):739–43.
Freire Cruz M, Ono N, Huang M, AltafUlAmin M, Kanaya S, Cavalcante CAMT. Repository of the project KANNEDS—kinematics analysis and neural networks for early detection of sepsis. 2020. Available from http://www.github.com/marciofreire/KANNEDS.
Beer FP, Johnston ER, Mazurek DF, Cornwell PJ, Eisenberg ER. Vector mechanics for engineers: statics and dynamics. 9th ed. New York: McGrawHill; 2009.
Meriam JL, Kraige LG. Engineering mechanicsdynamics. 7th ed. New York: Wiley; 2012.
Bone RC, Balk RA, Cerra FB, Dellinger RP, Fein AM, Knaus WA, et al. Definitions for sepsis and organ failure and guidelines for the use of innovative therapies in sepsis. Chest. 1992;101(6):1644–55.
Levy MM, Evans LE, Rhodes A. The surviving sepsis campaign bundle: 2018 update. Intensive Care Med. 2018;44(6):925–8.
Angus DC, LindeZwirble WT, Lidicker J, Clermont G, Carcillo J, Pinsky MR. Epidemiology of severe sepsis in the United States: analysis of incidence, outcome, and associated costs of care. Crit Care Med. 2001;29(7):1303–10.
Martin GS, Mannino DM, Eaton S, Moss M. The epidemiology of sepsis in the United States from 1979 through 2000. N Engl J Med. 2003 [cited 2021 Jan 16];16:1546–54. Available from www.nejm.org.
Reyna MA, Josef CS, Jeter R, Shashikumar SP, Westover MB, Nemati S, et al. Early prediction of sepsis from clinical data. Crit Care Med. 2020;48(2):210–7.
Goldberger AL, Amaral LAN, Glass L, Hausdorff JM, Ivanov PC, Mark RG, et al. PhysioBank, PhysioToolkit, and PhysioNet. Circulation. 2000;101(23):e215–20.
Reyna M, Josef C, Jeter R, Shashikumar S, Moody B, Westover MB, et al. Early prediction of sepsis from clinical data—the PhysioNet computing in cardiology challenge 2019 (version 1.0.0). 2019. Available from https://physionet.org/content/challenge2019/1.0.0/.
Reyna M, Clifford G. Voting of predictive models for clinical outcomes: consensus of algorithms for the early prediction of sepsis from clinical data and an analysis of the PhysioNet/CinC Challenge 2019. arXiv:2012.11013v1. 2020.
Fagerström J, Bång M, Wilhelms D, Chew MS. LiSep LSTM: a machine learning algorithm for early detection of septic shock. Sci Rep. 2019;9(1):15132.
Hochreiter S, Schmidhuber J. Long shortterm memory. Neural Comput. 1997;9(8):1735–80.
Hochreiter S, Schmidhuber JJ. Bridging long time lags by weight guessing and “Long short term memory.” In: Silva FL, Principe JC, Almeida LB, editors. Spatiotemporal models in biological and artificial systems. IOS Press; 1996. p. 65–72.
Duchi J, Singer Y. Adaptive subgradient methods for online learning and stochastic optimization * Elad Hazan. J Mach Learn Res. 2011;12:2121–59.
Yi S, Ju J, Yoon MK, Choi J. Grouped convolutional neural networks for multivariate time series. arXiv:1703.09938. 2017.
Liu CL, Hsaio WH, Tu YC. Time series classification with multivariate convolutional neural network. IEEE Trans Ind Electron. 2019;66(6):4788–97.
Yazdanbakhsh O, Dick S. Multivariate time series classification using dilated convolutional neural network. In: 36th International Conference on Machine Learning. Long Beach; 2019.
Nair V, Hinton GE. Rectified linear units improve restricted boltzmann machines. In: The 27a International Conference on Machine Learning. Haifa; 2010.
Acknowledgements
We would like to thank Editage (www.editage.com) for English language editing.
Funding
This study was partially supported by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior – Brasil (CAPES) – Finance Code 001 and by the Nara Institute of Science and Technology (NAIST) Data Science Center.
Author information
Authors and Affiliations
Contributions
MFC wrote the first draft of the manuscript with the support of NO, MH, MAUA, SK, and CAMTC. CAMTC and MFC developed the code for preprocessing data and KF calculation with the support of NO. MFC developed the NN code with the support of NO and CAMTC. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Cruz, M.F., Ono, N., Huang, M. et al. Kinematics approach with neural networks for early detection of sepsis (KANNEDS). BMC Med Inform Decis Mak 21, 163 (2021). https://doi.org/10.1186/s12911021015293
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12911021015293
Keywords
 Kinematics
 Neural network
 Sepsis
 Early detection
 Vital sign
 Machine learning