Modelling and identification of characteristic kinematic features preceding freezing of gait with convolutional neural networks and layer-wise relevance propagation

Filtjens, Benjamin; Ginis, Pieter; Nieuwboer, Alice; Afzal, Muhammad Raheel; Spildooren, Joke; Vanrumste, Bart; Slaets, Peter

doi:10.1186/s12911-021-01699-0

Research article
Open access
Published: 07 December 2021

Modelling and identification of characteristic kinematic features preceding freezing of gait with convolutional neural networks and layer-wise relevance propagation

Benjamin Filtjens ORCID: orcid.org/0000-0003-2609-6883^1,2,
Pieter Ginis³,
Alice Nieuwboer³,
Muhammad Raheel Afzal¹,
Joke Spildooren⁴,
Bart Vanrumste² &
…
Peter Slaets¹

BMC Medical Informatics and Decision Making volume 21, Article number: 341 (2021) Cite this article

2228 Accesses
9 Citations
1 Altmetric
Metrics details

Abstract

Background

Although deep neural networks (DNNs) are showing state of the art performance in clinical gait analysis, they are considered to be black-box algorithms. In other words, there is a lack of direct understanding of a DNN’s ability to identify relevant features, hindering clinical acceptance. Interpretability methods have been developed to ameliorate this concern by providing a way to explain DNN predictions.

Methods

This paper proposes the use of an interpretability method to explain DNN decisions for classifying the movement that precedes freezing of gait (FOG), one of the most debilitating symptoms of Parkinson’s disease (PD). The proposed two-stage pipeline consists of (1) a convolutional neural network (CNN) to model the reduction of movement present before a FOG episode, and (2) layer-wise relevance propagation (LRP) to visualize the underlying features that the CNN perceives as important to model the pathology. The CNN was trained with the sagittal plane kinematics from a motion capture dataset of fourteen PD patients with FOG. The robustness of the model predictions and learned features was further assessed on fourteen PD patients without FOG and fourteen age-matched healthy controls.

Results

The CNN proved highly accurate in modelling the movement that precedes FOG, with 86.8% of the strides being correctly identified. However, the CNN model was unable to model the movement for one of the seven patients that froze during the protocol. The LRP interpretability case study shows that (1) the kinematic features perceived as most relevant by the CNN are the reduced peak knee flexion and the fixed ankle dorsiflexion during the swing phase, (2) very little relevance for FOG is observed in the PD patients without FOG and the healthy control subjects, and (3) the poor predictive performance of one subject is attributed to the patient’s unique and severely flexed gait signature.

Conclusions

The proposed pipeline can aid clinicians in explaining DNN decisions in clinical gait analysis and aid machine learning practitioners in assessing the generalization of their models by ensuring that the predictions are based on meaningful kinematic features.

Peer Review reports

Background

Parkinson’s disease (PD) is the second most common neurodegenerative disorder, impacting over 6 million people worldwide [1]. Freezing of gait (FOG) is one of the most debilitating symptoms of PD, given that an estimated 20-60% of falls and fall-related injuries for this group can be directly attributed to this paroxysmal symptom [2, 3]. Moreover, FOG is common in PD, with approximately 70% of Parkinson’s disease patients developing FOG over the duration of the disease [4, 5]. FOG is clinically defined as a “brief, episodic absence or marked reduction of forward progression of the feet despite the intention to walk” [6]. PD patients describe freezing of gait as “the feeling that their feet are glued to the ground” [7]. PD patients with FOG have more anxiety and falls [8,9,10,11,12], and an overall lower quality of life [13]. Freezing episodes are most frequently provoked when traversing small spaces, during turning and gait initiation, and while dual-tasking [14, 15]. However, and especially in gait laboratories, it is common that FOG does not occur, despite providing adequate FOG-provoking conditions [15].

To date, Levodopa is the gold standard intervention for the treatment of PD. Levodopa shows a positive effect on FOG [16], with 95% of PD patients showing FOG to a lesser degree after Levodopa is administered [17]. However, the relationship between FOG and Levodopa remains complex, as Levodopa often only elicits a partial response in the more advanced stages and may even exacerbate FOG [18, 19]. Non-pharmaceutical interventions, such as sensory cueing, have shown to improve gait and reduce the severity of FOG [20,21,22,23,24]. The notion of sensory cueing relates to the provision of spatial (visual) stimuli to regulate stride placement and amplitude, or temporal (auditory or somatosensory) stimuli to regulate stride timing and regenerate gait. PD patients have shown to adapt to cueing, reducing the effectiveness of the intervention over time [25]. Hence, the provision of continuous stimuli carries the risk of habituation, which may also negatively impact patient compliance [26]. Furthermore, it has been suggested that the optimal cue timing is before the onset of a FOG episode, as providing cues during a FOG episode may result in cognitive overload [26, 27].

To facilitate research in on-demand preventive cueing, there is a clear need for an automated approach to objectively predict the onset of FOG [27]. Several studies have attempted to characterize and predict FOG [28,29,30,31], typically by relying on manually extracted features and traditional machine learning techniques. However, the pathophysiology of FOG is complex and characterized by highly variable gait patterns between subjects [32,33,34]. Moreover, FOG is characterized by several apparent gait sub-types: (1) Akinetic FOG, characterized by a complete absence of movement in the lower and upper limbs. (2) Trembling FOG, characterized by an alternating tremble of the legs at a frequency of 3 to 8hz. (3) Shuffling FOG, characterized by small shuffling steps with minimal forward displacement [16]. These characteristics make it challenging to hand-engineer features that generalize across subjects and sub-types. Therefore, there is increasing interest in Deep learning (DL) techniques to model FOG [35,36,37,38,39,40].

Owing to their large parametric space, deep learning techniques can infer relevant features directly from the raw input data, a technique called end-to-end learning [41]. However, the large parametric space has as a downside that deep learning models are considered to be black-box algorithms, i.e. there is a lack of direct understanding of the models’ ability to identify relevant features [42]. For FOG prediction, where an intervention to alleviate FOG may be triggered before an episode has visually occurred, it will be especially challenging to motivate the provision of the stimuli. This phenomenon hampers further insight into the complex characteristics that define FOG. Therefore, clinical applications tend to avoid deep learning techniques and use simpler and more interpretable techniques [43].

Interpretability methods have been developed to ameliorate this concern by providing a way to explain the predictions of black-box deep neural networks (DNN). The essential idea behind these methods is to decompose the predicted probability of a specific target into a set of attribution values, sometimes also termed “relevance scores”, to each input sample of the network [44]. The present study goes further than deep learning-based FOG prediction by presenting a two-stage pipeline consisting of: (1) a convolutional neural network (CNN) to model the characteristic kinematic features that differentiate gait cycles that directly precede FOG from their functional counterparts, and (2) layer-wise relevance propagation (LRP) [45] to interpret the trained model and visualize the features that the model perceives as important to the classification problem. LRP is a recently developed gradient-based attribution technique, that has been previously employed to explain DNN predictions in MRI-based Alzheimer’s disease classification [46], EEG classification [47], and to explain the unique characteristics of individual gait patterns [48]. To the best of our knowledge, this is the first study that applies an interpretability method in clinical gait analysis in general and FOG prediction in particular. The proposed pipeline aims to aid clinicians in explaining DNN decisions, and aid machine learning practitioners in assessing the generalization of their DNN models.

Methods

Subject characteristics

An existing dataset [49] of twenty-eight patients diagnosed with PD and in Hoehn & Yahr stage II or III while on medication, and fourteen healthy age-matched controls was used. The PD diagnosis was established by a movement disorders neurologist. Patients were further classified as PD with FOG, from now on called “freezers”, by the New Freezing of Gait Questionnaire [50], when they reported that they had experienced FOG in the past month after showing them a video of different types of freezing episodes, including very mild ones (NFOGQ $\ge$ 1). Patients without FOG, called “non-freezers”, reported not to have had such episodes over this period (NFOGQ = 0). Freezers who did not freeze during the actual experiments are indicated as “NoLab-freezers”. The study was approved by the local ethics committee of the University Hospital Leuven and all subjects gave written informed consent. The clinical characteristics of the three subject groups are presented in Table 1.

Table 1 Subject characteristics of the fourteen healthy controls (controls), fourteen PD patients without FOG (non-freezers), and fourteen PD patients with FOG (freezers) in terms of mean ± SD as measured during the ON-phase of the medication cycle

Full size table

Table 2 Visual overview of the nested leave one subject out cross validation

Full size table

Procedure

Gait analysis was performed using an eight-camera Vicon 3D motion analysis system recording at a sampling frequency of 100Hz (Fig. 1: Phase 1). Thirty-four retro-reflective markers were placed on anatomical landmarks according to the full-body plug-in-gait model [51, 52]. Two retro-reflective markers placed .5 m from each other indicated where subjects either had to (1) walk straight ahead, (2) turn 180$^{\circ }$left, (3) turn 180$^{\circ }$right, (4) turn 360$^{\circ }$left, or (5) turn 360$^{\circ }$right. The five experimental conditions were offered randomly and performed with or without a verbal cognitive dual-task, namely the color classification task [53, 54]. All experiments were done during the off-state of the subjects’ medication cycle (after an overnight withdrawal of their normal medication intake), except for clinical testing which was conducted ON-medication [49].

Two researchers, blinded for NFOG-Q score, visually detected all FOG episodes. The onset of FOG, defined as the start of delayed knee flexion, was detected by visual inspection of the knee-angle data (flexion-extension) in combination with the Vicon 3D images. Termination of FOG was determined at the time point when at least two consecutive movement cycles were regained [55].

The last complete gait cycle before the onset of the freezing episode was chosen as the positive class. To obtain representative data for functional gait, each pre-FOG cycle was matched with one functional gait cycle (FGC) of the same subject (if possible) and one FGC of one of the seven “NoLab-freezers” that did not freeze during the experiments. For the pre-FOG and FGC matching, preference was given to functional strides that occurred during the same experimental protocol and within the same section of the turning radius that was utilized to elicit the FOG episode. This matching protocol was not always possible if, for example, a patient was unable to complete a certain experiment without freezing. To preserve class imbalance if no identical matching FGC could be found, the mismatched FGCs were left included in the training dataset, however, the mismatched pairs were excluded during the attribution analysis. This protocol allowed us to control for class imbalance while ensuring that the variability of all fourteen freezers remained present in the dataset. To prevent human bias and error, our data-driven model [35] was used to automatically extract the gait cycles. It should be noted that the gait cycles termed as “functional gait” were extracted from all fourteen freezers. These functional gait cycles thus included highly impaired movement and it cannot be ensured that had the experiment continued would not have amounted to a freezing episode. However, this more conservative protocol allows the network to model the characteristic movement that precedes FOG, rather than general movement that differentiates freezers from non-freezers.

Data preprocessing and problem formulation

The balanced dataset of pre-FOG and functional gait cycles $[\{X_1, Y_1\}, \{X_2, Y_2\}, \dots , \{X_M, Y_M\}]$ is a collection of M pairs $\{X_i, Y_i\}$, where each gait cycle $X_i$ is a collection of joint trajectories and $Y_i$ its respective label. Each gait cycle was low-pass filtered with a cut-off frequency of 7 Hz [56] using a forward-backward fourth-order butter-worth filter and was resampled to 101 samples such that each sample corresponds to one percent of the gait cycle. Each input signal $X_i = x_{1}, x_{2}, \dots , x_{101}$ thus consists of 101 real-valued joint trajectories, where the joint trajectories $x \in \mathbb {R}^{1\times 3}$ are composed of the sagittal plane kinematics $x_h, x_k, x_a$, respectively the hip, knee, and ankle components. To ensure an equal contribution of all joint trajectories [57], each joint trajectory $x_i$ was individually re-scaled to a range of $[-1,1]$. $Y \in \mathbb {R}^{M\times 2}$ is the one-hot encoded label vector, where each element $Y_i \in \{0,1\}$, is equal to 1 if the gait cycle $X_i$ is preceding a FOG episode and 0 if it is a functional gait cycle. The goal of a deep learning model is to classify the multivariate input signal $X_i \in \mathbb {R}^{101\times 3}$ into its corresponding label $Y_i$ (Fig. 1: Phase 2).

Model definition

Deep Neural Networks (DNNs), such as Convolutional Neural Networks (CNNs), have shown state of the art results in time series classification [58]. A CNN [59] consists of altering convolutional and pooling layers and comprises three phases. In the first phase, the input signal is convolved in a convolutional layer with a set of filters, where each filter is defined by a weight matrix W and bias b. These convolutions consist of element-wise multiplications and summations of the input signal and have an interesting property called parameter sharing, i.e. the same convolution (filter values W and b) is used for all time samples of the input signal [60]. This property enables a CNN to learn features that are invariant across the time dimension [58]. In the second phase, the output of the convolution is passed through a non-linear activation function. In the third phase, the non-linearity is followed by a local pooling layer to reduce the dimensionality of the convolutional layer output.

The result is a p-dimensional feature vector, where p is equal to the number of filters. The feature vector is fed into a global average pooling layer [61], which drastically reduces the number of parameters compared to a traditional summation. The pooled features are then transformed to predictions over the output label through a softmax activation function. To improve regularization, dropout [62] along with max-norm regularization, and a sigmoidal decaying learning rate was used.

During training, the weights are optimized to minimize the error between the model prediction $\hat{Y}_{i}$ and the observed data $Y_{i}$, defined as the loss function. To account for class imbalance, a weighted categorical cross-entropy loss was used [60]:

$$\begin{aligned} L(Y_{i}, \hat{Y}_{i}) = -\sum _{i} \alpha _{i} Y_{i}log\hat{Y}_{i}, \end{aligned}$$

(1)

where L is the loss and $\alpha _{i}$ the weighing factor of class i.

As a simple baseline, a support vector machine (SVM) [63, 64] with a linear kernel was implemented. For the simple baseline, the Linear Support Vector Classifier (LinearSVC) of the scikit-learn toolbox [65] was used with a regularization parameter C of 0.01.

Model selection

To find a good set of hyperparameters, a recently proposed Bayesian optimization algorithm was used [66]. For a complete overview of the optimized hyperparameter space, the reader is referred to Table 1 in Additional file 1: Table S1. Model selection and training was done by following a nested cross validation approach, with training and validation folds split by subject, as formalized in Table 2. To assess generalization of the model to a different cohort of subjects, a pre-trained model on the fourteen freezers was used to predict the gait cycles of the fourteen non-freezers and fourteen healthy control subjects. Since the dataset consists out of balanced pre-FOG and functional pairs for the PD patients with FOG that froze during the experiments and solely functional cycles for the NoLab-freezers that did not freeze during the experiments, the results were summarized in terms of accuracy:

$$\begin{aligned} Accuracy = \frac{Number \, of \, correct \, predictions}{Number \, of \, all \, predictions} \, \% \end{aligned}$$

(2)

For the fourteen freezers, the models’ predictions were additionally summarized with the positive and negative predictive values (PPV and NPV), the sensitivity, and the specificity, defined as:

$$\begin{aligned} PPV= & {} \frac{Number \, of \, true \, positives}{Number \, of \, true \, positives + \, false \, positives} \, \% \end{aligned}$$

(3)

$$\begin{aligned} NPV= & {} \frac{Number \, of \, true \, negatives}{Number \, of \, true \, negatives + \, false \, negatives} \, \% \end{aligned}$$

(4)

$$\begin{aligned} Sensitivity= & {} \frac{Number \, of \, true \, positives}{Number \, of \, true \, positives + \, false \, negatives} \, \% \end{aligned}$$

(5)

$$\begin{aligned} Specificity= & {} \frac{Number \, of \, true \, negatives}{Number \, of \, true \, negatives + \, false \, positives} \, \% \end{aligned}$$

(6)

To determine if the differences in predictive performance between the two evaluated methods are statistically significant, a McNemar’s test was performed [67]. The McNemar’s test, sometimes also called a “within-subjects chi-squared test”, is a non-parametric statistical test for paired nominal data that can be used to compare the performance of two classifiers [68]. McNemar’s test evaluates the null hypothesis that there is no difference in the classification performance of the two methods. For the statistical evaluations, the significance level was set to 95%, which means that the differences are considered statistically significant if the calculated p-values are lower than 0.05.

Model interpretation

Layer-wise Relevance Propagation (LRP) [45] was used to improve transparency and provide insight into the predictions of the DL model (Fig. 1: Phase 3). LRP is a commonly used attribution technique that decomposes the prediction of a particular output $Y_i$, computed over a gait cycle $X_i$, down to relevance scores of each input sample. Formally, LRP computes the relevance by back-propagating over the following equation:

$$\begin{aligned} R_{i}^{(l)} = \sum _{j} \frac{z_{ij}}{\sum _{i'} z_{i'j}} R_{j}^{(l+1)} \quad \text {with}\quad z_{ij} = x_{i}^{(l)} w_{ij}^{(l,l+1)}, \end{aligned}$$

(7)

where $R_{i}^{(l)}$ is the relevance of unit i of layer l. This decomposition results in a relevance map (heatmap) $\sum _{x} R_{x}^{(1)}$, which demonstrates the importance of each input sample $x_i$ to the prediction of the output. This study uses the epsilon variant of LRP ($\epsilon$-LRP), as implemented in [44]:

$$\begin{aligned} R_{i}^{(l)} = \sum _{j} \frac{z_{ij}}{\sum _{i'} z_{i'j} + \epsilon \, sign(\sum _{i' z_{i'j}})} R_{j}^{(l+1)}, \end{aligned}$$

(8)

where the term $\epsilon$ is added to the denominator of Equation 10 to avoid numerical instabilities. For a theoretical deduction of LRP the reader is referred to [69], where the authors show how LRP can be theoretically justified as a deep Taylor decomposition.

Results

Freezing proved difficult to elicit in front of the cameras. FOG was provoked for ten of the fourteen freezers during the test period, but only seven patients froze in visibility of the cameras. Most freezing episodes occurred during directional change, i.e. after initiating the 180 or 360-degree turn. Subject 1 froze eighteen times, subject 2 thirteen times, subject 3 seven times, subject 4 three times, subject 5 five times, subject 6 nine times, and subject 7 froze once, amounting to a total of fifty-six freezing episodes. The CNN model and the SVM baseline showed excellent classification accuracy. For the fourteen PD patients with FOG, both models achieved comparable accuracy (p = 0.56), with an accuracy of 86.8% and 85.9% by the CNN and SVM, respectively. Interestingly, an analysis of the false detection shows that the lower sensitivity of the CNN is attributed to subject five, for whom all strides were falsely predicted as FGC. Furthermore, most false FGC detections of both models are attributed to subject thirteen and fourteen, two of the three patients that froze during the test period, but not in front of the cameras. For the PD patients without FOG and healthy control subjects, a total of 2421 and 2258 strides were extracted, respectively. For these subjects, the CNN proved the most robust (p = 2.40e-07), with only 26 strides falsely classified for the PD patients without FOG and only a single stride falsely classified for the healthy control subjects. All the results are summarized in Table 3.

Table 3 Results of the convolutional neural network (CNN) and support vector machine with linear kernel (LSVC)

Full size table

Mean attribution plots were obtained for six of the seven freezers who experienced FOG during the protocol (Fig. 2a), with the excluded subject for which the model did not perform well (subject five) discussed separately (Fig. 2b), and the fourteen non-freezers and fourteen healthy control subjects (Fig. 2c). The attribution plots visualize the gait characteristics that were the most relevant to the prediction. The mean and standard deviation of the time normalized and re-scaled hip, knee, and ankle joint trajectories in the sagittal plane are plotted and colorized with the relevance map (heatmap) $\sum _{x} R_{x}^{(1)}$ from $\epsilon$-LRP. Positive relevance (red) indicates contribution to FOG, while negative relevance (blue) indicates contribution to FGC.

The attribution analysis of the freezers (Fig. 2a) indicates that the most relevant kinematic features that characterize the movement preceding FOG are the fixed knee extension during the stance phase, reduced peak knee flexion during the swing phase, and fixed ankle dorsiflexion during the swing phase. For FGC, the most relevant features are the peak hip extension and peak knee flexion during the swing phase.

An attribution plot of subject five (Fig. 2b) was created to assess if the heatmaps can uncover an explanation for the poor predictive performance on this subject. Subject five contributed 5 pre-FOG and FGC pairs, with the model classifying all strides as FGC. The lower extremity kinematics indicate that this subject has a severely stooped posture, characterized by large hip and knee flexion. The attribution analysis highlights a near-complete absence of features with a positive contribution to pre-FOG. Additionally, the analysis highlights that the large hip and knee flexion apparent during both pre-FOG and FGC are features that contribute to FGC, indicating that the gait characteristics that uniquely describe this subject are utilized to wrongly classify pre-FOG as FGC.

The attribution analysis of the non-freezers and healthy controls (Fig. 2c) indicates a near complete absence of features with a positive contribution to FOG. The most relevant features to classify FGC for this cohort of subjects are the peak hip and knee flexion during the swing phase.

Discussion

To tackle the problem of explainable freezing of gait (FOG) prediction, this paper proposed a two-stage pipeline of: (1) a convolutional neural network (CNN) to model the dramatic reduction of movement present before a FOG episode, and (2) layer-wise relevance propagation (LRP) to visualize the underlying features that the CNN perceives as important to model the pathology. The CNN was trained end-to-end on a dataset that consists of fourteen PD patients with FOG. The patients were instructed to complete a FOG provoking protocol of 180 and 360-degree turning, with or without a verbal cognitive dual-task. FOG proved difficult to elicit, with a total of 56 FOG episodes provoked to train the models. This phenomenon is not uncommon, with previous literature also reporting low numbers of freezing episodes occurring in experimental situations, pointing to the unpredictability of FOG [70]. Based on these 56 episodes, a training dataset was created which consists of the time normalized gait cycles directly preceding FOG, each matched with one functional gait cycle (FGC) of the same subject and one FGC of one of the seven NoLab-freezers that did not freeze during the experiments. Despite the relatively low amount of FOG and FGC matched pairs in the training dataset, this study confirms that the dramatic reduction of movement present before freezing can be accurately modelled with DL. After training the CNN to separate movement preceding FOG from normal functional gait, heatmaps were created with LRP. These heatmaps provide insight into the model predictions by quantifying the contribution of each joint trajectory at a certain percentage of the gait cycle to the classification prediction.

From a machine learning perspective, direct comparisons with other studies that researched the motor patterns that precede FOG is challenging because of different underlying study designs. For example, in [29, 71], and [72] the authors extracted time domain and frequency domain features from inertial sensors. Next, the extracted features were used to train a linear discriminant analysis classifier [29], ensemble classifiers [71], or a SVM [72]. In [29] the authors additionally quantitatively assessed the statistical significance of the extracted features. In contrast, DNNs extract features automatically from the raw input signal. To identify whether these features are based on noise or on meaningful kinematic patterns, a qualitative assessment is performed by using heatmap-based attribution methods. To the best of our knowledge, no studies have either: (1) trained a DNN on MoCap-based kinematic data to model the movement that precedes FOG, or (2) used an attribution method to gain insight into a DNNs ability to identify meaningful kinematic patterns that precede FOG.

From a clinical perspective, in [73] the authors found that prior to freezing subjects had severely decreased range of motion in the sagittal plane joint trajectories (with the reduction in the range of motion varying between 31% and 61.5%) of the hip, knee, and ankle. In the interpretability case study, the heatmaps indicated that the CNN model also identified the reduced range of motion as a relevant feature to model the movement preceding FOG. This finding supports the notion that DNN decisions are based on meaningful features. For one of the seven freezers, the CNN was unable to model the movement preceding FOG. The heatmaps indicated that the stooped posture, characterized by a dramatic increase in knee and hip flexion, were the features that the CNN model used to wrongfully classify FOG as FGC. This finding supports the notion that heatmap-based visualizations can aid in uncovering an indication of which features a DNN wrongfully associates with the underlying pathology and thereby allow machine learning practitioners to assess the generalization of their models. Interestingly, the heatmaps also suggest that FOG affects the stance limb to a sufficient degree to influence the prediction, with the fixed knee extension during the stance phase seen as a relevant feature. In [73] the authors only considered FOG events that occurred without directional change. Therefore, future quantitative research should assess whether the stance limb influencing the model predictions is due to the different underlying study designs and thus based on a meaningful kinematic pattern or is the result of noise picked up by the model.

This study also has important limitations. Firstly, the interpretability case study uses a heatmap-based visualization of the learned features. The main limitation of heatmap-based visualizations is the lack of ground-truth, which means that the visualizations can solely be qualitatively assessed [46]. Secondly, the interpretability case study applied to FOG prediction is a proof-of-concept and further research is needed to assess generalization to other use-cases in gait analysis. Thirdly, from a modelling perspective, it should be noted that the threshold model of FOG [74] states that freezing is characterized by continuous degradation of the movement pattern until a threshold is reached and the FOG episode occurs. In this study, the movement preceding FOG is modelled based on the kinematics of a single gait cycle. Therefore, better predictive performance may be achieved by modelling the movement preceding FOG as a sequence of gait cycles, rather than treating each gait cycle as conditionally independent. However, a larger pool of participants with a more varied FOG-provoking protocol will be required to verify this hypothesis. Lastly, the small cohort of PD patients with FOG in this study may not be representative of all freezers, making the conclusions here generalizable to only a small subset of PD patients with FOG.

Conclusions

Due to the black-box nature of deep learning, clinical gait analysis applications tend to avoid DNNs and retreat to simpler and more interpretable techniques. Using the use-case of FOG prediction, this paper proposed a two-stage pipeline of: (1) a CNN to model the dramatic reduction of movement present before FOG, and (2) LRP to visualize and interpret the underlying features that the CNN perceives as important to the respective classification. The proposed methodology shows that CNNs are capable of modelling the dramatic reduction of movement present before FOG. More importantly, this paper confirms the notion that model interpretation is a powerful tool that allows detailed insight into the complex intertwining between DNN predictions and FOG.

In conclusion, it can be established that the benefit of the proposed interpretability pipeline is two-fold: (1) it can assist expert clinical opinion in explaining DNN predictions by visualizing the kinematic features that the model has learned, and (2) it can aid machine learning practitioners in assessing the generalization of their models by ensuring that the predictions are based on meaningful kinematic features. Future work is now possible in which the proposed pipeline can be used as an automated and objective approach to trigger preventive interventions, i.e. the provision of external stimuli, for FOG. In such work, the interpretations will allow: (1) the clinician to motivate the provision of external stimuli, and (2) a detailed assessment of the efficacy of the intervention by visualizing whether the strides following the intervention show reduced relevance for FOG.

Availability of data and materials

The input set was imported and labelled using Python version 2.7.12 with Biomechanical Toolkit (btk) version 0.3 [75]. Deep learning models were trained on an NVIDIA Tesla K80 GPU using Python version 3.6.8 and Tensorflow version 1.14 [76]. Hyperparameters were optimized using the Hyperopt python library [77], with the cross-validation splits and SVM implemented with scikit-learn version 0.21.3 [65]. Relevance scores were computed with e-LRP as implemented in the DeepExplain framework version 0.3 [44] and visualized with the bipolar colormap [78]. Plotting scripts were modified from spm1d [79].Statistical analysis was conducted using the mlxtend python library [80]. Due to privacy concerns, the dataset analysed during the current study is not publicly available.

Abbreviations

FOG:: Freezing of gait
FGC:: Functional gait cycle
PD:: Parkinson’s disease
DL:: Deep learning
DNN:: Deep neural network
CNN:: Convolutional neural network
LRP:: Layer-wise relevance propagation
H[MYAMP:: Y] Hoehn and Yahr
NFOQ-Q:: New Freezing of Gait Questionnaire
SVM:: Support vector machine
LinearSVC:: Linear Support Vector Classifier
BTK:: Biomechanical Toolkit

References

GBD 2016 Parkinson’s Disease Collaborators. Global, regional, and national burden of parkinson’s disease, 1990–2016: a systematic analysis for the global burden of disease study 2016. Lancet Neurol. 2018;17(11):939–53.
Rudzińska M, Bukowczan S, Stożek J, Zajdel K, Mirek E, Chwala W, Wójcik-Pedziwiatr M, Banaszkiewicz K, Szczudlik A. Causes and consequences of falls in Parkinson disease patients in a prospective study. Neurol Neurochir Pol. 2013;47(5):423–30.
Article PubMed Google Scholar
Pelicioni PHS, Menant JC, Latt MD, Lord SR. Falls in Parkinson’s disease subtypes: risk factors, locations and circumstances. Int J Environ Res Public Health. 2019;16(12):66.
Article Google Scholar
Perez-Lloret S, Negre-Pages L, Damier P, Delval A, Derkinderen P, Destée A, Meissner WG, Schelosky L, Tison F, Rascol O. Prevalence, determinants, and effect on quality of life of freezing of gait in Parkinson disease. JAMA Neurol. 2014;71(7):884–90.
Article PubMed Google Scholar
Hely MA, Reid WGJ, Adena MA, Halliday GM, Morris JGL. The Sydney multicenter study of Parkinson’s disease: the inevitability of dementia at 20 years. Mov Disord. 2008;23(6):837–44.
Article PubMed Google Scholar
Nutt JG, Bloem BR, Giladi N, Hallett M, Horak FB, Nieuwboer A. Freezing of gait: moving forward on a mysterious clinical phenomenon. Lancet Neurol. 2011;10(8):734–44.
Article PubMed PubMed Central Google Scholar
Snijders AH, Nijkrake MJ, Bakker M, Munneke M, Wind C, Bloem BR. Clinimetrics of freezing of gait. Mov Disord. 2008;23(Suppl 2):468–74.
Article Google Scholar
Fahn S. The freezing phenomenon in Parkinsonism. Adv Neurol. 1995;67:53–63.
CAS PubMed Google Scholar
Bloem BR, Hausdorff JM, Visser JE, Giladi N. Falls and freezing of gait in Parkinson’s disease: a review of two interconnected, episodic phenomena. Mov Disord. 2004;19(8):871–84.
Article PubMed Google Scholar
Grimbergen YAM, Munneke M, Bloem BR. Falls in Parkinson’s disease. Curr Opin Neurol. 2004;17(4):405–15.
Article PubMed Google Scholar
Gray P, Hildebrand K. Fall risk factors in Parkinson’s disease. J Neurosci Nurs. 2000;32(4):222–8.
Article CAS PubMed Google Scholar
Giladi N, Hausdorff JM. The role of mental function in the pathogenesis of freezing of gait in Parkinson’s disease. J Neurol Sci. 2006;248(1–2):173–6.
Article PubMed Google Scholar
Moore O, Kreitler S, Ehrenfeld M, Giladi N. Quality of life and gender identity in Parkinson’s disease. J Neural Transm. 2005;112(11):1511–22.
Article CAS PubMed Google Scholar
Nonnekes J, Snijders AH, Nutt JG, Deuschl G, Giladi N, Bloem BR. Freezing of gait: a practical approach to management. Lancet Neurol. 2015;14(7):768–78.
Article PubMed Google Scholar
Okuma Y. Practical approach to freezing of gait in Parkinson’s disease. Pract Neurol. 2014;14(4):222–30.
Article PubMed Google Scholar
Schaafsma JD, Balash Y, Gurevich T, Bartels AL, Hausdorff JM, Giladi N. Characterization of freezing of gait subtypes and the response of each to levodopa in Parkinson’s disease. Eur J Neurol. 2003;10(4):391–8.
Article CAS PubMed Google Scholar
Fietzek UM, Zwosta J, Schroeteler FE, Ziegler K, Ceballos-Baumann AO. Levodopa changes the severity of freezing in Parkinson’s disease. Parkin Relat Disord. 2013;19(10):894–6.
Article Google Scholar
Lucas McKay J, Goldstein FC, Sommerfeld B, Bernhard D, Perez Parra S, Factor SA. Freezing of gait can persist after an acute levodopa challenge in Parkinson’s disease. NPJ Parkin Dis. 2019;5:25.
Article CAS Google Scholar
Espay AJ, Fasano A, van Nuenen BFL, Payne MM, Snijders AH, Bloem BR. “On” state freezing of gait in Parkinson disease: a paradoxical levodopa-induced complication. Neurology. 2012;78(7):454–7.
Article CAS PubMed PubMed Central Google Scholar
Lim I, van Wegen E, de Goede C, Deutekom M, Nieuwboer A, Willems A, Jones D, Rochester L, Kwakkel G. Effects of external rhythmical cueing on gait in patients with Parkinson’s disease: a systematic review. Clin Rehabil. 2005;19(7):695–713.
Article CAS PubMed Google Scholar
Nieuwboer A, Kwakkel G, Rochester L, Jones D, van Wegen E, Willems AM, Chavret F, Hetherington V, Baker K, Lim I. Cueing training in the home improves gait-related mobility in Parkinson’s disease: the RESCUE trial. J Neurol Neurosurg Psychiatry. 2007;78(2):134–40.
Article CAS PubMed Google Scholar
Rubinstein TC, Giladi N, Hausdorff JM. The power of cueing to circumvent dopamine deficits: a review of physical therapy treatment of gait disturbances in Parkinson’s disease. Mov Disord. 2002;17(6):1148–60.
Article PubMed Google Scholar
Arias P, Cudeiro J. Effect of rhythmic auditory stimulation on gait in parkinsonian patients with and without freezing of gait. PLoS ONE. 2010;5(3):9675.
Article Google Scholar
Cosentino C, Baccini M, Putzolu M, Ristori D, Avanzino L, Pelosin E. Effectiveness of physiotherapy on freezing of gait in Parkinson’s disease: a systematic review and Meta-Analyses. Mov Disord. 2020;35(4):523–36.
Article PubMed Google Scholar
Ginis P, Nackaerts E, Nieuwboer A, Heremans E. Cueing for people with Parkinson’s disease with freezing of gait: a narrative review of the state-of-the-art and novel perspectives. Med Ann Phys Rehabil. 2017;6:66.
Google Scholar
Ginis P, Heremans E, Ferrari A, Bekkers EMJ, Canning CG, Nieuwboer A. External input for gait in people with Parkinson’s disease with and without freezing of gait: one size does not fit all. J Neurol. 2017;264(7):1488–96.
Article PubMed Google Scholar
Mancini M, Bloem BR, Horak FB, Lewis SJG, Nieuwboer A, Nonnekes J. Clinical and methodological challenges for assessing freezing of gait: future perspectives. Mov Disord. 2019;34(6):783–90.
Article PubMed PubMed Central Google Scholar
Naghavi N, Wade E. Prediction of freezing of gait in Parkinson’s disease using statistical inference and Lower–Limb acceleration data. IEEE Trans Neural Syst Rehabil Eng. 2019;27(5):947–55.
Article PubMed Google Scholar
Palmerini L, Rocchi L, Mazilu S, Gazit E, Hausdorff JM, Chiari L. Identification of characteristic motor patterns preceding freezing of gait in Parkinson’s disease using wearable sensors. Front Neurol. 2017;8:394.
Article PubMed PubMed Central Google Scholar
Mazilu S, Calatroni A, Gazit E, Roggen D, Hausdorff JM, Tröster G. Feature learning for detection and prediction of freezing of gait in Parkinson’s disease. In: Perner P, editor. Machine learning and data mining in pattern recognition. Berlin: Springer; 2013. p. 144–58.
Chapter Google Scholar
Demrozi F, Bacchin R, Tamburin S, Cristani M, Pravadelli G. Towards a wearable system for predicting the freezing of gait in people affected by Parkinson’s disease. IEEE J Biomed Health Inform. 2019;6:66.
Google Scholar
Hausdorff JM, Schaafsma JD, Balash Y, Bartels AL, Gurevich T, Giladi N. Impaired regulation of stride variability in Parkinson’s disease subjects with freezing of gait. Exp Brain Res. 2003;149(2):187–94.
Article CAS PubMed Google Scholar
Chee R, Murphy A, Danoudis M, Georgiou-Karistianis N, Iansek R. Gait freezing in Parkinson’s disease and the stride length sequence effect interaction. Brain. 2009;132(Pt 8):2151–60.
Article PubMed Google Scholar
Plotnik M, Giladi N, Hausdorff JM. Bilateral coordination of walking and freezing of gait in Parkinson’s disease. Eur J Neurosci. 2008;27(8):1999–2006.
Article PubMed Google Scholar
Filtjens B, Nieuwboer A, D’cruz N, Spildooren J, Slaets P, Vanrumste B. A data-driven approach for detecting gait events during turning in people with Parkinson’s disease and freezing of gait. Gait Post. 2020;80:130–6.
Filtjens B, Ginis P, Nieuwboer A, Slaets P, Vanrumste B, Automated freezing of gait assessment with marker-based motion capture and multi-stage graph convolutional neural networks approaches expert-level detection. arXiv e-prints. 2021;2103–15449. arXiv:2103.15449
Hu K, Wang Z, Mei S, Ehgoetz Martens KA, Yao T, Lewis SJG, Feng DD. Vision-based freezing of gait detection with anatomic directed graph representation. IEEE J Biomed Health Inform. 2020;24(4):1215–25.
Article PubMed Google Scholar
Masiala S, Huijbers W, Atzmueller M, Feature-Set-Engineering for detecting freezing of gait in parkinson’s disease using deep recurrent neural networks. pre-print. 2019. arXiv:1909.03428
Camps J, Samà A, Martín M, Rodríguez-Martín D, Pérez-López C, Alcaine S, Mestre B, Prats A, Crespo MC, Cabestany J, Bayés À, Català A. Deep learning for detecting freezing of gait episodes in Parkinson’s disease based on accelerometers. In: Advances in computational intelligence. Springer; 2017. pp. 344–55.
Sigcha L, Costa N, Pavón I, Costa S, Arezes P, López JM, De Arcas G. Deep learning approaches for detecting freezing of gait in Parkinson’s disease patients through on-body acceleration sensors. Sensors. 2020;20(7):66.
Article Google Scholar
Wang Z, Yan W, Oates T, Time series classification from scratch with deep neural networks: a strong baseline. 2016. arXiv:1611.06455
Castelvecchi D. Can we open the black box of AI? Nature. 2016;538(7623):20–3.
Article CAS Google Scholar
Barredo Arrieta A, Díaz-Rodríguez N, Del Ser J, Bennetot A, Tabik S, Barbado A, Garcia S, Gil-Lopez S, Molina D, Benjamins R, Chatila R, Herrera F. Explainable artificial intelligence (XAI): concepts, taxonomies, opportunities and challenges toward responsible AI. Inf Fusion. 2020;58:82–115.
Article Google Scholar
Ancona M, Ceolini E, Öztireli C, Gross M, Towards better understanding of gradient-based attribution methods for deep neural networks. 2017. arXiv:1711.06104
Bach S, Binder A, Montavon G, Klauschen F, Müller K-R, Samek W. On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation. PLoS ONE. 2015;10(7):0130140.
Article Google Scholar
Böhle M, Eitel F, Weygandt M, Ritter K. Layer-wise relevance propagation for explaining deep neural network decisions in MRI-based Alzheimer’s disease classification. Front Aging Neurosci. 2019;11:194.
Article PubMed PubMed Central Google Scholar
Sturm I, Lapuschkin S, Samek W, Müller K-R. Interpretable deep neural networks for single-trial EEG classification. J Neurosci Methods. 2016;274:141–5.
Article PubMed Google Scholar
Horst F, Lapuschkin S, Samek W, Müller K-R, Schöllhorn WI. Explaining the unique nature of individual gait patterns with deep learning. Sci Rep. 2019;9(1):2391.
Article PubMed PubMed Central Google Scholar
Spildooren J, Vercruysse S, Desloovere K, Vandenberghe W, Kerckhofs E, Nieuwboer A. Freezing of gait in Parkinson’s disease: the impact of dual-tasking and turning. Mov Disord. 2010;25(15):2563–70.
Article PubMed Google Scholar
Nieuwboer A, Rochester L, Herman T, Vandenberghe W, Emil GE, Thomaes T, Giladi N. Reliability of the new freezing of gait questionnaire: agreement between patients with Parkinson’s disease and their carers. Gait Post. 2009;30(4):459–63.
Article Google Scholar
Kadaba MP, Ramakrishnan HK, Wootten ME. Measurement of lower extremity kinematics during level walking. J Orthop Res. 1990;8(3):383–92.
Article CAS PubMed Google Scholar
Davis RB, Õunpuu S, Tyburski D, Gage JR. A gait analysis data collection and reduction technique. Hum Mov Sci. 1991;10(5):575–87.
Article Google Scholar
Canning CG, Ada L, Johnson JJ, McWhirter S. Walking capacity in mild to moderate Parkinson’s disease. Arch Phys Med Rehabil. 2006;87(3):371–5.
Article PubMed Google Scholar
Bowen A, Wenman R, Mickelborough J, Foster J, Hill E, Tallis R. Dual-task effects of talking while walking on velocity and balance following a stroke. Age Ageing. 2001;30(4):319–23.
Article CAS PubMed Google Scholar
Spildooren J, Vercruysse S, Meyns P, Vandenbossche J, Heremans E, Desloovere K, Vandenberghe W, Nieuwboer A. Turning and unilateral cueing in Parkinson’s disease patients with and without freezing of gait. Neuroscience. 2012;207:298–306.
Article CAS PubMed Google Scholar
Zeni JA Jr, Richards JG, Higginson JS. Two simple methods for determining gait events during treadmill and overground walking using kinematic data. Gait Post. 2008;27(4):710–4.
Article Google Scholar
Hsu C-W, Chang C-C, Lin C-J, A practical guide to support vector classification. Technical report, Department of Computer Science, National Taiwan University. 2003. http://www.csie.ntu.edu.tw/~cjlin/papers.html
Ismail Fawaz H, Forestier G, Weber J, Idoumghar L, Muller P-A. Deep learning for time series classification: a review. Data Min Knowl Discov. 2019;33(4):917–63.
Article Google Scholar
Lecun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proc IEEE. 1998;86(11):2278–324.
Article Google Scholar
Goodfellow IJ, Bengio Y, Courville A. Deep learning. Cambridge: MIT Press; 2016.
Google Scholar
Lin M, Chen Q, Yan S. Network in network. 2013. arXiv:1312.4400
Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R. Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res. 2014;15:1929–58.
Google Scholar
Boser BE, Guyon IM, Vapnik VN. A training algorithm for optimal margin classifiers. In: Proceedings of the fifth annual workshop on computational learning theory (COLT’92). New York: Association for Computing Machinery; 1992. pp 144–52.
Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20(3):273–97.
Article Google Scholar
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay É. Scikit-learn: machine learning in python. J Mach Learn Res. 2011;12(85):2825–30.
Google Scholar
Bergstra JS, Bardenet R, Bengio Y, Kégl B, Algorithms for hyper-parameter optimization. In: Shawe-Taylor J, Zemel RS, Bartlett PL, Pereira F, Weinberger KQ, editors. Advances in neural information processing systems, vol 24. Red Hook: Curran Associates, Inc.; 2011. pp 2546–54. http://papers.nips.cc/paper/4443-algorithms-for-hyper-parameter-optimization.pdf
McNEMAR Q. Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika. 1947;12(2):153–7.
Article CAS PubMed Google Scholar
Raschka S. Model evaluation, model selection, and algorithm selection in machine learning. 2018. arXiv:1811.12808
Montavon G, Lapuschkin S, Binder A, Samek W, Müller K-R. Explaining nonlinear classification decisions with deep Taylor decomposition. Pattern Recognit. 2017;65:211–22.
Article Google Scholar
Nieuwboer A, Dom R, De Weerdt W, Desloovere K, Janssens L, Stijn V. Electromyographic profiles of gait prior to onset of freezing episodes in patients with Parkinson’s disease. Brain. 2004;127(Pt 7):1650–60.
Article PubMed Google Scholar
Naghavi N, Miller A, Wade E. Towards real-time prediction of freezing of gait in patients with Parkinson’s disease: addressing the class imbalance problem. Sensors. 2019;19(18):66.
Article Google Scholar
Arami A, Poulakakis-Daktylidis A, Tai YF, Burdet E. Prediction of gait freezing in parkinsonian patients: a binary classification augmented with time series prediction. IEEE Trans Neural Syst Rehabil Eng. 2019;27(9):1909–19.
Article PubMed Google Scholar
Nieuwboer A, Chavret F, Willems A-M, Desloovere K. Does freezing in Parkinson’s disease change limb coordination? J Neurol. 2007;254(9):1268.
Article Google Scholar
Plotnik M, Giladi N, Hausdorff JM. Is freezing of gait in Parkinson’s disease a result of multiple gait impairments? Implications for treatment. Parkin Dis. 2012;2012:459321.
Google Scholar
Barre A, Armand S. Biomechanical ToolKit: open-source framework to visualize and process biomechanical data. Comput Methods Programs Biomed. 2014;114(1):80–7.
Article PubMed Google Scholar
Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M, Ghemawat S, Irving G, Isard M, Kudlur M, Levenberg J, Monga R, Moore S, Murray DG, Steiner B, Tucker P, Vasudevan V, Warden P, Wicke M, Yu Y, Zheng X. TensorFlow: a system for large-scale machine learning. In: Proceedings of the 12th USENIX conference on operating systems design and implementation (OSDI’16). USA: USENIX Association; 2016. pp 265–83.
Bergstra J, Komer B, Eliasmith C, Yamins D, Cox DD. Hyperopt: a python library for model selection and hyperparameter optimization. Comput Sci Discov. 2015;8(1):014008.
Article Google Scholar
Ridgway G, Bipolar Colormap; 2020. https://www.mathworks.com/matlabcentral/fileexchange/26026-bipolar-colormap Accessed 17 June 2020
Pataky TC. One-dimensional statistical parametric mapping in python. Comput Methods Biomech Biomed Engin. 2012;15(3):295–301.
Article PubMed Google Scholar
Raschka S. Mlxtend: providing machine learning and data science utilities and extensions to python’s scientific computing stack. J Open Source Softw. 2018;6:66.
Google Scholar
Goetz CG, Tilley BC, Shaftman SR, Stebbins GT, Fahn S, Martinez-Martin P, Poewe W, Sampaio C, Stern MB, Dodel R, Dubois B, Holloway R, Jankovic J, Kulisevsky J, Lang AE, Lees A, Leurgans S, LeWitt PA, Nyenhuis D, Olanow CW, Rascol O, Schrag A, Teresi JA, van Hilten JJ, LaPelle N. Movement Disorder Society UPDRS Revision Task Force: movement disorder society-sponsored revision of the unified Parkinson’s disease rating scale (MDS-UPDRS): scale presentation and clinimetric testing results. Mov Disord. 2008;23(15):2129–70.
Article PubMed Google Scholar
Hoehn MM, Yahr MD. Parkinsonism: onset, progression and mortality. Neurology. 1967;17(5):427–42.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We thank the employees of the gait laboratory for technical support during data collection.

Funding

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Author information

Authors and Affiliations

Intelligent Mobile Platform Research Group, Department of Mechanical Engineering, KU Leuven, Andreas Vesaliusstraat 13, 3000, Leuven, Belgium
Benjamin Filtjens, Muhammad Raheel Afzal & Peter Slaets
eMedia Research Lab/STADIUS, Department of Electrical Engineering (ESAT), KU Leuven, Andreas Vesaliusstraat 13, 3000, Leuven, Belgium
Benjamin Filtjens & Bart Vanrumste
Research Group for Neurorehabilitation (eNRGy), Department of Rehabilitation Sciences, KU Leuven, Tervuursevest 101, 3001, Heverlee, Belgium
Pieter Ginis & Alice Nieuwboer
Faculty of Rehabilitation Sciences, REVAL - Rehabilitation Research Center, Hasselt University, Agoralaan Building A, 3590, Diepenbeek, Belgium
Joke Spildooren

Authors

Benjamin Filtjens
View author publications
You can also search for this author in PubMed Google Scholar
Pieter Ginis
View author publications
You can also search for this author in PubMed Google Scholar
Alice Nieuwboer
View author publications
You can also search for this author in PubMed Google Scholar
Muhammad Raheel Afzal
View author publications
You can also search for this author in PubMed Google Scholar
Joke Spildooren
View author publications
You can also search for this author in PubMed Google Scholar
Bart Vanrumste
View author publications
You can also search for this author in PubMed Google Scholar
Peter Slaets
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Study design by BF, PG, MRA, BV, and PS. Subject recruitment, data collection, and data preparation by JS and AN. Design and implementation of the neural network architecture by BF. Statistics by BF, MRA, and BV. The first draft of the manuscript was written by BF and all authors commented on subsequent revisions. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Benjamin Filtjens.

Ethics declarations

Ethics approval and consent to participate

The study was approved by the local ethics committee of the University Hospital Leuven and all subjects gave written informed consent.

Consent for publication

Not applicable.

Competing interests

The authors declare that there is no competing interests regarding the publication of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1. Table S1

: The evaluated hyperparameter space of the convolutional neural network (CNN).

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Filtjens, B., Ginis, P., Nieuwboer, A. et al. Modelling and identification of characteristic kinematic features preceding freezing of gait with convolutional neural networks and layer-wise relevance propagation. BMC Med Inform Decis Mak 21, 341 (2021). https://doi.org/10.1186/s12911-021-01699-0

Download citation

Received: 23 September 2020
Accepted: 23 November 2021
Published: 07 December 2021
DOI: https://doi.org/10.1186/s12911-021-01699-0

Modelling and identification of characteristic kinematic features preceding freezing of gait with convolutional neural networks and layer-wise relevance propagation