Machine learning-empowered sleep staging classification using multi-modality signals

Satapathy, Santosh Kumar; Brahma, Biswajit; Panda, Baidyanath; Barsocchi, Paolo; Bhoi, Akash Kumar

doi:10.1186/s12911-024-02522-2

Research
Open access
Published: 06 May 2024

BMC Medical Informatics and Decision Making volume 24, Article number: 119 (2024)

Abstract

The goal is to enhance the performance of an automated sleep staging system by leveraging the diverse signals captured in multi-modal polysomnography (PSG) recordings. Three PSG modalities, namely electroencephalogram (EEG), electrooculogram (EOG), and electromyogram (EMG), were considered in order to find the optimal fusion of PSG signals, and 63 features were extracted from them. These include frequency-based, time-based, statistical, entropy-based, and non-linear features. We adopted the ReliefF (ReF) feature selection algorithm to find the most suitable features for each individual signal and for each fusion of PSG signals. The twelve top-ranked features, those most strongly correlated with the sleep stages, were selected from the extracted feature sets. The selected features were fed into an AdaBoost with Random Forest (ADB + RF) classifier to validate the chosen features and classify the sleep stages. The experiments were carried out under two testing schemes: epoch-wise testing and subject-wise testing. The study used four publicly available datasets: ISRUC-Sleep subgroup1 (ISRUC-SG1), Sleep-EDF (S-EDF), the Physio Bank CAP sleep database (PB-CAPSD), and S-EDF-78. The results show that the proposed fusion strategy outperforms the common practice of using individual PSG signals.
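The feature-ranking idea behind the selection step can be illustrated with a simplified Relief-style scorer. This is a minimal sketch and not the authors' ReliefF implementation: it uses only one nearest hit and one nearest miss per sample (ReliefF proper averages over k neighbors and weights the misses of each other class by its prior), it assumes features already scaled to [0, 1], and it runs on a tiny synthetic two-feature dataset. The names `relief_scores` and `top_k_features` are hypothetical.

```python
from typing import List, Sequence


def relief_scores(X: Sequence[Sequence[float]], y: Sequence[int]) -> List[float]:
    """Simplified Relief weights (one nearest hit, one nearest miss).

    A feature gains weight when it differs on the nearest sample of another
    class (miss) and loses weight when it differs on the nearest sample of
    the same class (hit). Assumes every feature is scaled to [0, 1].
    """
    n, d = len(X), len(X[0])
    weights = [0.0] * d

    def dist(a: Sequence[float], b: Sequence[float]) -> float:
        # Manhattan distance between two feature vectors.
        return sum(abs(p - q) for p, q in zip(a, b))

    for i in range(n):
        hits = [j for j in range(n) if j != i and y[j] == y[i]]
        misses = [j for j in range(n) if y[j] != y[i]]
        if not hits or not misses:
            continue  # a sample needs one neighbor of each kind
        hit = min(hits, key=lambda j: dist(X[i], X[j]))
        miss = min(misses, key=lambda j: dist(X[i], X[j]))
        for f in range(d):
            diff_miss = abs(X[i][f] - X[miss][f])
            diff_hit = abs(X[i][f] - X[hit][f])
            weights[f] += (diff_miss - diff_hit) / n
    return weights


def top_k_features(weights: Sequence[float], k: int) -> List[int]:
    """Indices of the k highest-weighted features, best first."""
    order = sorted(range(len(weights)), key=lambda f: weights[f], reverse=True)
    return order[:k]


# Synthetic data (illustrative only): feature 0 separates the two classes,
# feature 1 is unrelated to the class label.
X = [[0.1, 0.9], [0.2, 0.1], [0.0, 0.5], [0.9, 0.8], [0.8, 0.2], [1.0, 0.5]]
y = [0, 0, 0, 1, 1, 1]

weights = relief_scores(X, y)
print(top_k_features(weights, 1))
```

In the setting described above, `X` would hold the 63 extracted features per epoch and `top_k_features(weights, 12)` would return the twelve highest-ranked ones, which could then be passed to a boosted ensemble classifier.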

Introduction

Sleep is a fundamental necessity for humans, crucial for maintaining physical and mental well-being [1]. Inadequate sleep has been observed to cause difficulties in learning, concentration, and decision-making, and it can impair social interactions. Prolonged adherence to poor sleep habits may result in various sleep disorders. Notably, certain sleep disorders, like obstructive sleep apnea (OSA) [2], have direct or indirect associations with chronic conditions, such as an increased risk of stroke [3]. Additionally, insomnia has been linked to conditions like diabetes and cardiovascular diseases [4]. Assessing sleep quality and applying proper diagnostic procedures to diverse sleep problems is therefore imperative for overall health. Two main standards, the Rechtschaffen and Kales (R&K) rules and the American Academy of Sleep Medicine (AASM) guidelines, are used to examine sleep patterns and their attributes. The R&K rules divide the sleep cycle into seven stages: Wake (W), Stage 1 (S1), Stage 2 (S2), Stage 3 (S3), Stage 4 (S4), Rapid Eye Movement (REM), and movement time. Stages S1 to S4 are the non-REM sleep stages. The AASM later introduced updated guidelines that consolidate the sleep cycle into five stages: Wakefulness (W), N1, N2, N3, and REM, with S3 and S4 merged into the single N3 stage [5].
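The relationship between the two scoring standards amounts to a label mapping, which the following minimal sketch makes explicit. The label strings (`"S1"`, `"MT"`, and so on) and the function name `rk_to_aasm` are illustrative assumptions, as is the choice to map epochs with no AASM counterpart (such as movement time) to `None`; the source does not say how such epochs are handled.

```python
from typing import List, Optional

# R&K stage label -> AASM stage label (label strings are illustrative).
RK_TO_AASM = {
    "W": "W",
    "S1": "N1",
    "S2": "N2",
    "S3": "N3",  # S3 and S4 are merged into
    "S4": "N3",  # the single AASM stage N3
    "REM": "REM",
}


def rk_to_aasm(labels: List[str]) -> List[Optional[str]]:
    """Convert per-epoch R&K labels to AASM labels.

    Labels without an AASM counterpart (e.g. movement time, "MT")
    become None so the caller can decide whether to drop them.
    """
    return [RK_TO_AASM.get(label) for label in labels]


hypnogram_rk = ["W", "S1", "S2", "S3", "S4", "MT", "REM"]
hypnogram_aasm = rk_to_aasm(hypnogram_rk)
print(hypnogram_aasm)
```

Datasets annotated under the R&K rules can be brought to the five-stage AASM scheme this way before training, so that recordings scored under either standard share one label set.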

Experts commonly use the polysomnography (PSG) test to assess different types of sleep disorders. PSG recordings typically include an electroencephalogram (EEG) [6], electrocardiogram (ECG) [6], electrooculogram (EOG) [7], and electromyogram (EMG) [8]. These signals are recorded and then analyzed visually by experts. The process involves at least two experts, one interpreting the signal waveforms while the other annotates them [9]. In this traditional diagnostic approach, the subject's sleep behavior is observed and labeled by manual inspection. However, the method often yields less reliable results because labeling and annotation skills vary among experts [10], and reaching a consensus on sleep stage labels between the two experts can be challenging. As a result, many automated sleep staging systems have been developed to score sleep stages across various sleep disorders [11].

Figure 1 illustrates the EEG patterns of the sleep stages. The recording is from subject id-61, a 61-year-old male with periodic limb movement disorder, and is taken from the Physio Bank CAP Sleep (PB-CAPSD) database [12]. The figure highlights the distinct EEG behavior of each sleep stage, annotated to show its waveform characteristics. The N1 stage is a transitional phase between wakefulness and sleep. In this stage, the EEG predominantly contains alpha waveforms, and the stage accounts for about 2–5% of total sleep. In stage N2, waveforms such as sleep spindles and K-complexes are prevalent, and the stage accounts for approximately 40–60% of a subject's total sleep [12]. Finally, REM stage behavior closely resembles the wake stage, featuring sawtooth waves with alpha and theta activity [13]. The interconnected changes in sleep behavior during transitions between stages play a vital role in studying mental and physical health. Individuals with various sleep disorders often deviate from a regular sleep cycle [14].
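The stage proportions quoted above (about 2–5% for N1 and 40–60% for N2) can be checked against any scored recording by counting epochs. The sketch below assumes that all epochs have equal duration, so counts are proportional to time, and that wake epochs are excluded from total sleep; the hypnogram is synthetic and the function name `stage_percentages` is hypothetical.

```python
from collections import Counter
from typing import Dict, List


def stage_percentages(hypnogram: List[str]) -> Dict[str, float]:
    """Share of total sleep per stage, in percent.

    Assumes equal-length epochs; wake ("W") epochs are not counted
    as sleep and are left out of the denominator.
    """
    sleep_epochs = [stage for stage in hypnogram if stage != "W"]
    if not sleep_epochs:
        return {}
    counts = Counter(sleep_epochs)
    return {stage: 100.0 * n / len(sleep_epochs) for stage, n in counts.items()}


# Synthetic night (illustrative only): 10 wake epochs and 100 sleep epochs.
hypnogram = ["W"] * 10 + ["N1"] * 4 + ["N2"] * 50 + ["N3"] * 20 + ["REM"] * 26
percentages = stage_percentages(hypnogram)
for stage in ("N1", "N2", "N3", "REM"):
    print(stage, percentages[stage])
```

The small share of N1 epochs also means that N1 is heavily under-represented in training data, which is worth keeping in mind when reading per-stage results.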

Therefore, classifying sleep stages, particularly N1 or an extended transition period like N2, is crucial for identifying irregularities during sleep. In routine practice, sleep experts manually review multiple EEG recordings and label them with the corresponding sleep stages, which makes the entire process labor-intensive, time-consuming, and costly [15].

At the intersection of brainwave analysis and machine learning, feature extraction from EEG signals plays a pivotal role. The wavelet transform, for instance, analyzes a signal at multiple scales, which makes it valuable for detecting episodic events or changes in the signal over time. It is therefore well suited to identifying changes in EEG signals, such as sudden increases or decreases in activity, that may be associated with specific events. Combining such features with machine learning techniques is a promising way to improve the accuracy of sleep pattern analysis.
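How a wavelet transform exposes a sudden change can be seen with the simplest wavelet, the Haar wavelet. The sketch below is an illustration under stated assumptions: the source does not name a wavelet family, so Haar is chosen here only for brevity, the input is a toy step signal rather than EEG, and the function names are hypothetical. Each level splits the signal into a smooth approximation and a detail part, and the detail coefficients are large only where the signal changes abruptly.

```python
import math
from typing import List, Tuple


def haar_step(signal: List[float]) -> Tuple[List[float], List[float]]:
    """One level of the orthonormal Haar transform.

    Returns (approximation, detail); the input length must be even.
    """
    if len(signal) % 2 != 0:
        raise ValueError("signal length must be even")
    root2 = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / root2 for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / root2 for i in range(0, len(signal), 2)]
    return approx, detail


def haar_decompose(signal: List[float], levels: int) -> List[List[float]]:
    """Multi-level decomposition: [detail_1, ..., detail_L, approx_L]."""
    coeffs = []
    current = list(signal)
    for _ in range(levels):
        current, detail = haar_step(current)
        coeffs.append(detail)
    coeffs.append(current)
    return coeffs


def energy(values: List[float]) -> float:
    """Sum of squares, a common per-band feature."""
    return sum(v * v for v in values)


# Toy signal (illustrative only): activity jumps from 0 to 1 at sample 5.
signal = [0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0]
detail_1, detail_2, approx_2 = haar_decompose(signal, levels=2)

# The largest first-level detail coefficient marks the pair of samples
# (indices 4 and 5) where the jump happens.
jump_pair = max(range(len(detail_1)), key=lambda i: abs(detail_1[i]))
print("jump between samples", 2 * jump_pair, "and", 2 * jump_pair + 1)
print("band energies:", energy(detail_1), energy(detail_2), energy(approx_2))
```

Because the transform is orthonormal, the band energies add up to the energy of the original signal, so they can serve as features that describe how signal power is distributed across scales.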

Despite the successes seen with both single and multi-modal sleep staging methods, several notable drawbacks persist:

i) A generalized framework that adapts to classification tasks ranging from the conventional five-stage problem down to the two-stage problem is lacking.
ii) Supervised classification models, while effective with known data, may struggle with new records and can misclassify significant sleep stage patterns. Additionally, the features extracted by these models may be limited and fail to capture the complexity of the original signals adequately.
iii) Several epochs are misclassified as belonging to either the N1 or the REM stage, which directly lowers the accuracy of sleep staging algorithms.

This study aims to leverage multi-modal signal fusions and apply them using machine learning techniques to overcome the limitations of traditional methods in sleep scoring. The objective is to enhance the consistency of polysomnography scoring and develop classifiers with high accuracy for each sleep stage.

Related research

Over the years, researchers have developed different sleep staging methods based on machine learning and deep learning techniques. Most studies fall into two categories: i) single-channel-based and ii) multi-channel-based methods. In [16], the authors analyzed the sleep characteristics of pooled epochs, then screened the features and selected the most suitable ones based on relevance. In [17], the authors applied a band-pass filter during pre-processing to remove artifacts from the data. Their method outperformed existing procedures and proved particularly effective for detecting dishonesty in EEG-based Brain-Computer Interface (BCI) systems.

In [18], the authors employed an orthogonal convolutional neural network (OCNN) to extract features from recorded polysomnography signals. They conducted their analysis on two publicly available sleep datasets from UCD and MIT-BIH, on which the OCNN model achieved accuracies of 88.4% and 87.6%, respectively. In [19], the author employed multi-modal classification and decision-making systems for sleep staging, incorporating an external neural network. The experiments used the CAP sleep dataset, and the model performed well compared to an individual CNN model, achieving an accuracy of 95.43% on the six-class classification problem. In [20], the author introduced a novel approach for automated scoring of sleep stages from single-channel EEG signals, based on a cascaded recurrent neural network (RNN) architecture. The EEG data were preprocessed, 55 time- and frequency-domain features were extracted, and the most relevant features were selected via feature reduction techniques. Overall, the model achieved a classification accuracy of 86.7% for the five stages of sleep. The primary focus of this effort was to improve classification performance in sleep stage N1 while still achieving satisfactory results in the remaining stages. In [21], a novel method for automatic sleep stage classification from single-channel EEG was proposed. The main idea is to feed the raw EEG signal directly into a deep convolutional neural network (CNN), bypassing the feature extraction and selection steps used in previous approaches. The network consists of nine convolutional layers followed by two fully connected layers. The method achieved an accuracy above 90% on the two- to six-class tasks, an improvement over existing methods, with Cohen's kappa coefficients of 0.98, 0.94, 0.90, 0.86, and 0.89, respectively, indicating strong agreement between predicted and actual sleep stages.

In [22], the author mapped the feature vector into a weighted undirected network, from which various structural and spectral characteristics were then extracted. In [23], the author used multi-scale deep neural architectures in which the decomposed signals were fed into a CNN model for further analysis of the sleep patterns. The model reached an accuracy of 80.7% on S-EDF and 86.5% on the MASS dataset. In [24], the author used semi-supervised learning techniques to obtain a better representation of EEG signals for sleep staging. On two public datasets, S-EDF and ISRUC-Sleep, the model achieved accuracies of 70.01% and 50.36%, respectively. In [25], the author introduced a lightweight automated sleep staging system designed specifically for children, using a single-channel EEG signal and combining Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) models to classify the sleep stages. The experiments used two datasets: a children's sleep dataset and the Sleep-EDFx dataset. The system achieved an accuracy of 83.06% on the children's sleep dataset using the F4-M1 channel and 86.41% on the Sleep-EDFx dataset with manual feature extraction. In [26], the authors used multi-branch one-dimensional convolutional neural networks to extract various frequency-domain features from single-channel EEG data, achieving an accuracy of 90.31%, a specificity of 95.30%, and an F1 score of 65.73%. Some of the recent studies on sleep staging are presented in Table 1.
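The studies above report accuracy, specificity, F1 score, and Cohen's kappa, and these metrics can differ considerably on the same predictions. The following minimal sketch computes all four from a confusion matrix using their standard definitions; the labels and predictions are made up for illustration and do not come from any of the cited studies, and the function names are hypothetical.

```python
from typing import List, Sequence, Tuple


def confusion_matrix(y_true: Sequence[str], y_pred: Sequence[str],
                     labels: Sequence[str]) -> List[List[int]]:
    """cm[i][j] = number of epochs with true label i predicted as label j."""
    index = {label: i for i, label in enumerate(labels)}
    cm = [[0] * len(labels) for _ in labels]
    for t, p in zip(y_true, y_pred):
        cm[index[t]][index[p]] += 1
    return cm


def accuracy(cm: List[List[int]]) -> float:
    total = sum(sum(row) for row in cm)
    return sum(cm[i][i] for i in range(len(cm))) / total


def cohen_kappa(cm: List[List[int]]) -> float:
    """Agreement corrected for chance: (p_o - p_e) / (1 - p_e)."""
    total = sum(sum(row) for row in cm)
    p_o = accuracy(cm)
    row_sums = [sum(row) for row in cm]
    col_sums = [sum(cm[i][j] for i in range(len(cm))) for j in range(len(cm))]
    p_e = sum(r * c for r, c in zip(row_sums, col_sums)) / (total * total)
    return (p_o - p_e) / (1.0 - p_e)


def specificity_and_f1(cm: List[List[int]], k: int) -> Tuple[float, float]:
    """One-vs-rest specificity and F1 score for class index k."""
    total = sum(sum(row) for row in cm)
    tp = cm[k][k]
    fn = sum(cm[k]) - tp
    fp = sum(cm[i][k] for i in range(len(cm))) - tp
    tn = total - tp - fn - fp
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    f1 = 2 * tp / (2 * tp + fp + fn) if (2 * tp + fp + fn) else 0.0
    return specificity, f1


# Made-up labels for ten epochs (illustrative only).
labels = ["W", "N1", "REM"]
y_true = ["W", "W", "W", "N1", "N1", "N1", "REM", "REM", "REM", "REM"]
y_pred = ["W", "W", "N1", "N1", "N1", "REM", "REM", "REM", "REM", "N1"]

cm = confusion_matrix(y_true, y_pred, labels)
acc = accuracy(cm)
kappa = cohen_kappa(cm)
n1_specificity, n1_f1 = specificity_and_f1(cm, labels.index("N1"))
print(cm)
print(acc, kappa, n1_specificity, n1_f1)
```

Kappa discounts the agreement expected by chance, and per-class F1 is sensitive to how well a rare stage such as N1 is recognized, so both are useful complements to overall accuracy when comparing methods.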

Table 1 Recent research works carried out on automated sleep stage classification using EEG and PSG signals
