
AI and semantic ontology for personalized activity eCoaching in healthy lifestyle recommendations: a meta-heuristic approach

Abstract

Background

Automated coaches (eCoach) can help people lead a healthy lifestyle (e.g., reduction of sedentary bouts) with continuous health status monitoring and personalized recommendation generation with artificial intelligence (AI). Semantic ontology can play a crucial role in knowledge representation, data integration, and information retrieval.

Methods

This study proposes a semantic ontology model to annotate the AI predictions, forecasting outcomes, and personal preferences to conceptualize a personalized recommendation generation model with a hybrid approach. This study considers a mixed activity projection method that takes individual activity insights from the univariate time-series prediction and ensemble multi-class classification approaches. We have introduced a way to improve the prediction result with a residual error minimization (REM) technique and make it meaningful in recommendation presentation with a Naïve-based interval prediction approach. We have integrated the activity prediction results in an ontology for semantic interpretation. The SPARQL Protocol and RDF Query Language (SPARQL) has been used to generate personalized recommendations in an understandable format. Moreover, we have evaluated the performance of the time-series prediction and classification models against standard metrics on both imbalanced and balanced public PMData and private MOX2-5 activity datasets. We have used the Adaptive Synthetic (ADASYN) sampling method to generate synthetic data from the minority classes to avoid bias. The activity datasets were collected from healthy adults (n = 16 for the public dataset; n = 15 for the private dataset). Standard ensemble algorithms have been used to investigate the possibility of classifying daily physical activity levels into the following activity classes: sedentary (0), low active (1), active (2), highly active (3), and rigorous active (4). The daily step count, low physical activity (LPA), medium physical activity (MPA), and vigorous physical activity (VPA) serve as input for the classification models. Subsequently, we re-verified the classifiers on the private MOX2-5 dataset. The performance of the ontology has been assessed with reasoning and SPARQL query execution times. Additionally, we have verified our ontology for effective recommendation generation.

Results

We have tested several standard AI algorithms and selected the best-performing models with optimized configurations for our use case by empirical testing. We have found that the autoregression model with the REM method outperforms the autoregression model without the REM method for both datasets. The Gradient Boosting (GB) classifier outperforms other classifiers with mean accuracy scores of 98.00% and 99.00% for the imbalanced PMData and MOX2-5 datasets, respectively, and 98.30% and 99.80% for the balanced PMData and MOX2-5 datasets, respectively. The HermiT reasoner performs better than other ontology reasoners under the defined settings. Our proposed algorithm shows a direction for combining AI prediction and forecasting results in an ontology to generate personalized activity recommendations in eCoaching.

Conclusion

The proposed method combining step-prediction, activity-level classification techniques, and personal preference information with semantic rules is an asset for generating personalized recommendations.


Key Contributions to the Literature

  • This conceptual study has hypothesized a personalized hybrid activity recommendation generation method in an activity eCoach prototype system.

  • The daily collection of real-time activity data with a medical-grade wearable activity sensor (e.g., MOX2-5) has served as an input for the activity eCoaching session. Recommendation generation aims to motivate participants to meet their personal activity goals and reduce sedentary time. The individual preference datasets (such as goal setting, response type, and interaction type) have been helpful for the meaningful delivery of personalized recommendation messages.

  • The autoregression model with residual error minimization technique has shown the potential to improve forecasting performance in time series. Besides, the ensemble approach has been helpful for daily activity level classification on activity sensor data.

  • We have introduced the application of the ADASYN sampling algorithm for data balancing to avoid prediction biases in machine learning classifiers. Moreover, we have used the Matthews correlation coefficient (MCC) metric to cross-verify prediction biases.

  • Semantic ontology has been used to logically represent personal preference data, prediction and classification outcomes, knowledge reasoning, and querying. Combined with a defined ruleset, the SPARQL queries help to generate personalized physical activity recommendations.

Introduction

This section encompasses the background, motivation, current state-of-the-art, and the study's objectives. Additionally, it includes a qualitative comparison with prior research to highlight the uniqueness and innovation brought by this study.

Background

About 60% to 85% of people worldwide lead a sedentary lifestyle [1]. The collective effects of a sedentary lifestyle are related to several adverse health outcomes, including an increased risk of lifestyle diseases such as obesity, type II diabetes, high blood pressure, depression, and cardiovascular threats [1,2,3,4,5,6,7,8,9,10]. Regular physical activity has a positive impact on preventing and managing lifestyle diseases. Compared with people who exercise adequately, people with insufficient activity have a 20% to 30% higher risk of death [10]. An automatic health coach may help people manage a healthy lifestyle with ubiquitous personalized health state monitoring (e.g., physical activity, nutrition, healthy habits) and tailored recommendations [11,12,13,14]. A coaching process can be “in-person” or “technology-driven” (via telematic means) [12]. In-person coaching with manual activity tracking and personalized recommendations is inefficient and repetitive; therefore, an automatic coach can be more efficient in this regard. An eCoach system tries to involve users proactively in an ongoing collaborative dialogue to support planning and encourage effective goal management, using personalized health and wellness status monitoring and, thereby, recommendation generation to meet lifestyle goals [14].

Recommendation technology, a decision-making approach for complex information environments, can be classified as rule-based or data-driven [15,16,17]. Data-driven recommendations use AI algorithms. In contrast, rule-based recommendation technology uses binary logic in a symbolic form to present knowledge as IF–THEN or IF–ELSEIF–THEN rules and infers new knowledge with a reasoning method. A knowledge base (KB) is maintained to store and access such rules and the associated messages. Rules can be specified in the form of propositional logic, decision trees, relational algebra, and description logic. Rule-based systems are modular, intelligible, and easy to manage; however, they suffer from the symbol grounding problem [16]. The data-driven approach suffers from the need for sufficient data and high computing power, a lack of interpretability, re-training for new cases, and personalization and cold-start problems. Therefore, to overcome the failings of data-driven and rule-based recommendations, a hybrid approach can be useful.
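As a toy illustration of the rule-based side only, a minimal IF–THEN ruleset could be sketched as below; the thresholds, variable names, and message texts are hypothetical and are not taken from this study's knowledge base.

```python
# Minimal sketch of a rule-based recommendation step (hypothetical thresholds and messages).
def recommend(daily_steps: int, sedentary_minutes: int) -> str:
    # IF-THEN rules stored as (condition, message) pairs in a tiny knowledge base.
    rules = [
        (lambda s, m: s < 5000 and m > 600, "You have been mostly sedentary today; try a short walk."),
        (lambda s, m: s >= 10000, "Great job - you reached an active day!"),
    ]
    for condition, message in rules:
        if condition(daily_steps, sedentary_minutes):
            return message
    return "Keep going to reach your daily goal."

print(recommend(daily_steps=4200, sedentary_minutes=700))
```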

Description logic is the formal knowledge representation behind ontology languages (e.g., the Web Ontology Language (OWL)), balancing clarity, complexity, and effectiveness of knowledge description and knowledge reasoning. The Semantic Web Rule Language (SWRL) and SPARQL are well-accepted rule and query languages for semantic ontologies [4]. An ontology is a formal description of knowledge within a domain and its relationships following a hierarchical structure. Other methods of knowledge representation are thesauri, topic maps, and logical models. Still, unlike relational database schemas, ontologies express relationships and allow users to join or link multiple concepts together with the following elements: individuals/objects, classes, attributes, relationships, and axioms [4].

Motivation

Behavior and health are strongly connected. Reducing a sedentary lifestyle through increased physical activity requires self-motivation, self-correlation, and self-management. Tudor-Locke et al. [18] and Matthews et al. [19] showed that human activity varies between weekends and weekdays. Gardner et al. [20] acknowledged that self-monitoring and reforming the social and physical environment are the most encouraging strategies for human behavior change, besides recommending environmental reorganization, persuasion, and education to improve self-regulation skills. Intervention designs to improve physical activity levels and reduce sedentary time vary significantly in content and effectiveness [20,21,22]. Mobile applications used to improve young people's physical activity should include personalized feedback and provide guidance [14]. Only a few available mHealth applications for physical activity have been evaluated, and the evidence is of inferior quality [14].

In a digital activity recommendation system, a tracker is maintained to record daily step count, metabolic equivalent of tasks, kilocalories, and distance to reduce sedentary behavior. Such digital recommendation systems consist of a data collection module, an AI module, and a recommendation generation or decision module. Data are captured over time and analyzed with AI algorithms to generate real-time feedback toward accomplishing personal activity goals. The decision module recommends changes to a person's behavior, daily routine, and activity plan [20]. A walking tracker smartphone app can measure individual activity levels and enable self-monitoring [23, 24]. Most smartphone apps based on modern consumer activity sensors (e.g., Fitbit, Actigraph, MOX2-5, pedometers, Garmin, and smartwatches from Apple, Samsung, and Huawei) incorporate a variety of behavior change models or theories [25,26,27,28]; however, they lack a genuine eCoaching flavor. Meta-analyses from Qiu et al. [29] and Stephenson et al. [30] concluded that using a pedometer has a small but significant effect on reducing sedentary time. Just wearing an activity tracker (even without any form of guidance) can stimulate the passion for performing physical activities to improve the quality of life.

Only a few studies have investigated the use of actionable, data-driven predictive models [31]. Dijkhuis et al. [32] analyzed Hanze University's personalized physical activity coaching with AI algorithms to improve sedentary lifestyles. They collected daily step data to train AI classifiers to estimate the probability of achieving hourly step goals, followed by feedback generation with a web-based coaching application. Hansel et al. [33] designed a fully automated web-based coaching program. They used pedometer-based activity or step monitoring in a randomized group of patients with type 2 diabetes and abdominal obesity to increase their physical activity. Pessemier et al. [34] used raw accelerometer data for individual activity recognition, accepted personal preferences for activity recommendation planning, and generated personalized recommendations with a tag-based recommender and a rule-based filter. Amorim et al. [35] and Oliveira et al. [36] performed activity monitoring with a Fitbit activity sensor in randomized trials. They carried out a statistical analysis to assess the efficacy of a multimodal physical activity intervention with supervised exercises, health coaching, and activity monitoring on the physical activity levels of patients suffering from chronic, nonspecific low back pain. Petsani et al. [37] designed an eCoach system for older people to increase adherence to exergame-based physical activities. They followed eCoaching guidelines set by human therapists/doctors or a familiar person chosen by the user, who can access their persistent health and wellness data and be involved in the coaching process. They remarked that health eCoaching is a complex process that needs careful planning and the integration of different scientific domains, such as psychology, computer science, health informatics, and medical science. Braber et al. [38] incorporated the eCoaching concept into personalized diabetes management, where lifestyle data (e.g., dietary intake, physical activity, glycemic value) were recorded and integrated with clinical rules to give customized coaching to improve adherence to lifestyle recommendations.

Chatterjee et al. [12] focused on creating a meaningful, context-specific ontology to model non-intuitive, raw, and unstructured observations of personal and person-generated health data (e.g., sensors, interviews, questionnaires) using semantic metadata to create a logical abstraction for rule-based health risk prediction and, thereby, personalized lifestyle recommendation generation in a health eCoach system. Villalonga et al. [39] conceptualized an ontology-based automated reasoning model for generating personalized motivational messages for activity coaching considering behavioral characteristics. Thus, ontology can be a good alternative for rule-based decision-making, with robust design flexibility in object-oriented design paradigms.

In combination with wearable activity sensors and digital activity trackers, eCoach features for improving physical activity can be promising and motivating for participants. The application of AI to eCoaching is new. Therefore, real-time data analysis and, thereby, the generation of personalized recommendations with eCoaching is missing in the existing literature, as revealed by searching the well-reputed PubMed/Medline database with the following search string: ((ecoach OR e-coach) AND (activity monitoring) AND (Healthy lifestyle or lifestyle) AND (activity or physical activity or exercise) AND (Sensor or activity sensor or activity tracker) AND (recommendation or recommendation generation) AND (data driven or data-driven or classification or prediction or regression or forecasting or rule-based or rule based or ruleset or knowledge base or knowledge-based or hybrid)). Different activity monitoring and lifestyle coaching smartphone applications are available online; however, they are too generic and lack appropriate design and development guidelines and eCoaching features [12].

State-of-the-art

The state-of-the-art is to generate personalized recommendations using AI and interpretable semantic rules to motivate participants to achieve their activity goals. Goals can be of two types – short-term goals (e.g., weekly) and/or long-term goals (e.g., monthly). Success in short-term goal (STG) attainment may help in achieving long-term goals (LTGs) when the LTGs are the summation of STGs.

Our hypothesis is that an eCoach system can generate meaningful, automatic, and personalized recommendation plans to accomplish individual lifestyle goals. To prove the concept, we have conceptualized the design of the ActieCoach prototype system for physical activity as a study case. ActieCoach can collect activity and personal preference data from actual participants with wearable activity sensors, questionnaires, and self-reported forms, respectively, and thereby process the collected data to forecast daily step counts, classify individual activity levels, and combine the outcomes in an ontology model for semantic knowledge representation to generate personalized recommendations with a query engine against a defined semantic rule set. The semantic rules in an ontology can show a direction to enhance the understandability of recommendation generation with IF-ELSE conditions in a logical tree structure. Most activity trackers, involving mobile apps and smart wearable devices (e.g., smartwatches), predict future activity in terms of "steps" as a point prediction with time-series forecasting, probabilistic approaches, or specific rules. However, point prediction is a very abstract concept. Therefore, a probabilistic interval prediction approach may be encouraging. Only preliminary research has been found on applying AI technology to sensor data and combining the predictive analysis results with semantic rules for hybrid recommendation generation. Moreover, this research adds arguments for attaining the ethical aspects of AI by addressing ethical data collection, data governance, testing for bias, explainable AI, and continuous model improvement with incremental model design.

This study is novel, as no similar work was revealed by the literature search. Recommendation technology has a broad application domain. We have considered only studies related to lifestyle recommendations, either at the personal or the group level. A qualitative comparison between our study and the related studies is made in Table 1 based on the following parameters: hybrid recommendations (data-driven and rule-based), ontology modeling, interval prediction, observation with activity sensors, preference settings, and logical recommendation generation. The high-level descriptions of the used terminologies are specified in Additional file 3: Appendix A.1. The study by Pessemier et al. [34] focused on recommendation generation at the “Community” level; however, our research targets activity coaching and recommendation generation at the “Personal” level.

Table 1 A qualitative comparison between our study and the related healthy lifestyle recommendation studies

Aim of the study

This theoretical and experimental evaluation study in laboratory settings addresses the following identified research questions associated with automatic and tailored recommendation generation for physical activity in our ActieCoach:

  a. How to combine AI forecasting and classification outcomes in an ontology with semantics?

  b. Can a residual error minimization technique improve the performance of time-series prediction?

  c. How to use AI technology with semantic rules in automatic activity coaching for personalized and understandable recommendation generation?

  d. How to handle data imbalances to avoid model biases?

Design of the eCoach system

This section discusses the design of our ActieCoach prototype system. We followed an iterative and incremental approach to design and implement our eCoach prototype, which follows a modular design pattern with four key modules (see Fig. 1): (a) data collection, (b) data processing and activity prediction, (c) integration of the outcomes in an ontology, and (d) recommendation generation based on SPARQL query processing. The data processing and activity prediction module is divided into two sub-modules: step forecasting and daily activity level classification (see Table 2). The individual data collection, step prediction, activity level classification, and personalized recommendation generation are continuous processes according to the eCoach feedback cycle. All the data and results were stored in an in-memory tuple database (TDB) [5].

Fig. 1 The proposed solution for ActieCoach

Table 2 The rules for “Activity Level” feature formation are based on standard guidelines for activity level classification [10]

We used a traditional wearable activity sensor for personal activity data collection following individual consent and ethical guidelines, such as the General Data Protection Regulation (GDPR) [40]. Participants were recruited on a voluntary basis. We prepared a set of questionnaires to collect personal preference data (e.g., activity goal setting, response type, and the nature of interaction) for recommendation planning. Participants were allowed to view and update their preference information at any time. The classification sub-module classifies daily activity data into the following activity levels: sedentary, low physical activity (LPA), medium physical activity (MPA), and vigorous physical activity (VPA). The prediction sub-module is responsible for forecasting daily steps for the next 7 days based on the temporal pattern in individual step data. After semantic annotation with the ontology, the respective module stores personal preference data and activity prediction and classification results in the TDB.

We designed a pipeline to automate the process with an incremental approach to handle growing real-time activity data with continuous machine learning model training, validation, and testing. The recommendation generation module executes a scheduler to query and process individual activity prediction results from the TDB database with a SPARQL query engine on a regular basis. A KB has been maintained in the TDB, in which all the semantic rules for recommendation generation are stored. Semantic rules consist of propositional variables combined with (IMPLIES), (NOT), (AND), and (OR) operations. The recommendation generation module triggers a logical rule of the structure (A IMPLIES B): if specific variables are inferred to be true, then the corresponding suggestions are provided to the participants from the semantic data source. Subsequently, individual recommendation data are updated in the ontology against a timestamp before being stored back in the database. The TDB database can be accessed periodically for personal preference data and for the generation of individual recommendation messages or feedback data.
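As a hedged sketch of this query-and-trigger step, the example below uses Python's rdflib to run a SPARQL query over a small in-memory RDF graph of prediction results. The namespace, class, and property names (e.g., ex:Participant, ex:predictedSteps) and the step threshold are illustrative placeholders rather than the study's actual ontology terms; in the prototype itself this role is played by Jena with the Fuseki/TDB store.

```python
from rdflib import Graph, Literal, Namespace
from rdflib.namespace import RDF, XSD

# Hypothetical namespace; the real ontology IRIs live in the study's KB.
EX = Namespace("http://example.org/ecoach#")

g = Graph()
g.bind("ex", EX)

# Annotate one (hypothetical) daily prediction result for a participant.
g.add((EX.P1, RDF.type, EX.Participant))
g.add((EX.P1, EX.hasPrediction, EX.Pred_2022_05_01))
g.add((EX.Pred_2022_05_01, EX.predictedSteps, Literal(7200, datatype=XSD.integer)))
g.add((EX.Pred_2022_05_01, EX.activityLevel, Literal("low active")))

# A rule of the form A IMPLIES B expressed as a SPARQL query:
# IF predicted steps fall below a (hypothetical) goal THEN select a motivational message.
query = """
PREFIX ex: <http://example.org/ecoach#>
SELECT ?participant ?steps
WHERE {
  ?participant ex:hasPrediction ?pred .
  ?pred ex:predictedSteps ?steps .
  FILTER (?steps < 10000)
}
"""
for participant, steps in g.query(query):
    print(f"{participant} predicted {steps} steps -> trigger a 'move more' recommendation")
```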

The design follows a modular microservice architecture. The exposed eCoach interfaces are protected with multi-factor authentication and authorization rules [41, 42]. The data processing and activity prediction module is written in Python (v. 3.8.x) with the Flask framework and Python libraries. The other modules are written in Java (JDK 11+) with the Spring Boot framework. Open-source Apache libraries (Jena, Jena Fuseki) [5] have been used for ontology implementation and eCoach service deployment.

Ontology modeling and algorithm design for personalized recommendations

The concept of ontology supports an open-world-assumption knowledge representation style with the following elements: classes, individuals/objects, attributes or properties, relationships, and axioms [5, 43]. Properties are of two types: ObjectProperties and DataProperties. Each property has a domain, range, restriction rule, restriction filter, and restriction type, such as Some (existential), Only (universal), Min (minimum cardinality), Exact (exact cardinality), and Max (maximum cardinality) [43]. owl:Thing acts as the super-class in an ontology class hierarchy [43]. An ontology structure can be visualized much like the class diagram of an object-oriented program. An ontology follows a connected, acyclic, and directed tree structure [43]. Our ontology is explained in Table 3, and its high-level structure is depicted in Fig. 2 using the OntoGraf plug-in for Protégé. The asserted class hierarchy of the ontology is depicted in Fig. 3.

Table 3 The Ontology structure and knowledge expression
Fig. 2 The high-level structure of the proposed ontology

Fig. 3 The asserted class hierarchy of our proposed ontology

The object properties, domain, range, property type, and cardinality of the ontology are defined in Table 4. The purpose of an ontology is the semantic representation of knowledge, reasoning, and rule-based decision-making with generalization rules in the induction phase. The proposed ontology follows these knowledge representation phases: the abstraction or lexicon phase (L) for mapping rules, the abduction phase (B) for the hypothesis generation rule, the deduction phase (C) for the operator-reduction rule, and the induction phase (D) for the generalization rule. The resultant recommendation generation tree (T) follows a binary structure, and the syntactic knowledge representation in T helps to address the understandability problem in personalized recommendation generation.

Table 4 Key object Properties, domain, range, and cardinality of the ontology
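To make the property definitions concrete, the following minimal sketch declares two classes, one object property with an explicit domain and range, and a minimum-cardinality restriction using rdflib's OWL vocabulary. The IRIs and names (ex:Participant, ex:hasObservation) are hypothetical stand-ins for the terms listed in Tables 3 and 4, not the ontology's actual identifiers.

```python
from rdflib import BNode, Graph, Literal, Namespace
from rdflib.namespace import OWL, RDF, RDFS, XSD

EX = Namespace("http://example.org/ecoach#")
g = Graph()
g.bind("ex", EX)
g.bind("owl", OWL)

# Classes (hypothetical names standing in for the ontology's real classes).
g.add((EX.Participant, RDF.type, OWL.Class))
g.add((EX.ActivityObservation, RDF.type, OWL.Class))

# An object property with an explicit domain and range, as in Table 4.
g.add((EX.hasObservation, RDF.type, OWL.ObjectProperty))
g.add((EX.hasObservation, RDFS.domain, EX.Participant))
g.add((EX.hasObservation, RDFS.range, EX.ActivityObservation))

# A "min 1" cardinality restriction: every participant has at least one observation.
restriction = BNode()
g.add((restriction, RDF.type, OWL.Restriction))
g.add((restriction, OWL.onProperty, EX.hasObservation))
g.add((restriction, OWL.minCardinality, Literal(1, datatype=XSD.nonNegativeInteger)))
g.add((EX.Participant, RDFS.subClassOf, restriction))

print(g.serialize(format="turtle"))
```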

A set of propositional variables, logics, constants, and operators (such as NOT, AND, OR, IMPLIES, EQUIV, and quantifiers) is linked with ontology representation and processing. “EQUIV” refers to equivalence in logic or mathematics; in the context of a knowledge representation language such as OWL, it denotes an equivalence relation, indicating that two entities have the same meaning or denotation. In this study, the recommendation generation aims to maximize weekly individual physical activity time to minimize sedentary time. The maximization problem of staying medium active for a week (summing over days 1…7) is expressed in Table 5.

Table 5 Expression for the activity maximization problem

According to the World Health Organization (WHO) guidelines, adults (age group: 18–64) should do at least 150–300 min (2.5–5 h) of MPA, or at least 75–150 min of VPA, or an equivalent combination of moderate- and high-intensity activities within a week to stay active [10]. To determine the weekly score of personal goal achievement, we summed up the daily activity scores (see Table 2). eCoaching aims at goal score maximization with constant activity monitoring and recommendation generation. To conceptualize the personalized recommendation generation in our eCoach system, we considered an example personal preferences table (see Table 6). We integrated the designed and developed ontology model into ActieCoach for the logical representation of the forecasting, classification, and personal preference results. Preferences can be of the following three types:

  • Activity goal setting (e.g., kind of goals, direct vs. motivational goals, and generic vs. personalized goals).

  • Response type (e.g., way to communicate extended health state, health state prediction, and tailored recommendations for activity coaching).

  • The kind of interaction with ActieCoach (e.g., mode, frequency, and medium).

Table 6 Personalized preferences of individual participants for recommendation generation

The generic activity goals are the activity guidelines set by the WHO.

Moreover, we have shown a direction for using the ontology for automatic, rule-based personalized activity recommendation generation with SPARQL queries. The ontology has demonstrated an approach to annotating recommendation messages beyond the static verbatim form, describing their characteristics, metadata, and content information. The recommendation messages can be of two types: formal and informal (“To-Do”). Additionally, the rule base has helped to interpret the logic behind recommendation generation with logical (AND), (OR), and (NOT) operations. Additional file 3: Appendix A.2 describes a set of defined recommendation messages for ontology verification against the used datasets. Against each condition described in Additional file 3: Appendix A.3, the recommendation generation module executes a SPARQL query to verify the type of recommendation message to be delivered to individuals daily. This study divides its eight semantic rules into activity level classification (7) and satisfiability (1).

Semantic rules are created to describe the relationships and constraints between different concepts or entities within our eCoach knowledge representation prototype system. These rules facilitate the capture of the intended meaning and the intended behavior of data, respectively, and enable deduction and inference capabilities. Creating semantic rules is complicated and requires an understanding of the domain, the participants, and the desired meaning. Other methods, including collaboration with domain experts and utilizing existing ontologies or knowledge bases, have also been beneficial in the creation of rules.

Time-stamped measurable parameters related to the activities of specific participants are obtained using SPARQL queries at preference-based intervals. The rules (1–7) in Additional file 3: Appendix A.3 assign truth values to variables to ensure consistency. We verified with the ontology reasoner that the correct recommendation message is triggered for specific situations. However, it is essential to ensure that no variable pattern makes the entire ruleset unsatisfiable. We ensured that only one message would be activated at a time: we need a formal assurance that two "once a day" messages can never be activated concurrently and that the reasoner outputs a model for every possible variable combination. Suppose we substitute the different variables used in the first seven rules in Additional file 3: Appendix A.3 into the propositional variables (see Additional file 3: Appendix A.2); in that case, we will have an exponential number of "possible participants". Since two messages cannot be triggered concurrently to meet the exact requirements, we have added a rule (Rule-8) over the variables used in the propositions starting with "once a day". If Rule-8 is false, the entire ruleset (considered as a large conjunction) is set to false; there is then no model as output, and we are able to "debug" our rules if needed. If it is set to true, we have a formal assurance that, no matter which truth values we put into the rule base, two "once a day" messages will not be triggered simultaneously. All rule execution internally follows a binary tree structure where the non-leaf nodes hold the semantic rules (A | A ➔ B) to be executed, the leaf nodes hold the results (B, or recommendation messages), and the edges hold a decision statement (True or False). In this way, the satisfiability and understandability problems have been addressed in customized, evidence-based, and goal-oriented recommendation generation in ActieCoach. The proposed algorithm for hybrid recommendation generation is described as follows:
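To make the satisfiability argument concrete, the sketch below enumerates truth assignments over a tiny, hypothetical ruleset (not the study's Rules 1–8; variable and message names are illustrative) and checks the Rule-8-style guarantee that no consistent assignment triggers two "once a day" messages at the same time.

```python
from itertools import product

# Hypothetical propositional variables standing in for the ruleset's conditions.
VARS = ["below_step_goal", "met_step_goal", "high_sedentary_time"]

# Simplified rules of the form A -> B (message); not the study's actual Rules 1-7.
RULES = [
    (lambda v: v["below_step_goal"] and v["high_sedentary_time"], "once_a_day_move_more"),
    (lambda v: v["met_step_goal"], "once_a_day_well_done"),
]

def consistent(v) -> bool:
    # Domain constraint: a day cannot both meet and miss the step goal.
    return not (v["below_step_goal"] and v["met_step_goal"])

def single_daily_message(rules) -> bool:
    # Enumerate all consistent assignments and check the Rule-8-style guarantee:
    # no assignment may trigger two "once a day" messages simultaneously.
    for values in product([False, True], repeat=len(VARS)):
        v = dict(zip(VARS, values))
        if not consistent(v):
            continue
        fired = [msg for cond, msg in rules if cond(v)]
        if sum(m.startswith("once_a_day") for m in fired) > 1:
            return False
    return True

print("At most one 'once a day' message per assignment:", single_daily_message(RULES))
```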

Algorithm 1 A personalized hybrid activity recommendation generation method.

Datasets

We used statistical forecasting and ensemble classification algorithms to analyze activity datasets for adults (age group: 18–64). Activity data for the elderly, children, athletes, bodybuilders, and pregnant women are beyond the scope of this study. Our datasets are imbalanced.

PMData public datasets

We used the anonymous public PMData sports-logging dataset from n = 15 adults (male: 12; female: 3) for model training and testing. The activity data were collected with a Fitbit Versa 2 fitness smartwatch and the PMSys sports-logging smartphone application. A detailed description of the dataset is provided by Vajira et al. [44]. We received records spanning 114–152 days per participant, with a total volume of 2244 records. The dataset reveals multiple features related to physical activities, such as step count, sleep time, resting heart rate, type of exercise, restlessness, sedentary minutes, LPA, MPA, and VPA. We selected the “steps” meta-data file and excluded sleep-related features, as sleep monitoring is not in scope. We excluded participant P-12 from the analysis due to missing LPA information.

MOX2-5 private datasets

We collected anonymous activity data (539 records) from n = 16 adults (male: 12; female: 4) in Norway for 30–45 days using the MOX2-5 wearable activity sensor (CE certified) [45], following ethical guidelines and signed consent. Based on the Norwegian Centre for Research Data (NSD) approval, we collected and assessed personal data in this project in accordance with data protection legislation. The attributes of the MOX2-5 data are listed in Additional file 3: Appendix A.3, and the participants' characteristics are recorded in Additional file 3: Appendix A.4. Informed, signed consent was obtained from all participants. We have not disclosed any identifiable participant data in text, numbers, or figures. The relevant features obtained from the MOX2-5 sensor are timestamp, activity intensity (IMA), sedentary seconds, weight-bearing seconds, standing seconds, LPA seconds, MPA seconds, VPA seconds, and steps per minute. “Steps” and “IMA” are the most valuable and robust features of the MOX2-5 sensor-based datasets, as the other attributes (except the timestamp) are largely derived from them (e.g., LPA, MPA, and VPA are derived from IMA as defined in Table 7). IMA has a strong relation with steps, where steps are primarily involved as a measure of activity. In the MOX2-5 sensor, sedentary time refers to the non-activity duration, including leisure and sleep. The per-minute relation between sedentary and active (LPA/MPA/VPA) time can be written as:

Table 7 Relation between IMA and activity level classification
$$\text{sedentary}+\text{active}+\text{weight-bearing}+\text{standing}=60\;\text{seconds}$$
(1)

Methods

This section describes the adopted methodology for feature selection, data labeling for classification, data processing with prediction (regression) and classification models, and model evaluation. We have followed the Standards for Reporting Implementation Studies (StaRI) for this study (see Additional file 2: Appendix B). All methods have been carried out following the regulations and relevant guidelines in the “Ethical approval and consent to participate” section under Declarations.

Feature selection

For feature selection [46,47,48,49,50], we adopted well-established feature selection and feature ranking methods, such as SelectKBest, Recursive Feature Elimination (RFE), Principal Component Analysis (PCA), ExtraTreesClassifier, and correlation analysis. SelectKBest is a univariate feature selection and ranking method based on statistical testing (e.g., chi-squared) [46,47,48,49,50]. RFE selects optimal features and assigns a rank after recursively removing redundant features [46,47,48,49,50]. PCA is an unsupervised data reduction method that uses linear algebra to reduce data dimensions; it ranks features based on the explained variance ratio [46,47,48,49,50]. ExtraTreesClassifier is a bagging-based feature importance (or ranking) method [46,47,48,49,50]. Moreover, correlation analysis is a statistical method used to measure the strength of the linear relationship between two variables and compute their association [4]. A high correlation signifies a strong relationship between the two variables, and a low correlation means that the variables are weakly related [4]. The sample correlation coefficient (r) measures the closeness of association of the variables; "r" ranges from -1 to +1, where -1 indicates a perfectly linear negative (i.e., inverse) correlation (sloping downward) and +1 a perfectly linear positive correlation [4]. An "r" close to 0 suggests little, if any, correlation. Correlation methods are of the following two types [4]: (a) Pearson correlation, which evaluates the linear relationship between two continuous variables, and (b) Spearman correlation, which considers monotonic or non-Gaussian relationships. Our datasets showed a non-Gaussian distribution under normality testing. The correlation analysis [4] with the Spearman method revealed the strength of the relationship between features and helped to determine which features to retain. We considered removing features if they showed a strong dependency score (r ≥ 0.72). The Shapiro–Wilk normality test [4] revealed that both data samples did not look Gaussian. The normality test involved multiple univariate tests following the hypothesis testing method, with P-value > α = 0.05 (i.e., the sample looks Gaussian) and P-value < α = 0.05 (i.e., the sample does not look Gaussian), where "α" signifies the significance level. We set a rule to eliminate participants' data that span less than one month or are redundant, noisy, incomplete, or missing. For time-series forecasting, we considered univariate daily step counts from both datasets. Moreover, we used the forward and backward filling methods to handle missing data.
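A condensed sketch of these checks on a synthetic stand-in for the daily activity table (feature values and the label construction are illustrative only, not taken from PMData or MOX2-5) could look as follows, using scipy's Shapiro–Wilk test, pandas' Spearman correlation, and scikit-learn's SelectKBest:

```python
import numpy as np
import pandas as pd
from scipy.stats import shapiro
from sklearn.feature_selection import SelectKBest, chi2

rng = np.random.default_rng(0)
n = 200
# Synthetic stand-in for the daily activity table (values are illustrative only).
df = pd.DataFrame({
    "steps": rng.integers(1000, 15000, n),
    "sedentary": rng.integers(300, 900, n),
    "LPA": rng.integers(0, 180, n),
    "MPA": rng.integers(0, 90, n),
    "VPA": rng.integers(0, 40, n),
})
# Hypothetical activity-level labels (0-4) derived from steps for demonstration.
df["activity_level"] = pd.cut(df["steps"], bins=[0, 5000, 8000, 11000, 13000, 20000], labels=False)

features = ["steps", "sedentary", "LPA", "MPA", "VPA"]
X, y = df[features], df["activity_level"]

# Shapiro-Wilk normality test per feature (p < 0.05 -> the sample does not look Gaussian).
for col in features:
    _, p = shapiro(df[col])
    print(f"{col}: Shapiro-Wilk p={p:.4f}")

# Spearman correlation; a feature with |r| >= 0.72 against another feature is a removal candidate.
print(df[features].corr(method="spearman").round(2))

# Univariate feature ranking with SelectKBest (chi-squared requires non-negative inputs).
selector = SelectKBest(score_func=chi2, k=4).fit(X, y)
print(dict(zip(features, selector.scores_.round(1))))
```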

Combining features from datasets

First, we performed feature ranking and feature selection on the public Fitbit datasets based on the adopted correlation method and created an optimal feature set (FS-1). Second, we performed the same feature selection on the private MOX2-5 datasets and created an optimal feature set (FS-2). Then, we took the intersection of FS-1 and FS-2 to create a common feature space (the final feature set) to make transfer learning approaches relevant to this study (see Fig. 4).

Fig. 4 Combining features from both datasets to prepare the final feature set

Data labeling for classification

The “Activity Level” feature represents five classes – sedentary (0), low active (1), active (2), medium active (3), and highly active (4). The rule for “Activity Level” feature class creation is defined in Table 2. In the multi-feature-based classification problem, we derived the activity-level class based on the sedentary, LPA, MPA, and VPA features, following activity references for adults [10, 51, 52]. Features such as age, gender, and weight are not in the scope of this study. The class distributions in multi-class classification for both datasets are depicted in Figs. 5 and 6.

Fig. 5 Class distribution for the public PMData datasets

Fig. 6 Class distribution for the private MOX2-5 datasets

Data balancing for classification

Data balancing [53, 54] is crucial for addressing class imbalance and for making machine learning models impartial, legitimate, and powerful. It increases the performance of the model, averts bias, enhances generalizability, facilitates better learning of features, prevents overfitting, and increases the model's stability under concept change.

The Synthetic Minority Over-sampling Technique (SMOTE) [55] is a well-known and widely used method for generating synthetic samples to remove class imbalance. It does this by creating synthetic instances in the feature space of the minority class, interpolated between existing minority samples. It chooses a random minority sample and finds its k nearest minority neighbors, and then produces synthetic samples along the line segments that connect the chosen sample and its neighbors. SMOTE can increase the proportion of the minority class and decrease the class disparity. ADASYN [56] is a complement to SMOTE that addresses SMOTE's deficiency in dealing with data that have a more intricate class distribution.

ADASYN provides an adaptive method that considers the local density distribution of the minority class. This method is intended to better represent the distribution of underlying data and provide a more effective method of addressing class imbalance. In this study, we have used the ADASYN method for data balancing for predictive analysis.

Let “X” be the feature matrix of the original dataset with n samples and m features, and let “y” be the corresponding class labels. Let “X_minority” and “X_majority” represent the feature matrices of the minority and majority classes, respectively. The adopted steps for the ADASYN method are summarized as follows, with a brief usage sketch after the list:

  1. Calculate the imbalance ratio (IR) as the ratio of the number of majority samples to the number of minority samples:

     $$\mathrm{IR}=(\mathrm{number}\;\mathrm{of}\;\mathrm{majority}\;\mathrm{samples})/(\mathrm{number}\;\mathrm{of}\;\mathrm{minority}\;\mathrm{samples})$$

  2. Calculate the total number of synthetic samples to generate based on the IR:

     $$\mathrm{N}\_\mathrm{synthetic}=\mathrm{IR}\ast(\mathrm{number}\;\mathrm{of}\;\mathrm{minority}\;\mathrm{samples})$$

  3. For each minority sample xi in X_minority, calculate the Euclidean distance (dist) between xi and its k nearest neighbors in the feature space. The value of k is a user-defined parameter.

  4. Calculate the relative contribution (rc) of each minority sample based on the normalized distance:

     $${\mathrm{rc}(\mathrm{x}}_{\mathrm{i}})={\mathrm{dist}(\mathrm{x}}_{\mathrm{i}})/\sum {\mathrm{dist}(\mathrm{x}}_{\mathrm{i}})$$

  5. Calculate the density distribution (dd) for each minority sample:

     $$\mathrm{dd}\left({\mathrm{x}}_{\mathrm{i}}\right)=\mathrm{rc}\left({\mathrm{x}}_{\mathrm{i}}\right)/\sum \mathrm{rc}\left({\mathrm{x}}_{\mathrm{i}}\right)$$

  6. Calculate the number of synthetic samples to generate for each minority sample based on the density distribution:

     $$\mathrm{N}\_\mathrm{synthetic}\_\mathrm{i}=\mathrm{dd}({\mathrm{x}}_{\mathrm{i}})\ast\mathrm{N}\_\mathrm{synthetic}$$

  7. For each minority sample xi, generate N_synthetic_i synthetic samples by randomly interpolating between xi and its k nearest neighbors.

  8. Combine the original dataset X with the synthetic samples to form the augmented dataset X_augmented.

  9. Apply a machine learning algorithm to the augmented dataset X_augmented for classification.
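Following the steps above, the resampling itself can be delegated to the imbalanced-learn implementation of ADASYN; the sketch below applies it to a synthetic five-class dataset whose class proportions are illustrative, not those of PMData or MOX2-5.

```python
from collections import Counter

from imblearn.over_sampling import ADASYN
from sklearn.datasets import make_classification

# Synthetic imbalanced stand-in for the five activity-level classes (0-4).
X, y = make_classification(n_samples=1000, n_features=5, n_informative=4,
                           n_redundant=0, n_classes=5,
                           weights=[0.55, 0.2, 0.12, 0.08, 0.05],
                           n_clusters_per_class=1, random_state=42)
print("Before:", Counter(y))

# ADASYN oversamples minority classes adaptively, based on local density (k nearest neighbours).
X_bal, y_bal = ADASYN(n_neighbors=5, random_state=42).fit_resample(X, y)
print("After: ", Counter(y_bal))
```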

Autoregression model with residual error minimization

Autoregression [57] is a time-series model that utilizes temporal patterns as input to a regression equation to forecast the value at the next time step. It is a straightforward idea that can make accurate predictions for many time-series problems. Linear regression models output values based on linear combinations of input values [57]. This technique can be applied to time series where the input variables are observations at earlier time steps, so-called lagged variables [57]. In autoregression, the regression model uses data from the same input variable at earlier time steps [57]. To predict the next time step (t + 1) based on the last two observations (t - 1) and (t - 2), an autoregression model can be expressed as:

$$\mathrm{X}(\mathrm{t}+1)={\mathrm{b}}_{0}+{\mathrm{b}}_{1}*\mathrm{X}(\mathrm{t}-1)+{\mathrm{b}}_{2}*\mathrm{X}(\mathrm{t}-2)$$
(2)

A residual error (RE) [58] is the difference between the expected and predicted values. REs form a temporal sequence of information that we can model to correct existing time-series predictions [58]. Any temporal structure in the residual forecast-error time series can be used as a diagnostic because it implies information that could be included in the forecast model; an ideal model leaves no structure in the residuals. A simple and effective residual model is autoregression, where some lagged error values are used to predict the error at the subsequent time step. Such lagged errors are combined in a linear regression model, like the AR model used for the direct time-series analysis. Autoregression of the residual time series is called a moving average (MA) model. We created an autoregression model of the RE time series in a linear form. The linear model yields a weighted linear summation of lagged RE terms, which can be expressed as:

$$\mathrm{Error}(\mathrm{t}+1)={\mathrm{b}}_{0}+{\mathrm{b}}_{1}*\mathrm{Error}(\mathrm{t}-1)+{\mathrm{b}}_{2}*\mathrm{Error}(\mathrm{t}-2)+\dots+{\mathrm{b}}_{\mathrm{n}}*\mathrm{Error}(\mathrm{t}-\mathrm{n})$$
(3)

Our activity data record the number of steps per minute; we converted them to daily step counts for daily step forecasting. Time-series data [59, 60] are strictly sequential but highly prone to non-stationarity, autocorrelation, trend, and seasonality. We used the Augmented Dickey-Fuller hypothesis test [59, 60] with autolag = ‘AIC’ and regression = ‘CT/C’ to verify the stationarity of the time-series data. We used seasonal decomposition with model = ‘additive’ or ‘multiplicative’ to analyze the data's trend, seasonality, and residual components. We converted non-stationary data to stationary data with different transform methods. We used an autocorrelation function (ACF) 2D plot to observe the lag value (X-axis) against the correlation (Y-axis) between -1 and 1, and a partial autocorrelation function (PACF) plot with a limited lag value (e.g., 25, 50). The ACF and PACF were useful for parameter selection in the time-series forecasting models. We used the autoregression model integrated with the residual error minimization technique to forecast time-series step data. We used a lag value of 50 for the PMData datasets and 14 for the MOX2-5 datasets, and an autoregression window length of five for both datasets. The steps for autoregression with residual error minimization to improve univariate time-series forecasting are described in Table 8, followed by a minimal code sketch.

Table 8 Steps for residual error minimization in univariate time-series forecasting
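A minimal sketch of this correction using statsmodels' AutoReg on a synthetic daily-step series is given below; the series, the lag of 5, and the 7-day horizon are illustrative rather than the tuned values reported in Table 8.

```python
import numpy as np
from statsmodels.tsa.ar_model import AutoReg

rng = np.random.default_rng(1)
# Synthetic daily step series with a weekly pattern (stand-in for one participant).
days = np.arange(120)
steps = 8000 + 1500 * np.sin(2 * np.pi * days / 7) + rng.normal(0, 800, days.size)

train, test = steps[:-7], steps[-7:]

# 1) Base autoregression forecast for the next 7 days.
ar = AutoReg(train, lags=5).fit()
base_forecast = ar.predict(start=len(train), end=len(train) + 6)

# 2) Model the in-sample residual errors with a second autoregression.
residuals = ar.resid
ar_err = AutoReg(residuals, lags=5).fit()
error_forecast = ar_err.predict(start=len(residuals), end=len(residuals) + 6)

# 3) Corrected forecast = base forecast + predicted residual error.
corrected = base_forecast + error_forecast

def rmse(a, b):
    return float(np.sqrt(np.mean((a - b) ** 2)))

print("RMSE without correction:", round(rmse(test, base_forecast), 1))
print("RMSE with correction:   ", round(rmse(test, corrected), 1))
```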

Ensemble classification algorithms

This study performed multi-class classification with standard ensemble machine learning models [61,62,63], such as bagging, boosting, and voting, followed by empirical comparison testing. We used ensemble classifiers instead of deep learning classifiers for the following reasons: the convex optimization technique in gradient descent to find global minima, small amounts of training data, shorter model training time, training on a central processing unit (CPU), computational inexpensiveness in terms of time and space, and transparency.

Regularizations, such as the L1-norm and L2-norm, have not been added to the models because of the limited set of features. Ensembles [61, 62] can boost classification results by combining several supervised models based on approaches such as parallel ensembles (bagging), sequential ensembles (boosting), and voting. Bagging is a bootstrap aggregation method where homogeneous weak learners are combined in parallel with a deterministic averaging process to improve prediction accuracy by decreasing model variance. Random Forest (RF), the Bagging Classifier, and ExtraTreesClassifier are examples of bagging ensembles and are used here for comparative performance analysis. Boosting is an incremental method in which multiple weak learners are combined sequentially over numerous iterations using a deterministic strategy to build a strong ensemble. AdaBoost (ADA) and the Gradient Boosting Classifier (GB) are examples of boosting ensembles and are used here for performance comparison. Voting combines predictions based on standard statistical measures, such as the mean or median, to decide the final prediction.

To make better use of the data, we initially shuffled the dataset and then split it into training and testing sets with a fixed random state. To boost the performance of the machine learning models, we used k-fold cross-validation with k ≥ 5. We used the Grid Search [4] hyperparameter optimization technique for model tuning, including the appropriate selection of the learning rate (alpha (α)) in the gradient descent algorithm and the proper selection of other components, such as the criterion and max_depth, which are important for tree-based models. We executed each ensemble machine learning classification model five times and calculated the mean performance score for comparison. The steps for classification are stated in Table 9.

Table 9 Steps for daily activity level classification

Additionally, we used a DummyClassifier as a simple baseline against which to compare the other, more complex classifiers; it makes predictions that ignore the input features, and we set its strategy parameter to “most_frequent”. First, we performed predictive analysis on the imbalanced physical activity datasets; then, we carried out the same experiment on the balanced datasets to verify model biases. Stratification techniques helped to ensure that the distribution of target classes was preserved in the resulting subsets.
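A compact sketch of this classification workflow under the stated settings (shuffled stratified split, 5-fold cross-validation, Grid Search over GB hyperparameters, and a DummyClassifier baseline) is given below; the synthetic data and the parameter grid values stand in for the study's actual feature tables and tuning ranges.

```python
from sklearn.datasets import make_classification
from sklearn.dummy import DummyClassifier
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import GridSearchCV, cross_val_score, train_test_split

# Synthetic five-class stand-in for the balanced activity features.
X, y = make_classification(n_samples=1000, n_features=5, n_informative=4,
                           n_redundant=0, n_classes=5, n_clusters_per_class=1,
                           random_state=42)

# Shuffled, stratified train/test split with a fixed random state.
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, shuffle=True,
                                          stratify=y, random_state=42)

# Baseline that ignores the input features.
baseline = DummyClassifier(strategy="most_frequent").fit(X_tr, y_tr)
print("Dummy baseline accuracy:", round(baseline.score(X_te, y_te), 3))

# Grid Search over tree-specific GB hyperparameters with 5-fold cross-validation.
param_grid = {"learning_rate": [0.05, 0.1], "max_depth": [3, 5], "n_estimators": [100, 200]}
grid = GridSearchCV(GradientBoostingClassifier(random_state=42), param_grid, cv=5)
grid.fit(X_tr, y_tr)
print("Best GB parameters:", grid.best_params_)

# Mean cross-validated accuracy of the tuned model and its held-out test accuracy.
scores = cross_val_score(grid.best_estimator_, X_tr, y_tr, cv=5)
print("Mean CV accuracy:", round(scores.mean(), 3), "Test accuracy:", round(grid.score(X_te, y_te), 3))
```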

Evaluation metrics

The performance of the ML-based classification models has been evaluated with discrimination analysis. Discrimination analysis metrics are precision, recall, specificity, accuracy score, F1 score, the classification report, and the confusion matrix [4, 61, 62]. A confusion matrix is a 2-dimensional table (“actual” vs. “predicted”) whose cells count “True Positives (TP)”, “False Positives (FP)”, “True Negatives (TN)”, and “False Negatives (FN)”. The equations to calculate the classification metrics are:

$$\mathrm{Accuracy }\left(\mathrm{A}\right)=\frac{\left(\mathrm{TP}+\mathrm{TN}\right)}{\left(\mathrm{TP}+\mathrm{FP}+\mathrm{FN}+\mathrm{TN}\right)}, 0\le \frac{(\mathrm{A}) }{(100)}\le 1$$
(4)
$$\mathrm{Precision }(\mathrm{P})=\frac{(\mathrm{TP}) }{(\mathrm{TP}+\mathrm{FP})}$$
(5)
$$\mathrm{Recall}(\mathrm R)\;\mathrm{or}\;\mathrm{Sensitivity}(\mathrm S)\;\mathrm{or}\;\mathrm{True}\;\mathrm{positive}\;\mathrm{rate}=\frac{(\mathrm{TP})}{(\mathrm{TP}+\mathrm{FN})}$$
(6)
$$\mathrm{Specificity }\left(\mathrm{S}\right) = \frac{(\mathrm{TN}) }{(\mathrm{TN}+\mathrm{FP})}$$
(7)
$$\mathrm{F}1\mathrm{ score }\left(\mathrm{F}1\right)=\frac{\left(2*\mathrm{P}*\mathrm{R}\right)}{\left(\mathrm{P}+\mathrm{R}\right)}, 0\le \frac{(\mathrm{F}1) }{(100)}\le 1$$
(8)

Accuracy tells how close a measured value is to the actual one. Recall (or sensitivity) indicates the proportion of actual positives that are correctly identified, and precision indicates the proportion of predicted positives that are actually positive. Furthermore, we used cross-validation scores to detect overfitting and underfitting, a validation curve to assess bias vs. variance, and a learning curve to visualize the convergence of the training score with the cross-validation score. Bias is an error due to erroneous assumptions in the learning algorithm, and variance is an error from sensitivity to small fluctuations in the training set. High bias leads to underfitting, and high variance results in overfitting. Accuracy and F1 scores can be misleading because they do not fully account for the sizes of the four categories of the confusion matrix in the final score calculation. MCC is more informative than the F1 score and accuracy because it considers the balanced ratios of the four confusion matrix categories (true positives, true negatives, false positives, and false negatives). The F1 score depends on which class is defined as the positive class, whereas MCC does not, which gives it an advantage over the F1 score and avoids the risk of incorrectly defining the positive class [64,65,66]. The MCC can be represented as:

$$\mathrm{Matthews}\;\mathrm{correlation}\;\mathrm{coefficient}\;\left(\mathrm{MCC}\right)=\frac{\mathrm{TP}\ast\mathrm{TN}-\mathrm{FP}\ast\mathrm{FN}}{\sqrt{\left(\mathrm{TP}+\mathrm{FP}\right)\left(\mathrm{TP}+\mathrm{FN}\right)\left(\mathrm{TN}+\mathrm{FP}\right)\left(\mathrm{TN}+\mathrm{FN}\right)}},\;-1\leq\frac{(\mathrm{MCC})}{(100)}\leq+1$$
(9)
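All of the listed discrimination metrics are available directly in scikit-learn; a short sketch with illustrative true and predicted activity-level labels (not the study's results) follows.

```python
import numpy as np
from sklearn.metrics import (accuracy_score, classification_report,
                             confusion_matrix, f1_score, matthews_corrcoef)

# Illustrative true vs. predicted activity-level labels (0-4).
y_true = np.array([0, 0, 1, 2, 2, 3, 4, 4, 1, 0])
y_pred = np.array([0, 1, 1, 2, 2, 3, 4, 3, 1, 0])

print("Accuracy:", accuracy_score(y_true, y_pred))
print("Macro F1:", round(f1_score(y_true, y_pred, average="macro"), 3))
print("MCC:", round(matthews_corrcoef(y_true, y_pred), 3))  # multi-class generalization of Eq. (9)
print(confusion_matrix(y_true, y_pred))
print(classification_report(y_true, y_pred, zero_division=0))
```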

The performance of each time-series forecasting model has been evaluated with the root mean squared error (RMSE). The mean squared error (MSE) informs how close the regression line is to a set of points: it calculates the “errors” from the points to the regression line and squares them to eliminate negative signs. The square root of the MSE is the RMSE, which gives more weight to large differences without bias [4, 60]. We used other metrics such as forecast bias (FB), RSD, and model execution time (in seconds). FB can be positive or negative: a non-zero mean prediction error implies a tendency of the model to predict too high (negative error) or too low (positive error), and thus the mean forecast error is called FB. If the forecast error is 0, the forecast has no error for that prediction. The model overpredicts if FB < 0 and is unbiased if FB ≈ 0 [60].

Our proposed ontology model has been evaluated against reasoning time and SPARQL query execution time [5]. Protégé provides a list of reasoners, such as HermiT, Pellet, FaCT++, RacerPro, and KAON2, to check the logical and structural consistency of an ontology [5, 43]. We compared the mean reasoning times and selected the best reasoner for our ontology. We captured the SPARQL query execution time in Protégé. Furthermore, we cross-verified the execution time of the ontology in the Jena Fuseki server [5].

Probabilistic interval prediction

In predictive inference, a prediction interval is an estimate of an interval that will contain future observations with some probability, based on what has already been observed. Prediction intervals are often used in prediction analysis. In this study, we have applied the concept to step forecasting. The prediction interval, which gives the margin needed to maintain a specific coverage probability, can be written as [67, 68]:

$${\mathrm{Y}}_{\mathrm{T}+\mathrm{h}|\mathrm{T}}\pm \mathrm{c}\ast{\sigma}_{\mathrm{h}}$$
(10)

“c” changes with the coverage probability; for one-step interval prediction, “c” is 1.28 for an 80% prediction interval when the forecast errors are normally distributed. “σh” is the estimate of the residual standard deviation (RSD) in the h-step forecast distribution (h > 0). The RSD statistically describes the difference between the standard deviation of the observed values and the standard deviation of the estimated values. We used the Naïve forecast method to statistically derive “σh”.
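A minimal sketch of this Naïve-based 80% interval around a point forecast, per Eq. (10), is shown below; the observed steps and the point forecast are illustrative, and for simplicity the one-step σh is applied to each of the seven days.

```python
import numpy as np

rng = np.random.default_rng(2)
# Illustrative observed daily steps and a 7-day point forecast from the AR model.
observed = 8000 + rng.normal(0, 2500, 60)
point_forecast = np.full(7, 8200.0)

# Naive forecast: tomorrow equals today; its residual standard deviation estimates sigma_h (h = 1).
naive_residuals = observed[1:] - observed[:-1]
sigma_h = naive_residuals.std(ddof=1)

c = 1.28  # 80% prediction interval under normally distributed forecast errors
lower, upper = point_forecast - c * sigma_h, point_forecast + c * sigma_h
for day, (lo, hi) in enumerate(zip(lower, upper), start=1):
    print(f"Day {day}: {lo:.0f} - {hi:.0f} steps")
```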

Validation study

The validation part explains the performances of the best-performing classification models on both the balanced and imbalanced datasets and the verification of the intended personalized recommendation generation and its appropriate visualization.

Verification of the classifiers

To verify the performance of the classifiers on the public Fitbit and private MOX2-5 datasets, we used the following two visualization approaches: (a) Validation curve: an essential diagnostic tool that shows the sensitivity between changes in the accuracy of an ML model and changes in a specific model parameter. A validation curve is usually drawn between some model parameter and the model's score, and it contains two curves: one for the training-set score and one for the cross-validation score. Validation curves evaluate existing models based on their hyperparameters. (b) Learning curve: it shows the estimator's validation and training scores for different numbers of training samples. It is a tool to see how much we benefit from adding more training data and whether the estimator suffers more from variance or bias errors.
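Both diagnostics are provided by scikit-learn; the brief sketch below uses the tuned Gradient Boosting configuration on synthetic stand-in data, with an illustrative parameter range and sample sizes.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import learning_curve, validation_curve

X, y = make_classification(n_samples=600, n_features=5, n_informative=4, n_redundant=0,
                           n_classes=5, n_clusters_per_class=1, random_state=42)
model = GradientBoostingClassifier(learning_rate=0.1, max_depth=3, n_estimators=100,
                                   random_state=42)

# Validation curve: training vs. cross-validation score while one hyperparameter varies.
depths = [1, 2, 3, 4, 5]
train_sc, val_sc = validation_curve(model, X, y, param_name="max_depth",
                                    param_range=depths, cv=5)
for d, tr, va in zip(depths, train_sc.mean(axis=1), val_sc.mean(axis=1)):
    print(f"max_depth={d}: train={tr:.3f} cv={va:.3f}")

# Learning curve: how the scores evolve as more training samples are added.
sizes, train_sc, val_sc = learning_curve(model, X, y, cv=5,
                                         train_sizes=np.linspace(0.2, 1.0, 5))
for n, tr, va in zip(sizes, train_sc.mean(axis=1), val_sc.mean(axis=1)):
    print(f"n={n}: train={tr:.3f} cv={va:.3f}")
```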

Verification of personalized recommendation generation and visualization

For personalized recommendation generation in the eCoach prototype system, we have maintained individual preferences to understand personal interests (e.g., Table 6). Preference data are stored in the KB. Participants can view and update their preference data in the eCoach mobile app. To determine the weekly score of personal goal achievement, we have summed up the daily activity score, and the measure of the daily activity score is mentioned in Table 2.

Results

Experimental setup

We used Python 3.8.5 libraries, such as pandas (v. 1.1.3), NumPy (v. 1.21.2), SciPy (v. 1.5.2), Matplotlib (v. 3.3.2), Seaborn (v. 0.11.0), Plotly (v. 5.2.1), scikit-learn (v. 0.24.2), Statsmodels (v. 0.13.2), and Graphviz (v. 2.49.1) to process data and build the AI models. We set up the Python environment on the Windows 10 operating system using the Anaconda distribution and used Jupyter Notebook v. 6.4.5 for development, model analysis, and data visualization. The target system has 16 GB of RAM and a 64-bit architecture. We used the Protégé 5.x open-source editor for ontology design, implementation, SPARQL query processing, and visualization.

Experimental results

The correlation matrices of the selected features from the PMData and MOX2-5 datasets are captured in Figs. 7 and 8. “Step” has been found to be an essential feature in both activity datasets. We prepared our final feature set based on the feature correlation scores. The feature “calorie” is not related to the context of this study. Therefore, the final feature set can be written as:

Fig. 7 Correlation matrix for the public PMData datasets

Fig. 8 Correlation matrix for the private MOX2-5 datasets

$$\mathrm{Final}\;\mathrm{feature}-\mathrm{set}=(\mathrm{FS}-1\bigcap\mathrm{FS}-2)=\{\mathrm{sedentary},\;\mathrm{LPA},\;\mathrm{MPA},\;\mathrm{VPA},\;\mathrm{steps}\}$$
(11)

The performance of standard ensemble models depends on the nature of the datasets used for a particular case study under a defined setting. Therefore, we tried different standard ensemble classifiers with settings tuned using the Grid Search method. The comparative analyses of the considered ensemble classifiers for both datasets are captured in Tables 10 and 11. The GB classifier outperformed the other classifiers on both datasets in classifying daily human activity levels into the five activity-level classes; it thus succeeded in combining weak learners to deliver improved classification accuracy. The resulting confusion matrices for both datasets with the GB classifier are depicted in Figs. 9 and 10. {'learning_rate': 0.1, 'max_depth': 3, 'n_estimators': 100} were the best tree-specific hyperparameters for the GB classifier, as obtained with the Grid Search, for both the PMData and MOX2-5 datasets. The training vs. testing, validation, and learning curves for the Gradient Boosting classifier on both datasets are depicted in Additional file 3: Appendix A.6 – Appendix A.8.

Table 10 Performance of the machine learning classifiers for public original Fitbit datasets
Table 11 Performance of the machine learning classifiers for private original MOX2-5 datasets
Fig. 9 Confusion matrix for the public original PMData datasets

Fig. 10 Confusion matrix for the private original MOX2-5 datasets

We repeated the predictive analysis on the PMData and MOX2-5 datasets using the ensemble classifiers after balancing their class distributions with the well-known ADASYN sampling algorithm, which synthesizes new minority samples according to their local density, to verify whether data balancing improves model performance and fairness. ADASYN augmented PMData by 76.6% (3964 − 2244 = 1720 samples) and MOX2-5 by 79.9% (970 − 539 = 431 samples). The resulting confusion matrices for the GB classifier on both balanced datasets are depicted in Figs. 11 and 12. Grid search again returned {'learning_rate': 0.1, 'max_depth': 3, 'n_estimators': 100} as the best tree-specific hyperparameters for the GB classifier on both the balanced PMData and balanced MOX2-5 datasets. The comparative analyses of the considered ensemble classifiers for both balanced datasets are captured in Tables 12 and 13.
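The balancing step can be reproduced with the imbalanced-learn implementation of ADASYN, as sketched below; `X` and `y` denote the original (imbalanced) feature matrix and activity-level labels.

```python
# Sketch of the ADASYN oversampling step, assuming the imbalanced-learn
# package is installed; X and y are the five-class features/labels before balancing.
from collections import Counter
from imblearn.over_sampling import ADASYN

print("before:", Counter(y))
X_bal, y_bal = ADASYN(random_state=42).fit_resample(X, y)
print("after: ", Counter(y_bal))   # minority classes are augmented according
                                   # to their local density of minority samples
```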

Fig. 11 Confusion matrix for the public balanced PMData datasets

Fig. 12 Confusion matrix for the private balanced MOX2-5 datasets

Table 12 Performance of the machine learning classifiers for public balanced Fitbit datasets
Table 13 Performance of the machine learning classifiers for private balanced MOX2-5 datasets

Tables 14 and 15 show the performance comparison between the autoregression model with the residual error minimization technique and the traditional autoregression model without it. According to the prediction outcomes, the residual error minimization technique has improved model performance. As an example, the ACF and PCF plots and the residual-error-minimized autoregression predictions for participant 1 (P-1) from the PMData datasets are depicted in Additional file 3: Appendix A.9 and Appendix A.10.
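The sketch below illustrates one plausible reading of this approach for a single participant's daily step series: fit an autoregression, model its residuals with a second autoregression, and add the predicted residual back to the point forecast. The lag order and the two-stage residual correction are assumptions for illustration; the study's exact REM procedure may differ.

```python
# Hedged sketch of autoregression with residual error correction, assuming
# 'steps' is a sequence of daily step counts for one participant with at
# least a few weeks of history.
import numpy as np
from statsmodels.tsa.ar_model import AutoReg

y = np.asarray(steps, dtype=float)
train, test = y[:-7], y[-7:]

# First-stage autoregression on the step series (7-day lag, illustrative).
ar = AutoReg(train, lags=7).fit()
base_fc = ar.predict(start=len(train), end=len(train) + 6)

# Second-stage autoregression on the first-stage in-sample residuals.
resid = ar.resid
resid_ar = AutoReg(resid, lags=7).fit()
resid_fc = resid_ar.predict(start=len(resid), end=len(resid) + 6)

corrected_fc = base_fc + resid_fc   # residual-corrected 7-day step forecast
print(np.round(corrected_fc))
```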

Table 14 Mean prediction performance on original PMData datasets
Table 15 Mean prediction performance on original MOX2-5 datasets

Interval prediction can be a meaningful representation of temporal daily step counts. Therefore, we have used autoregression with residual error minimization to forecast the next seven days of step counts for participants in the PMData datasets based on their temporal step data. As an example, we have calculated RSD ≈ 2600.0 for the step data of P-1. Using the informed probabilistic Naïve interval prediction method, we have shown how to calculate a 1-step interval prediction of activity steps on top of the point prediction (see Table 16). An approach to presenting the interval step prediction in the ActieCoach mobile app, to motivate individuals to monitor their personal activity and reach activity goals, has been implemented and is shown in Fig. 13. A similar process can be applied to the other participants in the PMData and MOX2-5 datasets.
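Continuing the previous sketch, a Naïve-style prediction interval can be placed around the point forecasts using the residual standard deviation, following the usual rule that the interval width grows with the forecast horizon [67]. The 95% level and the RSD ≈ 2600 value for P-1 are used for illustration; the study's exact interval rule may differ.

```python
# Sketch of a Naive-style interval around the point forecast, assuming
# 'corrected_fc' holds the 7-day point forecasts from the previous sketch
# and RSD is the residual standard deviation (≈ 2600 for P-1).
import numpy as np

RSD = 2600.0
z = 1.96                                      # 95% interval
h = np.arange(1, len(corrected_fc) + 1)       # forecast horizon (days ahead)

half_width = z * RSD * np.sqrt(h)             # width grows with the horizon
lower = np.maximum(corrected_fc - half_width, 0)   # step counts cannot be negative
upper = corrected_fc + half_width

for day, (lo, pt, up) in enumerate(zip(lower, corrected_fc, upper), start=1):
    print(f"day +{day}: {lo:.0f} .. {pt:.0f} .. {up:.0f}")
```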

Table 16 Step and Interval prediction for Week-X for P-1 in PMData datasets
Fig. 13 Visualization of daily step count, target step count, and predicted interval

We have used the “OWL_MEM_MICRO_RULE_INF” specification (OWL Full) to read the ontology in Jena in Turtle (“TTL”) format, with an approximate reading time of 1.0–1.5 s. Moreover, we used in-memory storage, the optimized rule-based OWL reasoner, and the Jena framework to query the ontology class, ontology, predicate, subject, and object of each statement in < 1.0 s, < 2.0 s, and < 2.0 s, respectively. We have implemented the RDF interface provided by Jena to persist the modeled ontology and its instances in the TDB and load them back for further processing. The reasoning time of the ontology has been evaluated against the following reasoners: HermiT, Pellet, FaCT++, RacerPro, and KAON2; the corresponding reasoning times are captured in Table 17. The HermiT reasoner has performed the best.

Table 17 Performance analysis of different reasoners available in Protégé

Discussion

The discussion section covers the key findings of this study, the conceptual implementation of the proposed algorithm for recommendation generation, the relevance of this study to a sustainable society, and an exploration of its limitations and potential future directions.

Key findings

In Fig. 1, the TDB database functioned as the KB. All the messages described in Additional file 3: Appendix A.2 were saved in the KB. The recommendation generation module (see Fig. 1) retrieved these messages during tailored recommendation generation based on a SPARQL query execution plan, followed by applying the rules in Additional file 3: Appendix A.3; the rules are also kept in the KB. Both the asserted and the inferred information obtained from the reasoning process proved effective in determining the most appropriate recommendation message. We generated personalized activity recommendations according to the semantic rules to improve individual physical activity levels and meet activity goals. We executed the semantic rules and used the Jena ARQ engine to run the relevant SPARQL queries on the applied datasets. Several example SPARQL queries are provided in Additional file 3: Appendix A.11; their results need to be combined to create personalized recommendations that meet the eCoaching requirements.
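For illustration only, the sketch below shows the same kind of SPARQL retrieval in Python with rdflib (the study itself uses the Jena ARQ engine in Java); the ontology IRI, class, and property names are hypothetical stand-ins for the schema in Appendix A.11.

```python
# Illustrative rdflib equivalent of the Jena-based SPARQL retrieval.
# The file name, namespace, classes, and properties below are hypothetical.
from rdflib import Graph

g = Graph()
g.parse("activity_ecoach.ttl", format="turtle")   # ontology + asserted/inferred facts

query = """
PREFIX eco: <http://example.org/ecoach#>
SELECT ?participant ?message
WHERE {
    ?participant a eco:Participant ;
                 eco:hasWeeklyGoalStatus eco:NotAchieved ;
                 eco:hasRecommendation ?rec .
    ?rec eco:hasMessageText ?message .
}
"""
for row in g.query(query):
    print(row.participant, "->", row.message)
```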

Furthermore, this sub-section describes the overall process of daily and weekly activity-performance score determination, goal verification, recommendation generation, and its visualization in the eCoach app as a push notification. To verify the personalized recommendation generation on participant data (i.e., MOX2-5 data), we have divided each participant’s activity days (n) into two parts: (a) an (n − 7)-day window for training the best-performing ensemble classifier model, and (b) the remaining seven-day window for testing with an incremental learning approach. The incremental learning approach supports activity classification on day (n + 1) based on model training with personal activity data up to day n. We repeated the same incremental process until the goal period was completed (here, we assumed a window of 7 days); a minimal sketch of this walk-forward procedure follows this paragraph. Moreover, we have used three standard emojis in the recommendation visualization to motivate participants based on their weekly goal accomplishment: well done or good work (😊), up-to-the-mark or satisfactory performance (😐), and improved performance ().
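The walk-forward evaluation described above can be sketched as follows; `X_days` and `y_days` are assumed to be day-ordered NumPy arrays of features and activity-level labels for one participant, and the GB hyperparameters are the ones reported earlier.

```python
# Sketch of the incremental (walk-forward) evaluation for the last seven
# days of a participant's data: retrain on all days up to day n, classify
# day n+1, then slide forward.
from sklearn.ensemble import GradientBoostingClassifier

n = len(y_days)
predictions = []
for day in range(n - 7, n):                 # the final 7-day test window
    clf = GradientBoostingClassifier(
        learning_rate=0.1, max_depth=3, n_estimators=100, random_state=42)
    clf.fit(X_days[:day], y_days[:day])     # train on all data up to this day
    predictions.append(int(clf.predict(X_days[day:day + 1])[0]))

print(predictions)                           # predicted activity level per test day
```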

For example, in the last week, participant P-1 from the MOX2-5 datasets (see Additional file 2: Appendix B for the dataset) failed to achieve WHO’s generic activity goal of staying medium active for a week. Therefore, based on the semantic rules, he received recommendation message A-17. Based on the step forecast produced by our proposed autoregression with residual error minimization model, P-1 received recommendation message A-13 for the following week. We have shown the overall daily and weekly recommendation generation process and its meaningful presentation for a single participant (P-1) from the private MOX2-5 datasets collected for this study (see Table 18). A sample recommendation generation screen is captured in Fig. 14. The same approach can be applied to the other participants (P-2 to P-16) in the MOX2-5 datasets.

Table 18 Activity classification and personalized recommendation generation for P-1 in the MOX2-5 dataset
Fig. 14 Weekly recommendation generation as a text (e.g., push notification) for P-1

The classification and forecasting results on both datasets confirm that we have successfully designed and developed the AI models in ActieCoach for time-series prediction and activity-level classification. According to the results, a larger volume of MOX2-5 data will further improve individual model stability and learning. In both datasets, the model loss converges for training and testing. Combining future step prediction for individuals with statistical approaches (e.g., weekly activity-weighted means and/or standard deviation calculation) can be a useful direction for generating personalized recommendations.

The average SPARQL query execution time has been captured as between 0.1 and 0.3 s. The semantic rules described in Additional file 3: Appendix A.3 represent the logic behind personalized recommendation message generation. The rule-based binary reasoning (if ➔ 1, else ➔ 0) helps to explain the formation of a personal activity recommendation message. A completely data-driven approach to personalized recommendation generation in healthcare is risky because of false-positive scenarios. Therefore, prediction modeling followed by an annotated ruleset can produce more value for personalized health recommendations.

Overall, this study focuses rigorously on automating personalized activity recommendation generation with AI, personal preference information, and an adjustable rule base, and on their integration with a semantic network for reasoning and meaningful querying of personalized recommendations. Personalization is important in health recommendations to capture the user context and perspective. Therefore, health recommendation algorithms are contextually different from the traditional user- or item-based recommendation algorithms that are well accepted in commercial domains. In Table 16, we have demonstrated empirically the efficiency of applying our proposed hybrid recommendation algorithm in activity eCoaching. The Daily achieved score predicted column in Table 16 explains the reason behind the recommendation generation in the Propositional variable column. To the best of our knowledge, no comparable study exists in the eCoaching literature; therefore, we have restricted Table 1 to a qualitative comparison instead of an empirical comparison.

Relevance

Our proposed physical activity eCoaching is intended to facilitate long-term behavior change and a reduction of sedentary lifestyles. By constantly providing healthy lifestyle assistance, self-management instructions for healthy lifestyle goal management, and individualized counseling, eCoaches facilitate the development of healthy behaviors that can be maintained after a specific program or intervention. Overall, activity eCoaching can contribute significantly to the United Nations' Sustainable Development Goal (SDG) 3 [69], which is concerned with ensuring healthy lifestyles and promoting well-being across all ages. Through technology, personalization, and behavioral change, our activity eCoaching may align with the objectives of SDG-3 by promoting physical fitness, preventing diseases, empowering individuals, and contributing to the overall goal of healthy lives and enhanced well-being for all.

Limitations and future scope

The datasets used in this study are small and may therefore be biased; high bias leads to model underfitting. Therefore, we have used the MCC metric to better understand the ensemble models' performance. However, more data are required to train and test the classifier models. Data balancing is a popular technique in predictive analysis for verifying bias and fairness in classification models. Thus, we have restricted our experimental analysis to imbalanced and balanced datasets for classification (predictive analysis) rather than regression analysis. The comparative predictive analysis (Tables 10, 11, 12, 13 and 14) of both the imbalanced and balanced datasets reveals that the models are not biased and that data balancing can improve model performance. Our implemented approach, namely the collection of personalized physical activity data based on consent; tests for model bias, fairness, and scalability with metrics (e.g., MCC) and a data balancing algorithm; data governance following the GDPR guidelines; and steps toward handling growing data with automatic and continuous model training, validation, and testing, has made the eCoaching approach ethical and trustworthy.
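For reference, the MCC check mentioned above is a one-liner in scikit-learn; `y_test` and `y_pred` are assumed to come from the fitted classifier.

```python
# Sketch of the MCC check used to judge classifier quality on small,
# potentially imbalanced datasets.
from sklearn.metrics import matthews_corrcoef

mcc = matthews_corrcoef(y_test, y_pred)   # ranges from -1 to +1; 0 is roughly random
print(f"MCC = {mcc:.3f}")
```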

Our proposed approach in Fig. 1 gives design flexibility, with a modular design that can accommodate other deep learning classifiers and forecasting algorithms; in that case, we only need to update the prediction and classification algorithms in the data processing and activity prediction module. The ontology tree supports branching and pruning to integrate new ideas. The knowledge base and the recommendation message table can grow or shrink based on the preferences and study requirements. This design approach can also support other activity sensors (e.g., ActiGraph). This study demonstrates an integrated concept for tailored hybrid recommendation generation in activity eCoaching. As activity data grow, we can include deep learning-based classification and prediction for performance comparison, stability analysis, and support for more activity attributes. We will improve the recommendation generation with other approaches, such as clustering and similarity scores, and calculate and compare activity intensity across different weeks using statistical and community-based heuristic methods. A person can receive multiple recommendation messages, and the solution can be enhanced with a meta-heuristic approach that selects an optimal set of recommendations from a feasible recommendation set.

In authentic coaching, to attain a weekly or monthly goal, the eCoach module would, as part of continuous monitoring, generate personalized recommendations on time based on each day's activity outcome, followed by a predictive analysis toward achieving the weekly goal. However, the present work is not authentic coaching but conceptual modeling with AI technology and semantics. To evaluate the practical effectiveness of the concept, further study with a cluster of controlled trials is needed.

Conclusion

This study has shown a direction for using standard AI technology, ethics, data governance, personal preferences, and semantic ontology to design and develop an intelligent eCoach system with semantic knowledge representation that generates automatic, meaningful, contextual, and personalized activity recommendations to help attain personal activity goals. eCoach features can encourage individuals to improve their physical activity levels with wearable activity sensors and digital activity trackers. The concept of univariate time-series forecasting is well established; its application with an ontology and interval prediction for activity eCoaching is novel. Furthermore, in this study, we have confirmed the hypothesis that an autoregression model with the residual error minimization technique performs better than an autoregression model without it. Moreover, this study has presented a detailed analysis of different standard ensemble classifiers on balanced and imbalanced physical activity data, elaborated the classification results, investigated bias and the ethical aspects of AI, and thereby generated understandable and meaningful personalized activity recommendations with semantic rules and SPARQL query execution. We will extend this study by integrating concepts such as activity density and clustering to make eCoach recommendations more realistic and evidence-based, evaluated on a group of controlled trials.

Availability of data and materials

All the data used or produced in this study are either in the main text or in the supplementary files. The corresponding author AC can be contacted for the datasets and codebase.

Abbreviations

ACF:

Autocorrelation Function

ADASYN:

Adaptive Synthetic

ADA:

AdaBoost

AI:

Artificial Intelligence

eCoach:

Electronic Coaching

ET:

Execution Time

FB:

Forecast Bias

GDPR:

General Data Protection Regulation

GB:

Gradient Boost

IMA:

Activity Intensity

KB:

Knowledge Base

LPA:

Low Physical Activity

MCC:

Matthews Correlation Coefficient

MPA:

Medium Physical Activity

NSD:

Norwegian Centre for Research Data

OWL:

Web Ontology Language

PCA:

Principal Component Analysis

PCF:

Partial Autocorrelation Function

RDF:

Resource Description Framework

REK:

Regional Committees for Medical and Health Research Ethics

RF:

Random Forest

RFE:

Recursive Feature Elimination

RSD:

Residual Standard Deviation

SMOTE:

Synthetic Minority Oversampling Technique

SPARQL:

SPARQL Protocol and RDF Query Language

SWRL:

Semantic Web Rule Language

TDB:

Tuple Database

VPA:

Vigorous Physical Activity

WHO:

World Health Organization

References

  1. Physical inactivity a leading cause of disease and disability, warns WHO. Webpage: https://www.who.int/news/item/04-04-2002-physical-inactivity-a-leading-cause-of-disease-and-disability-warns-who. (Accessed on 30 June 2022).

  2. Bhatia M, Kaur S, Sood SK, Behal V. Internet of things-inspired healthcare system for urine-based diabetes prediction. Artif Intell Med. 2020;107:101913.

  3. Rouleau G, Gagnon MP, Côté J. Impacts of information and communication technologies on nursing care: an overview of systematic reviews (protocol). Syst Rev. 2015;4(1):1–8.

  4. Chatterjee A, Gerdes MW, Martinez SG. Identification of risk factors associated with obesity and overweight—a machine learning overview. Sensors. 2020;20(9):2734.

  5. Chatterjee A, Prinz A, Gerdes M, Martinez S. An automatic ontology-based approach to support logical representation of observable and measurable data for healthy lifestyle management: proof-of-concept study. J Med Internet Res. 2021;23(4):e24656.

  6. Chatterjee A, Gerdes MW, Prinz A, Martinez SG. Comparing performance of ensemble-based machine learning algorithms to identify potential obesity risk factors from public health datasets. In: Emerging technologies in data mining and information security. Springer, Singapore. 2021. pp. 253–269

  7. Noncommunicable diseases. Webpage: https://www.who.int/data/gho/data/themes/noncommunicable-diseases. (Accessed on 30 June 2022)

  8. The GBD 2015 Obesity Collaborators. Health effects of overweight and obesity in 195 countries over 25 years. New Engl J Med. 2017. https://doi.org/10.1056/NEJMoa1614362.

  9. GBD 2017 Diet Collaborators. Health effects of dietary risks in 195 countries, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. 2019. https://doi.org/10.1016/S0140-6736(19)30041-8.

  10. Physical activity. Webpage: https://www.who.int/news-room/fact-sheets/detail/physical-activity. (Accessed on 30 June 2022)

  11. Chatterjee, A., Gerdes, M.W. and Martinez, S., 2019, October. eHealth initiatives for the promotion of healthy lifestyle and allied implementation difficulties. In 2019 International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob) (pp. 1–8). IEEE.

  12. Chatterjee A, Gerdes M, Prinz A, Martinez S. Human coaching methodologies for automatic electronic coaching (eCoaching) as behavioral interventions with information and communication technology: systematic review. J Med Internet Res. 2021;23(3):e23533.

  13. Chatterjee, A., Gerdes, M.W., Prinz, A., Martinez, S.G. and Medin, A.C., 2020. Reference Design Model for a Smart e-Coach Recommendation System for Lifestyle Support based on ICT Technologies. In Proceedings of the twelfth international conference on eHealth, telemedicine, and social medicine (eTELEMED) (pp. 52–58).

  14. Chatterjee A, Prinz A, Gerdes M, Martinez S. Digital interventions on healthy lifestyle management: systematic review. J Med Internet Res. 2021;23(11):e26931.

  15. Isinkaye FO, Folajimi YO, Ojokoh BA. Recommendation systems: principles, methods and evaluation. Egyptian informatics journal. 2015;16(3):261–73.

  16. Paul, S., Ray, K.S. and Saha, D., 2021. Clinical Decision Support System Using Fuzzy Logic Programming and Data Analysis. In Emerging Technologies in Data Mining and Information Security (pp. 175–183). Springer, Singapore.

  17. Lika B, Kolomvatsos K, Hadjiefthymiades S. Facing the cold start problem in recommender systems. Expert Syst Appl. 2014;41(4):2065–73.

  18. Tudor-Locke C, et al. How many days of pedometer monitoring predict weekly physical activity in adults. Prev Med (Baltim). 2005;40:293–8.

  19. Matthews CE, et al. I.S. sources of variance in daily physical activity levels in the seasonal variation of blood cholesterol study. Am J Epidemiol. 2001;153:987–95.

  20. Gardner B, et al. How to reduce sitting time? A review of behaviour change strategies used in sedentary behaviour reduction interventions among adults. Health Psychol Rev. 2016;10:89–112.

  21. Baker PRA, et al. C. Community wide interventions for increasing physical activity. Sao Paulo Med J. 2011;129:436–7.

  22. Conroy DE, et al. Lifestyle intervention effects on the frequency and duration of daily moderate-vigorous physical activity and leisure screen time. Heal Psychol. 2017;36:299–308.

  23. The best fitness tracker 2022: stay active and get healthier. Webpage: https://www.techradar.com/best/best-fitness-trackers. (Accessed on 30 June 2022)

  24. The Best Fitness Trackers for 2022. Webpage: https://uk.pcmag.com/fitness-trackers/159/the-best-fitness-trackers. (Accessed on 30 June 2022)

  25. Mercer K, et al. Behavior change techniques present in wearable activity trackers: a critical analysis. JMIR mHealth uHealth. 2016;4:e40.

  26. Duncan M, et al. Activity trackers implement different behavior change techniques for activity, sleep, and sedentary behaviors. Interact J Med Res. 2017;6:e13.

  27. The 13 Best Fitness Trackers and Watches for Everyone. Webpage: https://www.wired.com/gallery/best-fitness-tracker/. (Accessed on 30 June 2022)

  28. Top 10 Walking Tracker Apps for Android. Webpage: https://activitytrackerapp.com/blog/top-10-walking-tracker-apps-for-android/. (Accessed on 30 June 2022)

  29. Qiu S, et al. Step counter use and sedentary time in adults: a meta-analysis. Medicine (Baltimore). 2015;94:e1412.

  30. Stephenson A, et al. Using computer, mobile and wearable technology enhanced interventions to reduce sedentary behaviour: a systematic review and meta-analysis. Int J Behav Nutr Phys Act. 2017;14:105.

  31. Maman ZS, et al. A data-driven approach to modeling physical fatigue in the workplace using wearable sensors. Appl Ergon. 2017;65:515–29.

  32. Dijkhuis TB, et al. Personalized physical activity coaching: a machine learning approach. Sensors. 2018;18(2):623.

  33. Hansel B, Giral P, Gambotti L, Lafourcade A, Peres G, Filipecki C, Kadouch D, Hartemann A, Oppert JM, Bruckert E, Marre M. A fully automated web-based program improves lifestyle habits and HbA1c in patients with type 2 diabetes and abdominal obesity: randomized trial of patient e-coaching nutritional support (the ANODE study). J Med Internet Res. 2017;19(11):e7947.

  34. De Pessemier T, Martens L. Heart rate monitoring, activity recognition, and recommendation for e-coaching. Multimedia Tools and Applications. 2018;77(18):23317–34.

  35. Amorim AB, Pappas E, Simic M, Ferreira ML, Jennings M, Tiedemann A, Carvalho-e-Silva AP, Caputo E, Kongsted A, Ferreira PH. Integrating Mobile-health, health coaching, and physical activity to reduce the burden of chronic low back pain trial (IMPACT): a pilot randomised controlled trial. BMC Musculoskelet Disord. 2019;20(1):1–14.

  36. Oliveira CB, Franco MR, Maher CG, Tiedemann A, Silva FG, Damato TM, Nicholas MK, Christofaro DG, Pinto RZ. The efficacy of a multimodal physical activity intervention with supervised exercises, health coaching and an activity monitor on physical activity levels of patients with chronic, nonspecific low back pain (physical activity for back pain (PAyBACK) trial): study protocol for a randomised controlled trial. Trials. 2018;19(1):1–10.

  37. Petsani, D., Konstantinidis, E.I. and Bamidis, P.D., 2018, March. Designing an E-coaching System for Older People to Increase Adherence to Exergame-based Physical Activity. In ICT4AWE (pp. 258–263).

  38. den Braber N, Vollenbroek-Hutten MM, Oosterwijk MM, Gant CM, Hagedoorn IJ, van Beijnum BJF, Hermens HJ, Laverman GD. Requirements of an application to monitor diet, physical activity and glucose values in patients with type 2 diabetes: the diameter. Nutrients. 2019;11(2):409.

  39. Villalonga C, den Akker HO, Hermens H, Herrera LJ, Pomares H, Rojas I, Valenzuela O, Banos O. Ontological modeling of motivational messages for physical activity coaching. In: Proceedings of the 11th EAI International Conference on Pervasive Computing Technologies for Healthcare. 2017;355–364.

  40. Fatehi F, Hassandoust F, Ko RK, Akhlaghpour S. General data protection regulation (GDPR) in healthcare: Hot topics and research fronts. In: Digital Personalized Health and Medicine 2020;1118–1122. IOS Press.

  41. Chatterjee A, Prinz A. Applying spring security framework with KeyCloak-based OAuth2 to protect microservice architecture APIs: a case study. Sensors. 2022;22(5):1703.

  42. Chatterjee A, Gerdes M, Khatiwada P, Prinz A. SFTSDH: Applying Spring Security Framework With TSD-Based OAuth2 to Protect Microservice Architecture APIs. IEEE Access. 2022.

  43. Chatterjee A, Prinz A. Personalized recommendations for physical activity e-coaching (OntoRecoModel): ontological modeling. JMIR Med Inform. 2022;10(6):e33847.

  44. Thambawita V. et al. Pmdata: a sports logging dataset. 11th ACM Multimedia Systems Conference. 2020;231–236.

  45. MOX2 Bluetooth LE activity monitor. Webpage: https://www.accelerometry.eu/products/wearable-sensors/mox2/. (Accessed on 30 June 2022)

  46. Jović A, Brkić K, Bogunović N. A review of feature selection methods with applications. In 2015 38th international convention on information and communication technology, electronics and microelectronics (MIPRO) (pp. 1200–1205). IEEE. 2015

  47. Chandrashekar G, Sahin F. A survey on feature selection methods. Comput Electr Eng. 2014;40(1):16–28.

  48. Remeseiro B, Bolon-Canedo V. A review of feature selection methods in medical applications. Comput Biol Med. 2019;112:103375.

  49. Bolón-Canedo V, Sánchez-Maroño N, Alonso-Betanzos A. A review of feature selection methods on synthetic data. Knowl Inf Syst. 2013;34:483–519.

  50. Solorio-Fernández S, Carrasco-Ochoa JA, Martínez-Trinidad JF. A review of unsupervised feature selection methods. Artif Intell Rev. 2020;53(2):907–48.

  51. How many steps should you actually take in a day? Webpage: https://www.communityaccessnetwork.org/how-many-steps-should-you-actually-take/. (Accessed on 30 June 2022)

  52. How Many Steps Do I Need a Day? Webpage: https://www.healthline.com/health/how-many-steps-a-day#How-many-steps-should-you-take-a-day? (Accessed on 30 June 2022)

  53. Jadhav A, Mostafa SM, Elmannai H, Karim FK. An empirical assessment of performance of data balancing techniques in classification task. Appl Sci. 2022;12(8):3928.

  54. Domingues, I., Amorim, J.P., Abreu, P.H., Duarte, H. and Santos, J., 2018, July. Evaluation of oversampling data balancing techniques in the context of ordinal classification. In 2018 International Joint Conference on Neural Networks (IJCNN) (pp. 1–8). IEEE.

  55. Ishaq A, Sadiq S, Umer M, Ullah S, Mirjalili S, Rupapara V, Nappi M. Improving the prediction of heart failure patients’ survival using SMOTE and effective data mining techniques. IEEE access. 2021;9:39707–16.

  56. He H, Bai Y, Garcia EA, Li S. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. In 2008 IEEE international joint conference on neural networks (IEEE world congress on computational intelligence) (pp. 1322–1328). IEEE. 2008.

  57. Hannan EJ, Kavalieris L. Regression, autoregression models. J Time Ser Anal. 1986;7(1):27–49.

  58. Jasak H, Gosman AD. Element residual error estimate for the finite volume method. Comput Fluids. 2003;32(2):223–48.

  59. Khatiwada P, Chatterjee A, Subedi M. Automated Human Activity Recognition by Colliding Bodies Optimization (CBO)-based Optimal Feature Selection with RNN. In 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys) (pp. 1219–1228). IEEE. 2021

  60. Chatterjee A, Gerdes MW, Martinez SG. Statistical explorations and univariate timeseries analysis on COVID-19 datasets to understand the trend of disease spreading and death. Sensors. 2020;20(11):3089.

  61. Sklearn Page. Available online: https://scikit-learn.org/stable/supervised_learning.html. (Accessed on 30 June 2022)

  62. Chatterjee A. et al. Comparing performance of ensemble-based machine learning algorithms to identify potential obesity risk factors from public health datasets. Emerging Technologies in Data Mining and Information Security. Springer. 2021;253–269.

  63. Chatterjee A, Ganesh K, Riegler M, Halvorsen P. Meta-heuristic feature optimization for predictive analysis on HRV dataset and semantic knowledge representation for stress management: a case-study towards ethical AI. 2023. https://doi.org/10.21203/rs.3.rs-3114142/v4

  64. Chatterjee A, Gerdes MW, Prinz A, Martinez S. A statistical study to analyze the impact of external weather change on chronic pulmonary infection in South Norway with machine learning algorithms. In International conference on intelligent technologies and applications (pp. 113–124). Springer, Cham. 2020

  65. Chatterjee A, Pahari N, Prinz A, Riegler M. Machine learning and ontology in eCoaching for personalized activity level monitoring and recommendation generation. Sci Rep. 2022;12(1):19825.

  66. Chatterjee A, Prinz A, Riegler MA, Meena YK. An automatic and personalized recommendation modelling in activity eCoaching with deep learning and ontology. Sci Rep. 2023;13(1):10182.

  67. Prediction intervals. Webpage: https://otexts.com/fpp2/prediction-intervals.html.

  68. Phi coefficient. Webpage: https://en.wikipedia.org/wiki/Phi_coefficient. (Accessed on 30 June 2022).

  69. SDG 3: Good Health & Well-Being. Webpage: https://www.springernature.com/gp/researchers/sdg-programme/sdg3 (Accessed on 1 June 2023).

  70. NSD. Webpage: https://www.nsd.no/index.html. (Accessed on 1 June 2023).

  71. REK. Webpage: https://rekportalen.no/#home/REK. (Accessed on 1 June 2023).


Acknowledgements

The authors acknowledge the support from the University of Agder, Norway, and Simula Research Lab, Norway, to carry out this research.

Funding

This research work is partly funded by the University of Agder (UiA). UiA will pay the Open-Access (OA) fee for the paper’s publication in the journal.

Author information

Authors and Affiliations

Authors

Contributions

AC: Conceptualization, Methodology, Software, Validation, Formal analysis, Experiment, Investigation, Writing - original draft, Writing - review & editing, Visualization. NP: Software, Visualization. AP: Writing - review & editing, Supervision. MR: Conceptualization, Writing - review & editing, Supervision.

Corresponding author

Correspondence to Ayan Chatterjee.

Ethics declarations

Ethics approval and consent to participate

All methods were carried out in accordance with relevant guidelines and regulations (e.g., the Helsinki Declaration). For our project, we received ethical approval from the Norwegian Centre for Research Data or NSD [70] (#797208) and the Regional Committees for Medical and Health Research Ethics or REK [71] (#53224). For this study, informed or signed consent has been obtained from all the participants. Here, we have not disclosed any identifiable data of the participants using text, numbers, or figures.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Additional file 2:

Appendix B: StaRI checklist for completion.

Additional file 3: Appendix A.1: Table A.1.

provides a high-level description of the terminologies and parameters used in this study. Appendix A.2: Table A.2. provides the in-context propositional variables and corresponding physical activity recommendation messages. Appendix A.3: Table A.3. describes the recommendation conditions, the semantic rules based on the propositional variables, and the execution criteria. Appendix A.4: Table A.4. provides details of the attributes of the MOX2-5 physical activity datasets. Appendix A.5: Table A.5. provides detailed participant characteristics (such as factors) and the statistical estimation of the used factors (such as mean, standard deviation, minimum, and maximum values) for the 16 participants who used the MOX2-5 activity sensor. Appendix A.6: Figure A.6. compares the training vs. testing curves of the original PMData and MOX2-5 datasets for the best-performing GB classifier. Appendix A.7: Figure A.7. compares the learning curves of the original PMData and MOX2-5 datasets for the best-performing GB classifier. Appendix A.8: Figure A.8. compares the validation curves of the original PMData and MOX2-5 datasets for the best-performing GB classifier. Appendix A.9: Figure A.9. depicts the ACF and PCF plots for participant 1 (P-1) from the original PMData dataset as an example for the regression analysis. Appendix A.10: Figure A.10. depicts the prediction comparison between the autoregression model with residual error minimization (predicted) and the autoregression model without residual error minimization for P-1 from the original PMData dataset as an example. Appendix A.11: Textbox A.11. describes the selected ontology schema and SPARQL queries used as part of personalized recommendation generation.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.


About this article


Cite this article

Chatterjee, A., Pahari, N., Prinz, A. et al. AI and semantic ontology for personalized activity eCoaching in healthy lifestyle recommendations: a meta-heuristic approach. BMC Med Inform Decis Mak 23, 278 (2023). https://doi.org/10.1186/s12911-023-02364-4

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12911-023-02364-4

Keywords