Skip to main content

Use of sentiment analysis for capturing hospitalized cancer patients' experience from free-text comments in the Persian language

Abstract

Purpose

Today, the Internet provides access to many patients' experiences, which is crucial in assessing the quality of healthcare services. This paper introduces a model for detecting cancer patients' opinions about healthcare services in the Persian language, both positive and negative.

Method

To achieve the objectives of this study, a combination of sentiment analysis (SA) and topic modeling approaches was employed. All pertinent comments made by cancer patients were collected from the patient feedback form of the Tehran University of Medical Science (TUMS) Cancer Institute (CI) in Iran, from March to October 2021. Conventional evaluation metrics such as accuracy, precision, recall, and F-measure were utilized to assess the performance of the proposed model.

Result

The experimental findings revealed that the proposed SA model achieved accuracies of 89.3%, 92.6%, and 90.8% in detecting patients' sentiments towards general services, healthcare services, and life expectancy, respectively. Based on the topic modeling results, the topic "Metastasis" exhibited lower sentiment scores compared to other topics. Additionally, cancer patients expressed dissatisfaction with the current appointment booking service, while topics such as "Good experience," "Affable staff", and "Chemotherapy" garnered higher sentiment scores.

Conclusion

The combined use of SA and topic modeling offers valuable insights into healthcare services. Policymakers can utilize the knowledge obtained from these topics and associated sentiments to enhance patient satisfaction with cancer institution services.

Peer Review reports

Introduction

The Web has become a rich data environment that provides a considerable amount of information with the cooperation of all Internet users without any restrictions [1, 2]. Today, patients and physicians are turning to online platforms such as blogs, social media, and healthcare websites to share their comments and leverage the perspectives of others on health issues [3]. Many unstructured data about healthcare quality are available on the Internet that have not been systematically recorded. Free-form text responses, when compared to structured questionnaires, may be less biased by the feedback collector and, therefore, more representative. However, analyzing large quantities and obtaining meaningful results becomes more challenging [4, 5]. Also, the enormous amount of news disseminated through social media makes manual verification unfeasible. Therefore, automatic analysis of online customer reviews, particularly the emotions expressed in comments, and fake news detection are considered hot research topics [6]. Natural language processing (NLP) is one of the technologies used in these fields. NLP is a branch of artificial intelligence (AI) that analyzes and understands human language. The goal of NLP is to design and build systems that can analyze human languages. One of the applications of NLP that data scientists highly regard is sentiment analysis (SA) [7]. SA is the field of study that analyzes people's opinions, sentiments, appraisals, evaluations, attitudes, and emotions towards entities such as products, services, organizations, individuals, issues, events, topics, and their characteristics. Sentiments can be positive, negative, or neutral and are assigned a numerical score indicating their effectiveness [8]. Today, many patients' experiences are available on the Internet [9]. SA allows us to understand and use this information more effectively to improve the quality of healthcare [10].

By leveraging SA, healthcare providers can gain valuable insights into their patients’ needs and preferences, allowing them to tailor their services and marketing strategies to better meet those needs. This can lead to increased patient referrals and higher revenue [5, 11]. Social media analysis provides the necessary capacity to monitor public health. Social media provides a platform for patients with chronic diseases, including cancer, to share their quality of life indicators with other patients during and after treatment [12].

Cancer is an important disease affecting millions of people and families worldwide [13]. For many reasons, such as the severity of some cases, the side effects of some treatments, and the deaths of other patients, cancer patients are often affected by serious emotional disorders such as depression [14]. Monitoring patients' mood is an important part of their treatment. Cancer patients use online social media and virtual cancer communities to exchange messages and support each other during treatment [15]. SA methods can be used to detect cancer patients' moods by analyzing their messages in online communities [16].

SA researches are expanding incredibly quickly. However, despite the fact that a sizable portion of the data is available in other languages, the majority of research efforts are focused on English language [17].

In the healthcare domain, most research efforts in the field of SA are focused on the English language [18, 19], while research is more limited to other languages including Persian.

SA in every language has specific prerequisites. Therefore, the direct use of methods, tools, and resources developed for the English language in Persian has limitations [20]. Therefore, it seems necessary to analyze patients' emotions in the Persian Language.

In this paper, we presented a model for understanding cancer patients' unstructured opinions about their care in the TUMS CI.

Even though there have been numerous studies on SA in the field of cancer [21,22,23,24,25,26,27], this is the first one to be carried out in Iran for this particular patient population in Persian language.

Related works

This section discusses several kinds of research in the field of SA using either English or other languages in the healthcare domain.

Alexander et al. [18] performed SA and topic modeling on unstructured comments on the England patient feedback web service. The results of sentiment analysis showed that patient feedback can be accurately classified into positive or negative sentiments, and the topic modeling approach can be used to identify topics in patient reviews. Also, data from this website was used by Greaves et al. [10] to understand patients' unstructured opinions about their care. They used SA techniques to categorize online free text comments by patients as positive or negative descriptions of their health care. In this study, a model is presented to automatically predict whether a patient would recommend a hospital, whether the hospital is clean, and whether patients are treated with dignity. Their results show that, from free text, a logically accurate assessment of patients' views on various aspects of a hospital's performance can be predicted.

Clark et al. [12] conducted research on English-language tweets related to the experiences of breast cancer patients based on SA. The results of this study showed that patients shared positive experiences about treatment, increasing support and spreading awareness. The results of this study show that social media can provide a positive outlet for patients to talk about their needs and concerns regarding their healthcare coverage and treatment needs.

Balakrishnan et al. [21] examined the dynamics of emotions in patient narratives in a breast cancer community group to identify changes in emotions, thoughts, stress, and coping mechanisms during treatment options, particularly chemotherapy, radiation, and surgery. The sentiment dynamics of users' posts were analyzed using a deep learning model. A sentiment change analysis was performed to measure the change in users' satisfaction levels. This leads to a better understanding of the different needs of patients during treatment.

Edara et al. [19] provided a distributed framework for analyzing the sentiments of cancer patients from various online cancer support communities. In this study, a dataset was constructed by collecting patients' opinions from different domains using the Twitter API. The proposed distributed framework replaces traditional sentiment analysis approaches with a Long Short-Term Memory (LSTM) neural network for analyzing large volumes of data in a potential stream. The results of this study showed that most patients expressed positive opinions about their illness, while some expressed negative and neutral opinions.

In the study by Baker et al. [28], the response of individuals to colorectal cancer was analyzed and used to predict the future of this disease. The dataset for this study was collected from Twitter. LSTM models, Gated Recurrent Units (GRU), and Convolutional Neural Networks (CNN) were evaluated. The results showed that the GRU model outperformed the LSTM and CNN models in terms of accuracy and provided more stable results.

Modave et al. [22] used topic modeling and sentiment analysis techniques to understand discussion topics and quantify twitter users' perceptions and sentiments regarding breast cancer. Their results show that the attitude of ordinary people towards breast cancer changes from time to time.

Alamoodi et al. [29] examined the general sentiments and topics discussed during different waves of the COVID-19 pandemic in Malaysia. In this study, a lexicon-based approach was used to analyze sentiment in Twitter data. The results of topic modeling and SA showed a consistent presence of positive sentiments throughout all quarantine waves.

Pandesenda et al. [30] measured the service quality of mobile health services provided by Alodokter through the SA of customer comments in Indonesian from the Google Play Store. It is Indonesia's number-one digital health firm. System quality, interaction quality, and information quality are categorized using sentiment analysis and the Fast Large-Margin classification methodology. The findings of this study show that system quality receives the majority of good customer feedback, and interaction quality receives the majority of positive customer feedback.

Alayba et al. [31] used machine learning (ML) methods in the opinion classification to positive and negative classes. They introduce an Arabic language dataset in health services by collecting from Twitter.

Del-Arco et al. [32] used the Corpus Of Patient Opinions in Spanish (COPOS) and ML to mine Spanish patient opinions. They have carried out experiments with the main methodologies employed in the task of polarity classification (semantic orientation and machine learning).

Freedman et al. [33] used ML techniques to understand barriers to breast cancer treatment. They investigated the sentiments and barriers expressed about breast cancer treatments by Internet users.

The first study in the field of healthcare using SA and machine learning in the Persian language was done by Khaleghparast et al. [34] to identify various hospital wards and staff members from patients' remarks and to categorize messages into good and negative categories.

As a conclusion from previous related works SA in the field of health is presented on the comments and feedback published on different web platforms from health markets, and different social networks to the websites of healthcare centers with different goals and in different languages.

Methodology

This work was part of a project evaluated and approved by the Ethics Committee under the number IR.TUMS.IKHC.REC.1399.377. All preprocessing and modeling steps were done in the Python version 3 environment. The block diagram of the research is shown in Fig. 1.

Fig. 1
figure 1

The block diagram of the research methodology

Each of the modeling steps is as follows:

  • Normalization: If punctuation marks and Persian words are not written in the same way, the texts used will not be analyzable by computer systems. In the process of normalization, spelling signs, letters, spaces between words, abbreviations, etc. are converted to a standard form without making any meaningful changes to the text.

  • Spell-checking: Spell checkers help quickly identify spelling errors in the text. This capability, if implemented properly, helps create texts without spelling mistakes and also increases the speed and accuracy of error detection.

  • Stemming: In any language, words appear in different forms depending on their semantic and syntactic role in the sentence. This different form indicates the different meanings of these words, but since they are all derived from the same root, they will have relatively close meanings. For this reason, in many NLP and information retrieval applications, it is necessary to convert all derivatives of a word to its root.

  • Sentence splitting: It involves the segmentation of a sentence into two or shorter sentences. It is a key component of sentence simplification, has been shown to help human comprehension, and is a useful preprocessing step for NLP tasks such as summarization and relation extraction.

  • Polarity detection: It is employed to ascertain the sentiment of a text, indicating whether it is positive, negative, or neutral.

  • Aggregation: Refers to a mechanism used to compute the overall sentiment score of reviews.

Figure 2 illustrates a simplified representation of the proposed model using pseudocode, which is an informal method for describing algorithms without the need for a specific programming language [35].

Fig. 2
figure 2

Pseudo code for the proposed model

Data collection

All available cancer patients' feedback was collected between March and October 2021 from a patients' feedback form in the Tehran University of Medical Sciences Cancer Institute (TUMS CI), the most referred cancer hospital in Iran located in Imam Khomeini Hospital Complex. This institute is a leading organization in the field of cancer research and treatment. The TUMS CI consists of three divisions: education, research, diagnosis, and treatment. It was established in 1949 [36].

The patients' feedback form was prepared to receive the experiences of patients on the TUMS CI website.

Based on a literature review and the review of common complaints in Iranian hospitals, main categories were prepared to receive unstructured opinions from patients. The results of this step were reviewed in an expert panel session by five experts, including two medical informatics specialists, two healthcare services management specialists, and one oncologist, who had at least three years of experience at the Universities of Medical Sciences.

The nominal group technique (NGT) was used to evaluate search results. It allows stakeholders to directly produce items for needs assessment surveys [37]. Our goal was to use NGT discussions to develop online free text feedback items. Two NGT groups were conducted. Participants shared items, and a master list was compiled, and then reviewed by participants to remove or merge overlapping items. Once a final list was generated, participants independently rated them on a scale from 1 to 10. Lists generated in the NGT discussions were subsequently reviewed and integrated into a single list by research team members.

The final unstructured feedback form was made available online to cancer patients. They could provide free-text feedback about the TUMS CI in 3 main unstructured sections:

  • Healthcare services in TUMS CI: Patients could provide feedback in this section about the treatment process, dignity, respect, staff cooperation, involvement in decisions, etc.

  • TUMS CI's general services include an option for patients to describe their free-text experience with the accommodation environment, cleanliness, quality of food, and other relevant aspects in this section, etc.

  • Cancer patients' life expectancy: Patients may express their free-text comments regarding their desire to continue their current treatment process and their concerns as cancer patients, etc.

These patients are free to feedback on each section. The patient feedback form helps us to collect valuable comments from patients about the clinic, hospital, their level of satisfaction, etc. All comments and reviews were stored in an Excel file for preprocessing and analysis.

Preprocessing

The Persian language has particular challenges for which it is impossible to use many existing methods to analyze sentiments in English. One of the main challenges in Persian texts is the existence of words with different writing formats. Also, the existence of short space or pseudo-space as the space inside the word to separate words is one of the other challenges of the Persian language. Also, some letters in this language have different Unicode, such as "ی" and "ک" which has made marking and dividing texts in Persian more challenging than in other languages [38]. Before working on the texts to make the letters and spacing consistent, they must be pre-processed. At this point, all of the characters need to be replaced with their standardized counterparts. When dealing with Persian script, which is similar to Arabic script, there are often issues with the pronunciation of Arabic characters, including ("ک"، "ی") and ("ئ"، "أ" ،" ؤ"،" إ"). To rectify the issues related to these letters, the first step is to normalize them.

In Persian language, there is a module called hazm for normalizing the texts. It is a Python library for digesting Persian text [39]. In this research, the hazm library was used to carry out all preprocessing steps. A part of the codes can be seen in Additional file 1.

In the first stage, to prepare posts and comments, existing punctuation such as ("?"، "!"، "()") and so on were removed from the texts. Also, to reduce the comments' size and improve the accuracy, stop words that do not significantly affect the sentiment analysis process were removed. For this purpose, the list of existing Persian stop words previously suggested by Dolamic et al. [40] was used. In the next step, the stemming process was performed to reduce the different forms of words used in the comments [41].

The main problems of the Persian language that were solved in the pre-processing stage are as follows:

Different encoding for some characters:

Persian script processing often has issues when trying to use Arabic characters for certain characters. In the corpus, these characters have been standardized according to the Academy of Persian Language and Literature (APLL). APLL is the regulatory body for the Persian language, headquartered in Tehran, Iran [42].

Extra spaces

During the preprocessing stage, the text corpus was stripped of any extra spaces, half-spaces, and tabs.

Variety of text with uppercase and lowercase letters

In some cases, the letter "آ" is used incorrectly in words featuring it, such as writing "اب" instead of "آب". To resolve this issue, words beginning with the letter "ا" were altered to feature the uppercase form of the letter "آ". Additionally, words with the letter "آ" (other than their primary character) were changed to their lowercase form.

Different ways to add suffixes to main words

Various methods of appending suffixes to main words include the use of suffixes such as ("تر"، "ترین" ، "هایم" ، "هایش") and so on, which are placed at the end of the words in three different forms, including ("مناسبتر" ،"مناسب تر" ،"مناسبتر"). To make this type of words systematic, all suffixes are affixed to the end of the words.

Different ways of adding prefixes to main words

The prefixes of "می‌"، "نمی‌"، "درمی‌"، "برمی‌"، "بی‌" come in three different forms: "می رود" ،"میرود", and "می‌رود". To standardize this type of words, all prefixes should be added to the beginning of the words using a half-space.

Removing the character "-"

The character stretches out the words, such as "بــــــر" becoming "بر".

Different ways of attaching the components of compound words

A compound word is when two or more distinct words are combined to create a single, new term with a distinct meaning. For instance, the Persian word “پیرمرد” is composed of two separate words, “پیر” and “مرد”, each of which has its meaning. All the compound words from the corpus have been altered to one unified form.

Multiple spelling words

Persian words have multiple spelling forms. These words may contain homophones (ا/ ع ؛ ت/ ط ؛ ث/ س/ ص ؛ح/ ه ؛ ذ/ ز ؛ ض/ ظ ؛ غ/ ق) like "اتاق" and "اطاق", for which the correct form according to the APLL is"اتاق". Its reliable resources were used to verify and standardize multiple spelling words in the corpus.

The treatment of negations

Persian negation is very important because its structure is used in everyday conversation. Persian negation is the process that turns an affirmative statement (I am happy) into its opposite denial (I am not happy). In Persian, verbs are negated with the particle "نـ". This particle, being a word of its own, should be written separately but under the current custom of writing, it is written attached to the verb:

  • - رفتم (I went) → نرفتم (I did not go)

  • - خندیدم (I laughed) → نخدیدم (I did not laugh)

We use a rule-based approach to detect negation for sentiment analysis that largely draws information from lexicons. In this approach, every verb with the particle "ن ", "نا ", "ضد " (important shifters) is considered as negated [20].

Sentiment analysis

In this study, to decide on the polarity of each review and comment, the method proposed by Basiri et al. [43] was used. It is based on the evidence theory of Dempster-Shafer and sentence-level sentiment aggregation. The polarity detection of each sentence was determined using SentiStrength and an existing library. SentiStrength is an algorithm that combines a lexicon-based approach with linguistic information and rules to handle negation, misspelling, punctuation, and emoticons. It was specifically developed to handle sentiment analysis in informal and concise English text [44]. We used the translation of SentiStrength for Persian which was presented in [43]. SentiStrength contained positive and negative terms based on the degree of influence from -5 to + 5. With the SentiStrength method, the maximum positive and minimum negative scores of a sentence are calculated to determine the overall polarity of an opinion. Based on this method, the maximum absolute value of the positive/negative score was considered the opinion score. This strategy calculated the opinion score based on the number of positive and negative sentences (Eq. 1).

Score aggregation strategies

In order to calculate the overall sentiment score of reviews, an aggregation mechanism is required:

$$\mathrm{V}(\mathrm{P})=\frac{\sum_{i=1}^{n}(u\left({T}_{i}\left(P\right)\right)\times Polarity\left({T}_{i}\left(P\right)\right))}{\sum_{i=1}^{n}u({T}_{i}\left(P\right))}$$
(1)

where V is the overall valuation (score) of a product P. It can be computed as a weighted average of the polarity of each individual review \({T}_{i}\left(P\right)\) where the weights indicate their relative usefulness [45]. This method computes the aggregated polarity of a product by \({u(T}_{i}\left(P\right))\). In other words, it computes the overall polarity as a simple average of individual review polarities.

Topic modeling

Topic modeling is an unsupervised NLP technique where untagged documents are used to create a collection of topics represented by a list of words that occur most often in each topic. In this study, the latent Dirichlet allocation (LDA) topic modeling approach was used to categorize each comment into computer-generated topics. LDA first considered all topics obtained from documents as hypotheses and then tried to select topics that best represent the documents. LDA calculated the probability of a word for a given topic as well as the probability of a topic for a given document [46].

To choose the final value of the number of topics (k), several LDA models with different values of k including [5, 10], were created. Then the output of these LDA models was manually checked by evaluators. K should be set large enough to cover all possible topics with minimal overlap. Finally, k = 6 was selected.

Model evaluation

The model's accuracy is evaluated by splitting the data into two parts, the training set and the test set. The training set is usually broken down even further, with one section used to train the model and another used to validate it. Parameters and models are tested and tuned using the training set until the highest predictive score is reached. The test set is then used to test the best model obtained. In this study, the most optimal result was attained with a composition ratio of 90:10.

The SA model performance was evaluated in terms of precision, sensitivity, specificity, and accuracy, F-measure (Table 1).

Table 1 The performance evaluation metrics

Table 2 shows the confusion matrix. The values of true positive (TP) and true negative (TN) give information when the classifier is doing data classification is true, while false positive (FP) and false negative (FN) give information when the wrong classifier is inside the performed data classification.

Table 2 Confusion matrix

In addition, we use the area under the receiver operating characteristic curve (AUC) to evaluate the proposed model. It means how much the model is capable of distinguishing between classes.

Result

Table 3 listed the number of comments and reviews and how many words and sentences were contained in each comment.

Table 3 The dataset content statistics

The age frequency of the participants in the data collection phase is shown in Fig. 3. The highest frequency of participants was the age range of 30 to 60 years.

Fig. 3
figure 3

Age-frequency of patients participating in the free-text feedback form

Table 4 shows a few examples from the corpus.

Table 4 Examples of comments

The patients' comments and reviews on each section were rated in one of the five predefined options by two experts, where 5 indicates the highest level of satisfaction and 1 indicated the lowest level of patient satisfaction. Two medical informatics experts rated the comments and reviews. In case of doubt, the sentiment was discussed and resolved between them.

For example, for the cleanliness of the hospital, there are five options: "completely clean", "very clean", "clean", "relatively clean", and "dirty". Also, the patients' comments and reviews related the dignity and respect were scored with the options of "always", "most of the time", "sometimes", "rarely", and "not at all". Regarding satisfaction with the treatment process, five options "completely hopeful", "hopeful", "relatively hopeful", "disappointed", and "completely disappointed" were used for rating. Comments rated 3, 4, and 5 were classified as positive, and comments rated 1 and 2 were categorized as negative.

According to Fig. 4, patients' satisfaction is evident in most of the comments. Also, by drawing words cloud on the comments related to the staff's behavior with the patients, the words "راضی ", "خوشحال ", "صمیمی ", "مهربان ", and "خوب " were the most repeated (Fig. 5). The English translation of each of the words respectively is equal to "satisfied", "glad", "friendly", "kind", and "good".

Fig. 4
figure 4

The frequency of categories of patients' comments based on the five levels of satisfaction

Fig. 5
figure 5

Persian words cloud of patients' comments about the healthcare services

After applying the SA model to the comments and reviews, 89.3% of the quantitative ratings of patients' comments about "general services" and our prediction from the sentiment analysis were in agreement (Table 5).

Table 5 SA model performance

Similarly, SA on comments about "healthcare services" and "life expectancy" obtained 92.6 and 90.8% agreement between quantitative rating and sentiment analysis, respectively.

Table 7 reports the sentiment scores for each of the topics in each section. The topics identified by the LDA topic modeling approach were reviewed and labeled by AN, who is a GP in the CI. The analysis generated a score for positive or negative sentiment between − 1 (most pessimistic) and 1 (most optimistic). Examples of comments and their score can be found in Table 6.

Table 6 Frequency of topics and related sentiment score

Topics that were innately associated with positive sentiments, such as "affable staff" and "good experience," have higher sentiment scores than inherently harmful topics such as a "crowded room" "good experience," "affable staff," and "chemotherapy" had higher sentiment scores compared with the other topics. "Metastasis" had lower sentiment scores than other topics (Fig. 6).

Fig. 6
figure 6

Relationship between topic frequency and sentiment

Discussion

This study aimed to provide an automatic solution for analyzing cancer patient feedback about CI's healthcare services, general services, and life expectancy. This work examined NLP techniques, including a Lexicon-based method and topic modeling in the Persian language. SA caused automated analysis of patient comments. Also, the sentiment of comments was identified as positive or negative. The top 6 topics in all three categories were extracted through topic modeling. The experimental results showed that the patients' sentiments about healthcare services, general services, and life expectancy could be detected by the proposed model with an accuracy of 92.6%, 89.3%, and 90.8%, respectively.

In the category of life expectancy, "hopeful" as a positive word compared to "metastasis", "coronavirus", and "death" were the most frequent among patients' comments. The frequency of the words "cleaned & disinfected environment," "the bed sheet," and "accompanying the patient" was observed in comments related to general services. The sentiment score related to each topic showed that the patients are satisfied with the cleanliness of the cancer department, but they feel dissatisfied with their beds and the conditions of their companions.

Regarding healthcare services, "booking appointments" with the associated average sentiment score of -0.161049 mentioned that cancer patients might be unhappy with the current appointment booking service. Appointment scheduling was also an important determinant of patient satisfaction. Also, Alexander et al., in their study using SA and topic modeling on patients' experiences, introduced appointment booking as an issue that caused patients' dissatisfaction [18]. Studies showed that a patient's ability to book their non-urgent appointments significantly impacts patient satisfaction. Every day, people around the world strive to make their lives easier through technological advancements. No one anticipates wasting time, effort, and money waiting in line at the counter, especially when visiting a hospital. The development of an online appointment management system as a solution for patients’ online scheduling of hospital appointments with doctors based on their availability is helpful. These systems aimed to organize patient knowledge based on physician availability, hospital and specialist schedules, and patient appointments. These systems could be designed to automate daily hospital activities such as room activities, admission of the last patient, and doctor visits [47].

Cancer recurrence or metastasis was a predominant concern for many cancer patients and could contribute to experiencing a constant oscillation between feelings of hope and despair [48]. Metastasis was one the most frequent topics with the highest associated negative sentiment (-0.304987) in all three categories. It mentioned that recurrence or metastasis was a predominant concern for many cancer patients and could contribute to experiencing a constant oscillation between feelings of hope and despair. In a study by Mazza et al. [49] to investigate how to describe the lived experience of metastatic breast cancer on social media, they found that the effectiveness of treatment until recurrence or new metastasis was one of the concerns of cancer patients. Zhang et al. [50] focused on an extensive general breast cancer community and performed a sentiment analysis on all their online posts. They found that the posts published by older users were primarily about chemotherapy or metastasis/recurrence.

Accompanying the patient with the associated sentiment score of -0.125527 was the most negative sentiment in the general services category. It mentioned that cancer patients might be unhappy with the current services provided to their families and did not have a good experience. A good experience means that patients and their families would return to the hospital when they need more services and may even recommend the hospital to others. Hospitals today are designed with inspiration from hotels to increase the satisfaction of patients and their families. The concept of hospital hoteling is hospitality and non-medical services provided by patients and their families from the moment they enter the hospital until the final discharge, which could cause satisfaction or dissatisfaction. Hospital managers should be familiar with hoteling and have enough information to create strategies to improve health care services and patient satisfaction [51].

Social media usage is usually highest between 18–29-year-olds. Recent studies showed that the use of social networks in the age group over 65 has tripled in the past decade [49]. Examining the age group of participants in this study showed that 27.5% of the comments were related to people over 60 years old, which is comparable to people under 30 years of age, which was 8.9%. Therefore, patient feedback listening was a valuable technique for systematically reviewing the rich data that online platforms offer across the age spectrum.

Our results reinforce previous findings that it was possible to analyze the sentiments of patients' comments with a reasonable degree of accuracy and that it was possible to identify salient aspects of the comments [10, 52]. These results suggested a potential mechanism for using a large amount of text in the hospital complaint from where people describe their care and that further exploration of the information contained in free text comments may be an important way to understand the patient experience. It is considered an additional source of information to complete the traditional survey methods. The words expressing the patients' dissatisfaction and concern in the extracted free texts indicate the high noise level of the hospital environment, delay and crowding of appointments, high treatment costs, lack of support protocol for patients' companions, and problems in air conditioning systems. The extracted words express the patients' satisfaction with the free texts, show the proper behavior of treatment personnel, satisfaction with the cleanliness of the healthcare environment, and satisfaction with the food quality.

One of the main goals of any organization is to satisfy customers and clients by providing good quality services. Customer satisfaction is one of the leading indicators of growth and development. Client satisfaction is determined by speed, accuracy, precision in providing services, how clients are treated, and the availability of appropriate information. The developed model in this research provides the possibility of analyzing patients' opinions by recognizing the opinions of patients about different aspects. By receiving the opinions that were received through the online forms available in the complaints section of the hospital and also the comments provided on social networks, the proposed model could predict the sentiments of patients (positive and negative) according to the criteria of healthcare services, general services, and life expectancy.

As mentioned in Table 7, the existing SA methods include ML, deep learning, and lexicon-based methods [34]. In this research, a lexicon-based method was used.

Table 7 Comparison of the critical characteristics of prior works

SA in the field of health is presented on the comments and feedback published on different web platforms from health markets, and different social networks to the websites of healthcare centers with different goals and in different languages. In this research, we designed an unstructured patient feedback form, which will be the basis for receiving the online opinions of cancer patients on the TUMS CI website.

In the medical field, the purpose of SA and opinion mining is to augment healthcare services rendered to patients by evaluating their consent or discontent expressed in their comments. The amount of research that has been conducted in the field of SA of cancer patients is limited compared to other scopes.

There were few studies on SA in the Persian language [53,54,55] and so far, only one research [34] has been done in the field of healthcare. Our study is the first SA study conducted in the field of cancer in Iran. Compared to research [34] we also used a lexicon-based method but did not use ML methods. No topic modeling methods were used in their study. In both studies, the basis of the corpus was not the social network, but the information collected from the hospital's criticism and complaints form. In the corpus of the present study, similar to the study [34] patients' opinions about the care and non-care services of the hospital were analyzed. As a difference in our research, the life expectancy of cancer patients was also interpreted.

The corpus of this research is collected from the feedback recorded in the hospital's complaint and suggestions form, and the nature of the hospital's website is not a microblogging environment. So, to maintain the efficiency and scalability of the proposed model, the corpus should periodically feed from the recorded comments on the website.

Limitations

Ironic sentences and words are one of the problems of SA, which is still considered an open challenge to develop tools for the English language. We also faced this problem in this research, which could be mentioned as one of the reasons for reducing the accuracy of the used model compared to the qualitative method. For example, the sentence "if the food were as good as the treatment staff, the situation would be wonderful" had a very high positive meaning from the system's point of view. According to this sentence, food quality is considered good from the system's point of view, but its meaning is negative.

The distinguishing feature of the research conducted in the corpus and the methods used. It is possible to compare the efficiency of the conducted research when they are all applied to the same corpus. Therefore, it was not possible to compare the accuracy of the current research with previous studies of the Persian language.

Conclusion

By applying SA to healthcare, these analytical methods could interpret contextual information about the patient experience on a large scale. The results of this research could be used to identify patients' opinions about the current state of the cancer department. A combination of SA and topic modeling helps collect insightful information about healthcare services. Policymakers could use the learnings of these topics and related sentiments to improve patient satisfaction with cancer institution services.

In this research, only the unstructured experiences of the patients were analyzed. In future research, we would like to compare the results of SA with quantitative ratings provided by patients on a Likert scale to test the accuracy of prediction. For this, we will design a researcher-made structured questionnaire according to the goals of this research.

In the future study, we will develop a patient satisfaction analysis system based on the proposed model in this research. Considering that there was no unstructured form on the website of the TUMS CI and active social media that would meet the research objectives, we designed the complaints and criticisms form for the first time in line with the research objectives and distributed it to the patients. Considering the positive results of this research, in the future study, we will develop a patient satisfaction analysis system based on the proposed model in this research-based comments that will be gathered from the online designed form. There will be further studies and experiments on using deep neural network and recurrent neural network architectures to increase the accuracy of the results.

Availability of data and materials

The data generated, analyzed and used during the current study are not publicly available due to Shiraz University of Medical Sciences policy, but are available from the corresponding author upon the reasonable request.

References

  1. Yue L, Chen W, Li X, Zuo W, Yin M. A survey of sentiment analysis in social media. Knowl Inf Syst. 2019;60(2):617–63.

    Article  Google Scholar 

  2. Jindal K, Aron R. A systematic study of sentiment analysis for social media data. Materials today: proceedings. 2021 .

  3. Campbell L, Evans Y, Pumper M, Moreno MA. Social media use by physicians: a qualitative study of the new frontier of medicine. BMC Med Inform Decis Mak. 2016;16(1):1–11.

    Article  Google Scholar 

  4. Khanbhai M, Anyadi P, Symons J, Flott K, Darzi A, Mayer E. Applying natural language processing and machine learning techniques to patient experience feedback: a systematic review. BMJ Health Care Inform. 2021;28(1):e100262.

  5. Abualigah L, Alfar HE, Shehab M, Hussein AMA. Sentiment analysis in healthcare: a brief review. Recent advances in NLP: the case of Arabic language. 2020; 129–41.

  6. Alonso MA, Vilares D, Gómez-Rodríguez C, Vilares J. Sentiment analysis for fake news detection. Electronics. 2021;10(11):1348.

    Article  Google Scholar 

  7. Chong WY, Selvaretnam B, Soon L-K, editors. Natural language processing for sentiment analysis: an exploratory analysis on tweets. 2014 4th international conference on artificial intelligence with applications in engineering and technology. IEEE; 2014.

  8. Kanakaraj M, Guddeti RMR, editors. NLP based sentiment analysis on Twitter data using ensemble classifiers. 2015 3Rd international conference on signal processing, communication and networking (ICSCN). IEEE; 2015.

  9. Abirami AM, Askarunisa A. Sentiment analysis model to emphasize the impact of online reviews in healthcare industry. Online Information Review. 2017.

  10. Greaves F, Ramirez-Cano D, Millett C, Darzi A, Donaldson L. Use of sentiment analysis for capturing patient experience from free-text comments posted online. J Med Internet Res. 2013;15(11): e2721.

    Article  Google Scholar 

  11. Lai ST, Mafas R, editors. Sentiment Analysis in Healthcare: Motives, Challenges & Opportunities pertaining to Machine Learning. 2022 IEEE International Conference on Distributed Computing and Electrical Circuits and Electronics (ICDCECE). IEEE; 2022.

  12. Clark EM, James T, Jones CA, Alapati A, Ukandu P, Danforth CM, et al. A sentiment analysis of breast cancer treatment experiences and healthcare perceptions across twitter. arXiv preprint arXiv:180509959. 2018.

  13. Ahmad S, Manzoor S, Siddiqui S, Mariappan N, Zafar I, Ahmad A, et al., editors. Epigenetic underpinnings of inflammation: Connecting the dots between pulmonary diseases, lung cancer and COVID-19. Seminars in cancer biology. Elsevier; 2022.

  14. Han CJ, Yang GS, Syrjala K. Symptom experiences in colorectal cancer survivors after cancer treatments: A systematic review and meta-analysis. Cancer Nurs. 2020;43(3):E132.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Rodrigues RG, das Dores RM, Camilo-Junior CG, Rosa TC. SentiHealth-Cancer: a sentiment analysis tool to help detecting mood of patients in online social networks. Int J Med Inform. 2016;85(1):80–95.

  16. Adikari A, De Silva D, Ranasinghe WK, Bandaragoda T, Alahakoon O, Persad R, et al. Can online support groups address psychological morbidity of cancer patients? An artificial intelligence based investigation of prostate cancer trajectories. PLoS ONE. 2020;15(3): e0229361.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Dashtipour K, Poria S, Hussain A, Cambria E, Hawalah AY, Gelbukh A, et al. Multilingual sentiment analysis: state of the art and independent comparison of techniques. Cogn Comput. 2016;8:757–71.

    Article  Google Scholar 

  18. Alexander G, Bahja M, Butt GF. Automating Large-scale Health Care Service Feedback Analysis: Sentiment Analysis and Topic Modeling Study. JMIR Med Inform. 2022;10(4): e29385.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Edara DC, Vanukuri LP, Sistla V, Kolli VKK. Sentiment analysis and text categorization of cancer medical records with LSTM. J Ambient Intell Humaniz Comput. 2023;14(5):5309–25.

    Article  Google Scholar 

  20. Rajabi Z, Valavi M. A survey on sentiment analysis in Persian: a comprehensive system perspective covering challenges and advances in resources and methods. Cogn Comput. 2021;13(4):882–902.

    Article  Google Scholar 

  21. Balakrishnan A, Idicula SM, Jones J. Deep learning based analysis of sentiment dynamics in online cancer community forums: An experience. Health Informatics J. 2021;27(2):14604582211007536.

    Article  PubMed  Google Scholar 

  22. Modave F, Zhao Y, Krieger J, He Z, Guo Y, Huo J, et al. Understanding perceptions and attitudes in breast cancer discussions on Twitter. Studies in health technology and informatics. 2019;264:1293.

    PubMed  PubMed Central  Google Scholar 

  23. Edara DC, Vanukuri LP, Sistla V, Kolli VKK. Sentiment analysis and text categorization of cancer medical records with LSTM. J Ambient Intell Hum Comput. 2019;10(7):1–17.

  24. Qiu B, Zhao K, Mitra P, Wu D, Caragea C, Yen J, et al., editors. Get online support, feel better--sentiment analysis and dynamics in an online cancer survivor community. 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing. IEEE; 2011.

  25. Vehviläinen-Julkunen K, Turpeinen S, Kvist T, Ryden-Kortelainen M, Nelimarkka S, Enshaeifar S, et al. Experience of ambulatory cancer care: understanding patients’ perspectives of quality using sentiment analysis. Cancer Nurs. 2021;44(6):E331–8.

    Article  PubMed  Google Scholar 

  26. Dalayya S, Elsaid STFA, Ng KH, Song TL, Lim JBY. Sentiment Analysis to Understand the Perception and Requirements of a Plant-Based Food App for Cancer Patients. Hum Behav Emerg Technol. 2023;2023:1–11.

  27. Ofek N, Caragea C, Rokach L, Biyani P, Mitra P, Yen J, et al., editors. Improving sentiment analysis in an online cancer survivor community using dynamic sentiment lexicon. 2013 international conference on social intelligence and technology. IEEE; 2013.

  28. Baker MR, Mohammed EZ, Jihad KH, editors. Prediction of Colon Cancer Related Tweets Using Deep Learning Models. International Conference on Intelligent Systems Design and Applications. Springer; 2022.

  29. Alamoodi A, Baker MR, Albahri O, Zaidan B, Zaidan A, Wong W-K, et al. Public sentiment analysis and topic modeling regarding COVID-19’s three waves of total lockdown: a case study on movement control order in Malaysia. KSII Trans Internet Inf Syst. 2022;16(7):2169–90.

    Google Scholar 

  30. Pandesenda AI, Yana RR, Sukma EA, Yahya A, Widharto P, Hidayanto AN, editors. Sentiment Analysis of Service Quality of Online Healthcare Platform Using Fast Large-Margin. 2020 International Conference on Informatics, Multimedia, Cyber and Information System (ICIMCIS). IEEE; 2020.

  31. Alayba AM, Palade V, England M, Iqbal R, editors. Arabic language sentiment analysis on health services. 2017 1st international workshop on arabic script analysis and recognition (asar). IEEE; 2017.

  32. del Arco FMP, Valdivia MTM, Zafra SMJ, González MDM, Cámara EM. COPOS: corpus of patient opinions in Spanish. Application of sentiment analysis techniques. Procesamiento del Lenguaje Natural. 2016;57:83–90.

  33. Freedman RA, Viswanath K, Vaz-Luis I, Keating NL. Learning from social media: utilizing advanced data extraction techniques to understand barriers to breast cancer treatment. Breast Cancer Res Treat. 2016;158:395–405.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Khaleghparast S, Maleki M, Hajianfar G, Soumari E, Oveisi M, Golandouz HM, et al. Development of a patients’ satisfaction analysis system using machine learning and lexicon-based methods. BMC Health Serv Res. 2023;23(1):1–12.

    Article  Google Scholar 

  35. Acharjee UK, Arefin M, Hossen KM, Uddin MN, Uddin MA, Islam L. Sequence-to-sequence learning-based conversion of pseudo-code to source code using neural translation approach. IEEE Access. 2022;10:26730–42.

    Article  Google Scholar 

  36. Tehran University of Medical Science Cancer Institute; Available from: https://en.tums.ac.ir/appraisetoraise/en/content/513/the-cancer-institute-ci-.

  37. Rezaee R, Asadi S, Yazdani A, Rezvani A, Kazeroon AM. Development, usability and quality evaluation of the resilient mobile application for women with breast cancer. Health Science Reports. 2022;5(4): e708.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Shamsfard M. Challenges and open problems in Persian text processing. Proc LTC. 2011;11:65–9.

    Google Scholar 

  39. Kasmaiee S, Kasmaiee S, Homayounpour M. Correcting spelling mistakes in Persian texts with rules and deep learning methods. Scientific Reports. 2023;13(1):19945.

  40. Dolamic L, Savoy J. When stopword lists make the difference. J Am Soc Inform Sci Technol. 2010;61(1):200–3.

    Article  Google Scholar 

  41. Dolamic L, Savoy J, editors. Persian language, is stemming efficient? 2009 20th International Workshop on Database and Expert Systems Application. IEEE; 2009.

  42. Academy of Persian Language and Literature. Available from: https://apll.ir/.

  43. Basiri ME, Naghsh-Nilchi AR, Ghasem-Aghaee N. Sentiment prediction based on dempster-shafer theory of evidence. Mathematical Problems in Engineering. 2014; 2014.

  44. Thelwall M, Buckley K, Paltoglou G. Sentiment strength detection for the social web. J Am Soc Inform Sci Technol. 2012;63(1):163–73.

    Article  Google Scholar 

  45. Calvo RA, Mac KS. Emotions in text: dimensional and categorical models. Comput Intell. 2013;29(3):527–43.

    Article  Google Scholar 

  46. Blei DM, Ng AY, Jordan MI. Latent dirichlet allocation. J Mach Learn Res. 2003;3(Jan):993–1022.

  47. Jebamani B, Murugeswari R, Nagaraj P. Online Appointment Management System in Hospitals Using Distributed Resource Allocation Algorithm. Sentimental Analysis and Deep Learning: Springer; 2022. p. 291–301.

  48. Vidnes TK, Wahl AK, Andersen MH. Patient experiences following liver transplantation due to liver metastases from colorectal cancer. Eur J Oncol Nurs. 2013;17(3):269–74.

  49. Mazza M, Piperis M, Aasaithambi S, Chauhan J, Sagkriotis A, Vieira C. Social Media Listening to Understand the Lived Experience of Individuals in Europe With Metastatic Breast Cancer: A Systematic Search and Content Analysis Study. Front Oncol. 2022;12.

  50. Zhang S, Bantum E, Owen J, Elhadad N, editors. Does sustained participation in an online health community affect sentiment? AMIA Annual Symposium Proceedings; 2014: American Medical Informatics Association.

  51. Shirzadi SM, Sheibani-Tehrani D, Shirzadi NS, Yaghoubi M. Identifying factors affecting hoteling quality in Iranian public hospitals using DEMATEL approach. J Educ Health Promot. 2020;9.

  52. Alemi F, Torii M, Clementz L, Aron DC. Feasibility of real-time satisfaction surveys through automated analysis of patients' unstructured comments and sentiments. Qual Manag Health Care. 2012;21(1):9–19.

  53. Shams M, Shakery A, Faili H, editors. A non-parametric LDA-based induction method for sentiment analysis. The 16th CSI international symposium on artificial intelligence and signal processing (AISP 2012). Shiraz, Iran: IEEE; 2012.

  54. Bagheri A, Saraee M, de Jong F, editors. Sentiment classification in Persian: Introducing a mutual information-based method for feature selection. 2013 21st Iranian conference on electrical engineering (ICEE). IEEE; 2013.

  55. Saraee M, Bagheri A, editors. Feature selection methods in Persian sentiment analysis. Natural Language Processing and Information Systems: 18th International Conference on Applications of Natural Language to Information Systems, NLDB 2013, Salford, UK, June 19–21, 2013 Proceedings 18. Springer; 2013.

Download references

Acknowledgements

We thank the Research Deputy of the Tehran University of Medical Sciences for financially supporting this project. We also would like to thank all experts who participated in this study.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Tehran University of Medical Sciences [grant number 1399.377].

Author information

Authors and Affiliations

Authors

Contributions

AY and AN planned the study design and coordinated the conduct of the study. MK and MSH collected the study data and participated in the drafting of the manuscript. AY carried out the analysis and interpretation of the data and drafted this paper. AY provided the technical counseling on study design and critically revised the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Azin Nahvijou.

Ethics declarations

Ethics approval and consent to participate

The study protocol was approved before the initiation of this study by the Ethics Committee of Tehran University of Medical Sciences (approval ID: IR.TUMS.IKHC.REC.1399.377). The study was performed in compliance with this institutional guideline, ethical guidelines for clinical research of the Iran government. The participants gave their informed consent to participate.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Some parts of the codes.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yazdani, A., Shamloo, M., Khaki, M. et al. Use of sentiment analysis for capturing hospitalized cancer patients' experience from free-text comments in the Persian language. BMC Med Inform Decis Mak 23, 275 (2023). https://doi.org/10.1186/s12911-023-02358-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12911-023-02358-2

Keywords