- Open Access
Comparison of usability evaluation methods for a health information system: heuristic evaluation versus cognitive walkthrough method
BMC Medical Informatics and Decision Making volume 22, Article number: 157 (2022)
There are differences of opinion regarding the selection of the most practical usability evaluation method among different methods. The present study aimed to compare two expert-based evaluation methods in order to assess a nursing module as the most widely used module of a Hospital Information System (HIS).
Five independent evaluators used the Heuristic Evaluation (HE) and Cognitive Walkthrough (CW) methods to evaluate the nursing module of Shafa HIS. In this regard, the number and severity of the recognized problems according to the usability attributes were compared using two evaluation methods.
The HE and CW evaluation methods resulted in the identification of 104 and 24 unique problems, respectively, of which 33.3% of recognized problems in the CW evaluation method overlapped with the HE method. The average severity of the recognized problems was considered to be minor (2.34) in the HE method and major (2.77) in the CW evaluation method. There was a significant difference in terms of the total number and average severity of the recognized problems by these methods (P < 0.001). Based on the usability attribute, the HE method identified a larger number of problems concerning all usability attributes, and a significant difference was observed in terms of the number of recognized problems in both methods for all attributes except ‘memorability’. Also, there was a significant difference between the two methods based on the average severity of recognized problems only in terms of ‘learnability’.
The HE method identified more problems with lower average severity while the CW was able to recognize fewer problems with higher average severity. Regarding the evaluation goal, the HE method was able to be used to improve the effectiveness and satisfaction of the HIS. Furthermore, the CW evaluation method is recommended to identify usability problems with the highest average severity, especially in terms of ‘learnability’.
In recent years, healthcare organizations in many countries have primarily invested in developing and implementing various information technology systems [1, 2]. Moreover, the usability of health information technology systems has always been an area of concern for such organizations . Well-designed health information systems can support the flow of clinical information in several ways, thereby improving patient care . The nursing module of the Hospital Information System (HIS) can be regarded as one of the most crucial and widely used healthcare information systems , which needs to be integrated into nurses' daily practices . In this regard, a system's efficient and effective use heavily depends on its proper design and the fulfillment of users' expectations and a poor design and improper usage can trigger several problems in accepting and using the system [6,7,8]. Problems would lead to user dissatisfaction [9, 10], increased error rates, and decreased safety and quality of patient care services . Accordingly, various usability attributes should be concerned in developing interactive health information systems such as the nursing module . According to the International Organization for Standardization (ISO), usability attributes encompass efficiency, effectiveness and satisfaction , while Nielsen considers learnability, efficiency, memorability, error and satisfaction as usability attributes of the information systems .
The usability evaluation of information system user interfaces can be performed by several methods . These methods are divided into two groups: user-based testing methods, in which users identify the problems which prevent them from performing tasks, and expert-based inspection methods, in which experts identify general problems with the system user interface [15, 16]. Each of these methods has its own advantages .
The Heuristic Evaluation (HE) and Cognitive Walkthrough (CW) are two common expert-based evaluation methods, which can recognize system usability problems effortlessly, quickly and economically [15, 17, 18]. In the HE method, the system user interface is evaluated concerning a set of predefined principles, known as heuristics . This method, performed by three to five evaluators, can recognize up to 80% of the usability problems . As a task-specific and structured evaluation method, the CW evaluation method adopts the principles of cognitive psychology to simulate cognitive processes and user actions to perform specific tasks using a computer system [21, 22].
Although there are many usability problems in information systems, most problems can be recognized and modified by proper planning and selecting an appropriate evaluation method . In other words, improving the usability of health information systems requires a deep understanding of various usability evaluation methods and relevant knowledge of how to implement these methods . However, there might be several challenges and inconsistencies in selecting the most appropriate usability evaluation method and implementing the findings obtained from each method . In this case, if there are a large number of evaluation methods available to be applied in a study, the findings of previous studies, particularly comparative studies, should be considered while selecting the most appropriate method so that they can be highly beneficial in describing the advantages and disadvantages of each method and predicting what is to be achieved by applying each method. Consequently, the most appropriate evaluation method can be selected based on the evaluation goal.
Numerous studies have compared the findings of the HE and CW evaluation methods and other usability evaluation methods [24,25,26,27,28,29,30]. In a study done by Khajouei et al. , the HE and CW evaluation methods were compared based not only on the number and severity of the recognized problems but also on a blend of ISO and Nielsen usability attributes. Khajouei et al.'s study , as the only research in this regard, was carried out on the clinical module of a medical office management system. Regarding the limitations of their concerned system, it was specifically designed for physicians with low functional capabilities and had a limited number of installations across the country compared to other health information systems. Accordingly, it seems necessary to perform similar studies to compare the abovementioned evaluation methods on large-scale health information systems with more capabilities, which are expected to have a more appropriate design due to the large number of their installations and users across the country. Also, the breadth of the evaluated information system makes the results of the comparison between the HE and CW methods more generalizable to other health information systems and we can be more confident about the results of comparing the two methods in the selection of the most appropriate evaluation method in other health information systems. This study aimed to compare the number, severity, and type of the recognized problems according to ISO and Nielsen’s usability attributes using the HE and CW evaluation methods in a nursing module of one of the largest HISs installed in many hospitals across the country.
The HE and CW evaluation methods were compared in the nursing module of Shafa HIS (developed by Tirajeh Rayaneh Co.). By the time this research study was conducted, the system had been installed in more than 200 hospitals across the country. It enjoys various capabilities as follows: admitting patients and allocating bed and room for patients in the inpatient ward, recording requests for paraclinical services and monitoring the results, recording requests for a surgical procedure, recording requests to transfer and move a patient to another ward and recording a patient's information at the discharge time. The nurses and secretaries working in inpatient wards have completely incorporated this HIS into their daily lives. This nursing module was adopted using a comparative evaluation as it encompasses the largest group of users and is the most prominent clinical module in any HIS.
In the present study, the usability evaluation of the nursing module of Shafa HIS was performed in the laboratory of health information technology at Kashan University of Medical Sciences using the HE and CW evaluation methods. The results of these expert-based evaluation methods were then compared.
Nielsen and Molich first introduced this method . According to the Nielsen approach, a summarized list of heuristic principles is given to evaluators, which can be used as a guideline and then the user interface is independently examined by each evaluator, following which relevant usability problems are recognized . Ten heuristics principles were developed by Nielsen, including visibility of system status, the match between the system and the real world, user control and freedom, consistency and standards, helping users recognize, diagnose, and recover from errors, error prevention, recognition rather than recall, flexibility and efficiency of use, aesthetic and minimalist design, and help and documentation that must be observed in the user interface design . According to this method, evaluators are asked not to share ideas with one another before the evaluation is completed as a single evaluator might fail to recognize a large number of problems, while a variety of evaluators can recognize a wide range of unique issues. Hence, more comprehensive results can be yielded after the findings of a number of HE evaluations are combined . In case of limited time and resources, the usability problems will quickly and economically be identified through this method by three to five evaluators [20, 33, 34].
Cognitive walkthrough evaluation
The CW is a popular expert-based method with an emphasis on ease of learning the system [22, 35]. This method is considered suitable when users need to masterfully learn a new application or function by learning through exploration. To carry out the CW, the system user interface design needs to be precisely descripted and task scenario, assumptions about the users and the scope of use along with the series of actions that users take to successfully perform a given task need to be determined. Then, a series of cognitive processes followed by users during the performance of a number of actions are simulated by an evaluator or a group of evaluators so as to perform specific tasks. During this phase, the evaluators try to decide on the actions which are not easy for ordinary users by learning the interface behavior and the influence it has on the user. Therefore, this evaluation method could be performed at an early phase of system development to meet user needs .
As only three to five evaluators are needed to do the HE and CW evaluations , the maximum number of evaluators (five) did the evaluation in a random and purposeful fashion. Three evaluators were Ph.D. students in health information management and two evaluators had M.Sc. degrees in health information technology. The evaluators had previous experience with performing the HE and CW evaluations and were familiar with various healthcare information systems . Furthermore, the processes and workflow of the nursing information system were also explained to them.
In this study, the CW evaluation was primarily done by the evaluators in order to prevent the adverse effect of learning the system on the CW evaluation results and, after a period of two weeks, the HE evaluation was performed. Khajouei et al. , in order to avoid the effect of learning the system on the results of the HE and CW methods, conducted their study in two phases by reversing the order of evaluators of each method in the first and second phases and after the washing period; no significant difference was observed between the number of recognized problems in the first and second phases of the HE and CW evaluations. Hence, in this study, in order to prevent the effect of learning, the HE evaluation was done two weeks after the CW evaluation method. Nielsen's usability principles were explained to the evaluators and they were told to evaluate the user interface in accordance with the checklist. In order to prevent bias, the HE evaluation was done on those parts of the nursing module in which five CW evaluation scenarios had been performed.
A method suggested by Polson and Lewis was used to do the CW evaluation . Based on nurses’ daily routine tasks and secretaries of inpatient wards, five scenarios were designed using the nursing module based on nurses' opinions and the approval of the head nurse. For each scenario, users' goals and sub-goals, the series of actions performed for each task and the system response were defined. Table 1 shows an example of a scenario including tasks and actions. Each evaluator independently imagined what a real user would do based on background information available to system users and then accomplished the series of actions needed for each task. Following looking at the system from users' point of view, each evaluator determined (a) users’ goals which would lead to actions, (b) if the prompts and labels of the interface would make the user perform the correct task based on the correct goals and (c) the effect of users’ goals on response to the feedback from the interface after performing the action. Any potential issues were reported to the researcher . The comments, questions, and ambiguities raised by the evaluators as well as the problem and its location were recorded by the researcher, who was also considered as the observer. In the final phase of the evaluation process, the evaluators reviewed their problem lists and either added a comment or corrected a previously given comment, if necessary. Then, in a meeting with the researcher and evaluators, all lists were compared and duplicate problems were eliminated and finally a list of individual usability problems was prepared. Finally, this list was provided to the evaluators, who independently determined the level of severity of each problem on a scale ranging from 0 to 4 according to the frequency of the problem, its impact on users, and its persistence . Problem severity was graded as follows:
0 = No problem: This is not a usability problem.
1 = Cosmetics: Correction is required only if more time is available.
2 = Minor: There is a low priority for problem correction.
3 = Major: There is a high priority for problem correction.
The ISO and Nielsen usability attributes were used to recognize usability problems [12, 13]. Table 2 shows the usability attributes of both the ISO and Nielsen. Evaluators independently assigned the recognized problems to one of the usability attributes.
Following a two-week period, once the CW evaluation was completed, five evaluators were asked to independently evaluate the user interface of the nursing module using the Nielsen's usability principles checklist, which was based on the Xerox heuristics checklist . This checklist had been designed according to Nielsen's ten usability principles and included 254 items with multiple-choice questions where options included "Yes," "No," and "Not Applicable" answers. The validity and reliability of the checklist had previously been confirmed in a study by Rezaei-Hachesu et al. . The recognized problems were then listed in the problem report form, which comprised a four-column table including "problem title," "problem description," "location of the problem," and "violated usability principle" as the head of columns.
Subsequently, the recognized problems were analyzed in a meeting with the five evaluators, where the repeated problems were discarded and a comprehensive list of unique problems was made. Moreover, disagreements about the identified problems were resolved in this meeting. Finally, similar to the CW evaluation, the evaluators independently determined the level of severity of each problem on a scale of 0–4 and assigned the recognized problems to a specific usability attribute.
The data was analyzed using SPSS Statistics for Windows, version 20.0 (SPSS Inc., Chicago, Ill., USA) using both descriptive and inferential statistical techniques. The average severity of the usability problems was calculated. Then, each usability problem was assigned into one of the five categories shown in Table 3, according to its average severity . Furthermore, each problem was associated with one of the usability attributes in which it had the most frequency. If the problem was classified into more than one usability attribute based on its maximum frequency, the problem was associated with the most relevant usability attribute according to the evaluators' opinion .
The one-variable Chi-square test was used to compare the total number of recognized usability problems and the number of recognized problems in terms of usability attributes by the two methods. Moreover, the Chi-square test was used to compare the number of recognized problems according to the usability attributes in each of the two evaluation methods. The average severity levels of the problems identified in both methods were compared using the Mann–Whitney U test. A significance level of 0.05 was considered in this study.
The present study compared the usability evaluation results obtained by the HE and CW methods in the nursing module of an HIS. The HE method was able to identify 317 problems, of which 104 unique problems remained after merging the problems and discarding the repeated ones. Similarly, 46 problems were recognized in the CW evaluation, of which 24 unique problems remained after merging the problems and discarding the repeated ones; the one-variable Chi-square test indicated a significant difference between the number of the recognized problems in the two evaluation methods (P < 0.001).
In this study, 57 problems (54.8%) were considered as minor problems and 33 of them (31.7%) were considered as major problems out of the unique problems recognized in the HE method. In the CW evaluation, 12 problems (50%) were considered as major problems and 10 problems (41.7%) were considered as minor problems out of the unique problems recognized. The average severity of usability problems was 2.34 and 2.77 in the HE and CW methods, respectively, and the Mann–Whitney U test revealed a significant difference based on these figures (Table 4).
The number and average severity of the recognized problems according to the usability attributes are shown in Fig. 1.
According to Chi-square test, the difference between the number of the recognized problems based on usability attribute in the HE method was significant (P < 0.001). Most of the problems recognized through the HE method were related to effectiveness and satisfaction. In the CW method, while most recognized problems were related to learnability and efficiency, no significant difference was observed between the number of the recognized problems based on usability attributes in this method (P = 0.156).
According to Table 5, a significant difference was observed between the total number of the identified problems in the HE and CW evaluation methods (P < 0.001). The HE method identified more problems in all usability attributes than the CW method and there was a significant difference between the two methods based on efficacy, effectiveness, satisfaction, learnability and error (P < 0.001). However, the number of recognized problems in terms of memorability did not show a statistically significant difference (P > 0.05).
From among the total number of recognized problems in two methods (n = 128), 80% were identified by the HE method, 13.3% by the CW method and 6.7% by both methods. In other words, 33.3% (n = 8) of the recognized problems in the CW method overlapped with the recognized problems in the HE method. Regarding efficiency, effectiveness and error, one problem and for learnability and memorability, three and two problems were identified by both methods.
The usability evaluation of the nursing module of Shafa HIS was performed by the HE and CW evaluation methods. The higher number of problems was recognized by the HE method as compared to the CW method, while the average severity of recognized usability problems by the HE was less than that of the CW method.
A significant difference was observed between the total number of usability problems recognized by the HE and CW methods in the usability evaluation of the nursing module. In the studies conducted by Jeffries et al.  and Frøkjær and Lárusdóttir , it was also indicated that the HE method can recognize more usability problems compared to the CW evaluation, which is compatible with the results of the present study. It should be noted that the number of recognized problems in the HE method depends on the expertise and skill of the evaluators [26, 42–45]. In another study performed by Khajouei et al. , it was indicated that no significant difference was seen between the numbers of recognized problems in these two evaluation methods, which was incompatible with the findings of the present study. This may be because Nielsen's usability principles checklist was used in the present study, while in the study conducted by Khajouei et al. , no usability checklist was used to carry out the HE method. As a way to confirm this notion, another study done by Khajouei et al.  showed that using a checklist in performing the HE method can lead to identifying a higher number of usability problems than when the evaluation is performed without a checklist.
Based on the obtained results, more than half of the recognized problems by the HE method were considered as minor problems, while %50 of the problems recognized by the CW evaluation were considered as major problems and the average severity of usability problems in this method was significantly higher than that of the HE method. Some studies indicated that most of the recognized problems by the HE method are in the minor or cosmetic categories [14, 44, 46], which is compatible with the findings of the present study. Contrary to these findings, other studies reported that most of the recognized problems by the HE method were major and catastrophic [47, 48]. In a study performed by Jeffries et al. , it was revealed that two-thirds of the problems identified by the HE method were low-severity problems. The difference in the types of recognized problems due to investigating different health information systems can be considered as a source of this inconsistency. The skill and expertise of evaluators, which is highly effective in estimating the severity of recognized problems , can be another reason for the inconsistency observed in findings.
The comparison of the average severity between the two methods based on the usability attribute showed that the average severity of learnability problems in the CW method was significantly higher than that of the HE method. Cuomo and Bowen  proposed that the CW evaluation was the superior method in identifying certain problems that either affect user performance or focus on how users interact with the user interface while performing specific tasks . Therefore, as a task-independent method, the HE method illustrates various aspects of a system user interface and identifies specific problems that users hold in low regard for taking corrective actions as compared with the CW method.
The HE method recognized more problems in each of the usability attributes than the CW method, leading to a statistically significant difference between the number of recognized problems in both methods based on all usability attributes except memorability. In the study conducted by Khajouei et al. , the number of satisfaction problems identified in the HE method was significantly higher than that of the CW method, which is consistent with the results of the present study. However, they also did not observe a significant difference in the number of problems among other usability attributes, which is not consistent with the results of the present study. As previously mentioned, Nielsen’s usability principles checklist was used in the study done by of Khajouei et al. , therefore more problems in each of the usability attributes were observed in the HE method compared to the CW method in this study, which was not unexpected. However, we have no assumptions about the generalizability of this result in the evaluation of other information systems, and a similar study on other information systems might yield different results.
In another study done by Khajouei et al. , the HE method identified a significantly higher number of problems related to satisfaction, learnability and error attributes than the think aloud method as one of the usability testing methods, which is consistent with the results of the present study. Usability testing is a widely used technique to evaluate system utility in achieving goals from the perspective of end users  and is used to assess the ease of system use and identify the problems that users encounter when working with the system . Nielsen considers usability testing to be the most appropriate method due to end-user participation. However, he suggests expert-based evaluation methods in order to gather additional usability information . According to a study conducted by Paz et al. , many usability problems identified through usability testing had previously been identified through the HE evaluation; however, there were significant differences regarding the significance of the problems identified through both methods. Therefore, it would be desirable to perform user-based and expert-based evaluation methods in order to validate the results of each method.
Comparison of the two methods in terms of the overlap between the recognized problems showed that only one-third of the recognized problems in the CW method overlapped with the recognized problems through the HE method. In fact, neither of the two methods alone can identify all usability problems, and the HE method, while identifying more problems in each of the usability attributes, cannot replace the CW evaluation method.
Based on the results of the HE evaluation method, a significant difference was observed between the number of recognized problems based on usability attributes. Most of the attribute problems in this method were related to the effectiveness and satisfaction. In the study done by Khajouei et al. , the highest number of recognized problems in the HE method was related to satisfaction and the lowest number of problems was related to effectiveness, and there was no significant difference between the number of problems regarding different attributes. It is acknowledged that health care is a complex environment for information systems due to complex contextual dynamics and rapid changes in its operating context [53, 54]. Furthermore, integration of healthcare information systems into work processes affects their use by healthcare workers . Implementing HISs in such an environment is more complex and costly than implementing other information systems . Successful implementation of an HIS depends on user satisfaction. Therefore, factors influencing user satisfaction should be considered when designing, developing or adopting such systems . Moreover, user satisfaction is often used as a measurement of users’ perception of the effectiveness of an information system . As a rule of thumb, effectiveness can be achieved when users perform their activities and tasks wholly and accurately via the system . In a complex healthcare environment, the effectiveness of health information systems reduces errors by improving legibility, completeness and accuracy of information required for users to make decisions and perform their tasks, having a positive effect on improving quality of care [60,61,62]. It can be stated that, according to the results of the present study and Khajouei et al.'s study , the HE method is a validated method to identify satisfaction problems. Also, according to the results of this study, the HE method is powerful for identifying the problems of effectiveness, which is contrary to the findings reported by Khajouei et al. . Therefore, further studies of this type on different systems are needed to confirm this finding and determine the capabilities of each method to identify other attributes.
According to the results, while in the CW method, the highest number of recognized problems was related to learnability and efficiency, no significant difference was observed between the number of recognized problems based on the usability attribute in this method. In the study done by Khajouei et al. , the highest number of problems identified through the CW evaluation method was related to learnability. Learnability refers to the speed at which new or novice users learn a system user interface to perform various tasks . Efficiency refers to the users' time and mental effort and the costs involved to achieve their goals with minimum resources and costs [28, 64]. Within the dynamic healthcare environment, healthcare information systems are being constantly implemented, changed and updated [65,66,67]. Meanwhile, usability problems and users’ unfamiliarity with the user interface lead users to spend more time completing tasks by the system, hence, decreasing the efficiency [59, 62]. On the other hand, training time in HISs is highly limited for the healthcare workers such as nurses who are essentially required to learn how to work with the system while doing their duties . By performing the CW evaluation and modifying the usability problems recognized by this method can ultimately decrease user cognitive load and system learning time and increase system efficiency.
Strengths and weaknesses
This study compared two expert-based usability evaluation methods in a nursing module as the most important and widely used HIS module. Moreover, five evaluators participated in this study, which is an adequate number of evaluators to perform the HE and CW evaluations. Furthermore, the evaluators had previous experience with conducting several HE and CW evaluations and either had work experience or were familiar with various HISs and, according to previous studies [42,43,44], the participation of skilled and experienced evaluators is beneficial in identifying the most significant number and the most severe usability problems. Another strength of this study was that five scenarios were identified based on nurses' most important and frequent daily tasks performed via the nursing module. These scenarios were formed according to the opinion of three experienced nurses working in the inpatient ward, which the head nurse then approved.
However, evaluating the nursing module of the HIS can be considered as a limitation in this study. Since designing different user interfaces for various HISs can influence the number and severity of recognized problems and consequently the number of problems attributed to each usability attribute in each of the HE and CW methods, the findings of this study cannot be fully generalized to all HISs. Another limitation of this study is the possibility of the effect of CW evaluation on the results of the HE evaluation method, while, according to Khajouei et al.'s findings , performing the CW before the HE has no effect on its results. However, in this study, the number of problems identified in the HE method was more than that of the CW evaluation method.
Comparing the HE and CW methods regarding the usability evaluation of a nursing module, it seems that the HE method can identify a higher number of problems compared to the CW evaluation method due to its broad overview of a system design based on a set of predefined principles, while the evaluation of specific tasks in detail is avoided. However, users may be less concerned with adopting corrective measures for these problems. Additionally, this study indicated that performing a reasonable number of tasks in the CW evaluation leads to identifying fewer problems with higher average severity than the HE method. Also, according to the results of this study, the HE evaluation seems to prove more effective and accurate in identifying usability problems affecting effectiveness and satisfaction and the CW method seems to be more suitable for identifying learnability problems with higher average severity.
Availability of data and materials
The required data to analyze and achieve the purpose of this study are presented in this published article and its Additional file 1.
Hospital Information System
International Organization for Standardization
Carayon P, Hoonakker P. Human factors and usability for health information technology: old and new challenges. Yearb Med Inform. 2019;28(1):071–7.
Farzandipour M, Ahmadi M, Sadoughi F. Adopting confidentiality principles for electronic health records in Iran: a Delphi study. J Med Syst. 2011;35(3):333–43.
Ammenwerth E, Ehlers F, Hirsch B, Gratl G. HIS-Monitor an approach to assess the quality of information processing in hospitals. Int J Med Inform. 2007;76(2–3):216–25.
Farzandipour M, Meidani Z, Riazi H, Jabali MS. Nursing information systems requirements: a milestone for patient outcome and patient safety improvement. Comput Inform Nurs. 2016;34(12):601–12.
Tubaishat A. Perceived usefulness and perceived ease of use of electronic health records among nurses: application of technology acceptance model. Inform Health Soc Care. 2018;43(4):379–89.
Farzandipour M, Riazi H, Jabali MS. Proposing electronic health record usability requirements based on enriched ISO 9241 metric usability model. Acta Inform Med. 2018;26(1):29.
Gholami M, Damanabi S, Hachesu PR, Ghyassi FS. Evaluation of nursing information systems using the HIS-monitor instrument: nurses perspectives. Iran J Med Inform. 2019;8(1):9.
Moghaddasi H, Rabiei R, Asadi F, Ostvan N. Evaluation of nursing information systems: application of usability aspects in the development of systems. Healthc Inform Res. 2017;23(2):101–8.
Chow SK, Chin WY, Lee HY, Leung HC, Tang FH. Nurses’ perceptions and attitudes towards computerisation in a private hospital. J Clin Nurs. 2012;21(11–12):1685–96.
Farzandipour M, Meidani Z, Riazi H, Sadeqi JM. Task-specific usability requirements of electronic medical records systems: lessons learned from a national survey of end-users. Inform Health Soc Care. 2018;43(3):280–99.
Bowman S. Impact of electronic health record systems on information integrity: quality and safety implications. Perspect Health Inf Manag. 2013;10(Fall):1c.
ISO 9241-11. Ergonomic reqirements for Office work with visual display terminals (VDTs)—part 11: guidance on usability. International Organization for Standardization, Geneva, Switzerland, ISO 9241-11:1998;1998.
Nielsen J. Usability 101: introduction to usability. Fremont: Nielsen Norman Group; 2012.
Joshi A, Perin DMP, Amadi C, Trout K. Evaluating the usability of an interactive, bi-lingual, touchscreen-enabled breastfeeding educational programme: application of Nielson’s heuristics. J Innov Health Inform. 2015;22(2):265–74.
Jaspers MW. A comparison of usability methods for testing interactive health technologies: methodological aspects and empirical evidence. Int J Med Inform. 2009;78(5):340–53.
Khajouei R, Farahani F. A combination of two methods for evaluating the usability of a hospital information system. BMC Med Inform Decis Mak. 2020;20:84.
Bligård L-O, Osvalder A-L. Enhanced cognitive walkthrough: development of the cognitive walkthrough method to better predict, identify, and present usability problems. Adv Hum Comput Interact. 2013. https://doi.org/10.1155/2013/931698.
Qiu S, Du L, Han T, Hu J. Flavor explore: rapid prototyping and evaluation of user interfaces. In: International conference on distributed, ambient, and pervasive interactions. Las Vegas: Springer; 2018. p. 114–123.
Nielsen J. Usability engineering. San Diego: Morgan Kaufman, Academic Press; 1993.
Nielsen J. Heuristic evaluation in usability inspection methods (RL Mack). New York: Wiley; 1994.
Kushniruk AW, Patel VL. Cognitive and usability engineering methods for the evaluation of clinical information systems. J Biomed Inform. 2004;37(1):56–76.
Polson PG, Lewis C, Rieman J, Wharton C. Cognitive walkthroughs: a method for theory-based evaluation of user interfaces. Int J Man Mach Stud. 1992;36(5):741–73.
Mendes D. The reliability of usability evaluations. Waltham: Bentley University; 2011.
Cuomo DL, Bowen CD. Understanding usability issues addressed by three user-system interface evaluation techniques. Interact Comput. 1994;6(1):86–108.
Frøkjær E, Lárusdóttir MK. Prediction of usability: comparing method combinations. In: 10'th International conference of the information ressources management association. Hershey, PA, USA; 1999. p. 284–257.
Jeffries R, Miller JR, Wharton C, Uyeda K. User interface evaluation in the real world: a comparison of four techniques. In: Proceedings of the SIGCHI conference on Human factors in computing systems: reaching through technology. New York: ACM Press; 1991. p. 119–124.
Khajouei R, Hasman A, Jaspers MW. Determination of the effectiveness of two methods for usability evaluation using a CPOE medication ordering system. Int J Med Inform. 2011;80(5):341–50.
Khajouei R, ZahiriEsfahani M, Jahani Y. Comparison of heuristic and cognitive walkthrough usability evaluation methods for evaluating health information systems. J Am Med Inform Assoc. 2017;24(e1):e55–60.
Maguire M, Isherwood P. A comparison of user testing and heuristic evaluation methods for identifying website usability problems. In: Marcus A, Wang W, editors. Design, user experience, and usability: theoty and practice: 7th international conference (DUXU 2018), Las Vegas, NV, USA, 15–20 July 2018, Part I. p. 429–438.
Yen P-Y, Bakken S. A comparison of usability evaluation methods: heuristic evaluation versus end-user think-aloud protocol—an example from a web-based communication tool for nurse scheduling. In: AMIA annual symposium proceedings, American Medical Informatics Association; 2009. p. 714–718.
Nielsen J, Molich R. Heuristic evaluation of user interfaces. In: Proceedings of the SIGCHI conference on human factors in computing systems. Empowering People, Seattle, Washington, DC, USA; 1990. p. 249–256.
Ahmed SZ. A comparison of usability techniques for evaluating information retrieval system interfaces. Perform Meas Metrics. 2008;9(1):48–58.
Brewster S, Dunlop M. Mobile human-computer interaction-mobile HCI 2004: 6th international symposium, Glasgow, UK; 2004. p. 3160.
Khajouei R, Gohari SH, Mirzaee M. Comparison of two heuristic evaluation methods for evaluating the usability of health information systems. J Biomed Inform. 2018;80:37–42.
Zaini NA, Noor SF, Wook TS. Evaluation of api interface design by applying cognitive walkthrough. Int J Adv Comput Sci Appl. 2019;10(2):306–15.
Farzandipour M, Nabovati E, Tadayon H, Sadeqi JM. Usability evaluation of a nursing information system by applying cognitive walkthrough method. Int J Med Inform. 2021;152:104459.
Nielsen J. Severity ratings for usability problems. Papers Essays. 1995;54:1–2.
Okhovati M, Karami F, Khajouei R. Exploring the usability of the central library websites of medical sciences universities. J Librariansh Inform Sci. 2017;49(3):246–55.
Pierotti D. Heuristic evaluation-a system checklist. Xerox Corporation, Society for Technical Communication; 1995. p. 12.
Rezaei-Hachesu P, Pesianian E, Mohammadian M. Evaluating usability of radiology information systems in hospitals of tabriz university of medical sciences. Acta Inform Med. 2016;24(1):42.
Ahmadian L, Salehi F, Abedinzadeh A, Khatibi F. Usability evaluation of a radiology information system. J Health Adm. 2017;20(69):67–78.
Desurvire H, Kondziela J, Atwood ME. What is gained and lost when using methods other than empirical testing. In: Posters and short talks of the 1992 SIGCHI conference on Human factors in computing systems. New York: ACM; 1992. p. 125–126.
Atashi A, Khajouei R, Azizi A, Dadashi A. User Interface problems of a nationwide inpatient information system: a heuristic evaluation. Appl Clin Inform. 2016;7(01):89–100.
Lilholt PH, Jensen MH, Hejlesen OK. Heuristic evaluation of a telehealth system from the Danish TeleCare North Trial. Int J Med Inform. 2015;84(5):319–26.
Nielsen J. Finding usability problems through heuristic evaluation. In: Proceedings of the SIGCHI conference on human factors in computing systems. Onterey: ACM; 1992. p. 373–380.
Ebnehoseini Z, Tara M, Meraji M, Deldar K, Khoshronezhad F, Khoshronezhad S. Usability evaluation of an admission, discharge, and transfer information system: a heuristic evaluation. Open Access Maced J Med Sci. 2018;6(11):1941.
Nabovati E, Vakili-Arki H, Eslami S, Khajouei R. Usability evaluation of Laboratory and Radiology Information Systems integrated into a hospital information system. J Med Syst. 2014;38(4):35.
SadeqiJabali M, Nabovati E, Farzandipour M, Farrahi SR, Abbasi R. Evaluation of the usability of admission and medical record information system: a heuristic evaluation. J Health Biomed Inform. 2019;6(2):80–9.
Lewis C, Wharton C. Cognitive walkthroughs. In: Handbook of human-computer interaction. Second Completely Revised Edition, Amsterdam: Elsevier; 1997. p. 717–732.
McDaniel AM, Hutchison S, Casper GR, Ford RT, Stratton R, Rembusch M. Usability testing and outcomes of an interactive computer program to promote smoking cessation in low income women. In: Proceedings of the AMIA symposium. American Medical Informatics Association; 2002. p. 509.
Basyir M, Mellyssa W. Evaluating the quality of emergency reporting mobile application on usage service decision. In: IOP conference series: materials science and engineering. IOP Publishing; 2019. p. 012147.
Paz F, Paz FA, Villanueva D, Pow-Sang JA. Heuristic evaluation as a complement to usability testing: a case study in web domain. In: 12th international conference on information technology-new generations. IEEE; 2015. p. 546–551.
Cho S, Mathiassen L, Nilsson A. Contextual dynamics during health information systems implementation: an event-based actor-network approach. Eur J Inf Syst. 2008;17(6):614–30.
Andersson A, Vimarlund V, Timpka T. Management demands on information and communication technology in process-oriented health-care organizations: the importance of understanding managers’ expectations during early phases of systems design. J Manag Med. 2002;16(2–3):159–69.
Wakefield DS, Halbesleben JR, Ward MM, Qiu Q, Brokel J, Crandall D. Development of a measure of clinical information systems expectations and experiences. Med Care. 2007;45(9):884–90.
Farzandipour M, Meidani Z, Nabovati E, SadeqiJabali M, Dehghan BR. Technical requirements framework of hospital information systems: design and evaluation. BMC Med Inform Decis Mak. 2020;20(1):1.
Kalankesh LR, Nasiry Z, Fein RA, Damanabi S. Factors influencing user satisfaction with information systems: a systematic review. Galen Med J. 2020;9:e1686.
Aggelidis VP, Chatzoglou PD. Hospital information systems: measuring end user computing satisfaction (EUCS). J Biomed Inform. 2012;45(3):566–79.
Farrahi R, Jeddi FR, Nabovati E, SadeqiJabali M, Khajouei R. The relationship between user interface problems of an admission, discharge and transfer module and usability features: a usability testing method. BMC Med Inform Decis Mak. 2019;19(1):172.
Bryan C, Boren S. The use and effectiveness of electronic clinical decision support tools in the ambulatory/primary care setting: a systematic review of the literature. Inform Prim Care. 2008;16(2):79–91.
Hardenbol AX, Knols B, Louws M, Meulendijk M, Askari M. Usability aspects of medication-related decision support systems in the outpatient setting: a systematic literature review. Health Inform J. 2020;26(1):72–87.
Abbasi F, Khajouei R, Mirzaee M. The efficiency and effectiveness of surgery information systems in Iran. BMC Med Inform Decis Mak. 2020;20(1):1–8.
Johnson C, Johnston D, Crowle P. EHR usability toolkit: a background report on usability and electronic health records. Rockville: Agency for Healthcare Research and Quality; 2011.
Meidani Z, Farzandipour M, Farrokhian A, Haghighat M. A review on laboratory tests’ utilization: a trigger for cutting costs and quality improvement in health care settings. Med J Islam Repub Iran. 2016;30:365.
Ross J, Stevenson F, Lau R, Murray E. Factors that influence the implementation of e-health: a systematic review of systematic reviews (an update). Implement Sci. 2016;11(1):1–2.
Anwar F, Shamim A. Barriers in adoption of health information technology in developing societies. Int J Adv Comput Sci Appl. 2011;2(8):40–5.
Leung RC. Health information technology and dynamic capabilities. Health Care Manag Rev. 2012;37(1):43–53.
Berge M, Fossum M, Fruhling A. A cognitive walkthrough and focus group study of nursing personnel to improve EHRs used in nursing homes. In: Scandinavian conference on health informatics. Linköping: Linköping University Electronic Press; 2012. p. 13–18.
The authors would like to express their gratitude to the esteemed Vice-Chancellor of Kashan University of Medical Sciences and the management of the Health Information Technology Laboratory at Kashan University of Medical Sciences and the evaluators who participated in usability evaluation. Moreover, we would like to sincerely thank Mr. Hamidreza Tadayon for his assistance during this research.
The present research was supported by the Research Council of Kashan University of Medical Sciences [Grant No. 97031] and did not receive any grants from non-profit organizations and funding agencies in the public and commercial sectors.
Ethics approval and consent to participate
The present study was approved by the Research Ethics Committee of Kashan University of Medical Sciences Research Council (Number: 1397.18) and conducted following the guidelines of the Declaration of Helsinki. In accordance with the opinion of the above-mentioned Ethics Committee and given the fact that no information about participants is provided in this paper, evaluators who participated in usability evaluations had information about the study and they gave verbal consent to participate in it as informed consent. Moreover, by calling and giving information about the study and obtaining permission from the company supplying the Shafa HIS (Tirajeh Rayaneh Co) as informed consent, the usability evaluations were performed on the nursing module of this system and a list of recognized usability problems was provided to the Shafa HIS providers that can assist them in improving their software usability.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Farzandipour, M., Nabovati, E. & Sadeqi Jabali, M. Comparison of usability evaluation methods for a health information system: heuristic evaluation versus cognitive walkthrough method. BMC Med Inform Decis Mak 22, 157 (2022). https://doi.org/10.1186/s12911-022-01905-7
- Hospital information system
- Heuristic evaluation
- Cognitive walkthrough
- User-computer interface