This article has Open Peer Review reports available.
Health numeracy in Japan: measures of basic numeracy account for framing bias in a highly numerate population
- Masako Okamoto†1Email author,
- Yasushi Kyutoku†2,
- Manabu Sawada3,
- Lester Clowney2,
- Eiju Watanabe2,
- Ippeita Dan2 and
- Keiko Kawamoto1
© Okamoto et al.; licensee BioMed Central Ltd. 2012
Received: 31 December 2011
Accepted: 31 August 2012
Published: 11 September 2012
Health numeracy is an important factor in how well people make decisions based on medical risk information. However, in many countries, including Japan, numeracy studies have been limited.
To fill this gap, we evaluated health numeracy levels in a sample of Japanese adults by translating two well-known scales that objectively measure basic understanding of math and probability: the 3-item numeracy scale developed by Schwartz and colleagues (the Schwartz scale) and its expanded version, the 11-item numeracy scale developed by Lipkus and colleagues (the Lipkus scale).
Participants’ performances (n = 300) on the scales were much higher than in original studies conducted in the United States (80% average item-wise correct response rate for Schwartz-J, and 87% for Lipkus-J). This high performance resulted in a ceiling effect on the distributions of both scores, which made it difficult to apply parametric statistical analysis, and limited the interpretation of statistical results. Nevertheless, the data provided some evidence for the reliability and validity of these scales: The reliability of the Japanese versions (Schwartz-J and Lipkus-J) was comparable to the original in terms of their internal consistency (Cronbach’s α = 0.53 for Schwartz-J and 0.72 for Lipkus-J). Convergent validity was suggested by positive correlations with an existing Japanese health literacy measure (the Test for Ability to Interpret Medical Information developed by Takahashi and colleagues) that contains some items relevant to numeracy. Furthermore, as shown in the previous studies, health numeracy was still associated with framing bias with individuals whose Lipkus-J performance was below the median being significantly influenced by how probability was framed when they rated surgical risks. A significant association was also found using Schwartz-J, which consisted of only three items.
Despite relatively high levels of health numeracy according to these scales, numeracy measures are still important determinants underlying susceptibility to framing bias. This suggests that it is important in Japan to identify individuals with low numeracy skills so that risk information can be presented in a way that enables them to correctly understand it. Further investigation is required on effective numeracy measures for such an intervention in Japan.
Active involvement of patients in decision making for their own medical care, such as deciding whether or not to accept a particular treatment (informed consent), or choosing among medical-care options (informed choice), is becoming a worldwide practice . In Japan, informed consent was codified in the “Medical Service Act” in 1997, and is now common practice . Recent surveys show that moving beyond informed consent, Japanese patients are involved in informed decision making in clinical practice  and that they prefer this involvement . Consequently, it is very important to ensure that patients correctly understand medical information, so that their decisions, possibly made in life-threatening situations, reflect their true will.
One way to address this issue would be to assess patients’ health numeracy, the ability to understand probabilistic and mathematical concepts . Health numeracy has gained increasing attention, as the amount of quantitative information, such as the probability of survival outcomes for different medical treatments, is increasingly present in medical risk information [6, 7]. Previous studies assessing health numeracy have shown that individuals with low numeracy are more likely to misunderstand risk information, and their risk evaluations tend to be more influenced by context, such as how the related numbers are framed (reviewed in [5, 8–11]). While many of those studies used healthy respondents, studies with actual patients have shown the influence of numeracy on their disease-related decision making (e.g. [12–15]). Therefore, to ensure accurate medical-risk communication, it is important to know whether patients have a sufficient level of health numeracy.
Despite its importance, however, health numeracy and assessment scales have been largely understudied in many countries, including Japan. In fact, previous research is mostly from the United States and Europe, with research in other countries just beginning (e.g. [16–18]). These pioneering studies on cross-cultural comparisons have shown considerable differences in numeracy levels [16, 19], as well as the association of numeracy with decision making [20, 21] across different countries. Therefore, it is important to determine how previous findings apply to different countries, and develop strategies suitable for each case.
In Japan, the limited attention paid to health numeracy might be a reflection of the common belief that the majority of Japanese people enjoy basic numeracy. For example, since its inception in 2000, the Programme for International Student Assessment has rated the mathematical ability of Japanese students (aged 15 to 16 years) as higher than the international average, whereas that in the United States has been below average. While this might imply that Japanese patients are able to correctly use numerical medical data, recent pioneering work in Japan suggests this might not be the case . In their study, Takahashi et al. developed a 7-item scale to test the ability of Japanese patients to interpret medical information (TAIMI), with 3 of these items especially relevant to numeracy. Unexpectedly, more than 50 percent of respondents made mistakes in 2 of these items. Those 2 items evaluated the effect of a medicine, one presenting information as a fraction and the other as a natural frequency. This result suggests that Japanese health numeracy may not be as high as expected, and that there is a need for further investigation specifically focusing on health numeracy.
The aim of this study is twofold: 1) to evaluate Japanese versions of numeracy scales, and 2) to assess the health numeracy of Japanese adults.
We chose two well-known health numeracy scales that focus on the basic understanding of math and probability, the 3-item Schwartz scale , and its expanded version, the 11-item Lipkus scale . To date, these scales are among the most frequently used instruments in numeracy studies. Since the focus of these tests is different from that of TAIMI, we were interested in how Japanese people performed in these health numeracy scales. Japanese versions of these scales (Schwartz-J and Lipkus-J) were prepared using forward and backward translation procedures .
The reliability of the scales was assessed for internal consistency. The original scales were shown to be unidimensional; the original Schwartz study did not conduct a factor analysis, but Lipkus and colleagues evaluated the factor structure of their scale which included the Schwartz items . However, there are also studies showing multi-factor structure on these scales . Since factor structure could be different depending on the nature of a target population , an exploratory factor analysis was conducted to explore factor structure for the Japanese version. Convergent validity of numeracy scales was evaluated by their correlation with existing measures of health numeracy . As mentioned above, TAIMI has a two-factor structure where three items are specifically relevant to numeracy . Thus, we examined correlations between TAIMI scores and the Schwartz-J and Lipkus-J scores. We expected a positive correlation between performance on TAIMI and the scales translated in this study, specifically for the numeracy items of TAIMI (TAIMI-num).
To determine whether numeracy levels measured by these scales have any influence on medical risk communication in Japan, we examined the association between framing bias and performance on numeracy scales. Previous studies found that those with low scores on the Lipkus scale are more susceptible to framing effects [12, 19, 27, 28], an effect whereby different phrasing influences participant decisions based on mathematically identical data. We tested whether this can be seen with Japanese samples using the Schwartz-J and Lipkus-J scales. To approximate the Japanese population, we used quota sampling (n = 300) according to age, gender, and education level. In so doing, this study explores a method for assessing health numeracy in the Japanese population, with the aim of improving medical-risk communication.
Characteristics of the study sample (n = 300)
n = 300
Age (range, 20–69: % for each decade)
Gender (% women)
Education (% low attainment)
Household income (million yenb, %)
3 ≤ 5
3 ≤ 8
Hospital use (last year, % occasional or often)
all (possible range 0–7, mean ± SD)
4.7 ± 1.7
num (possible range 0–3, median ± IQR)
2.0 ± 2.0
Questionnaire items, correct response rate, and factor loading for each item for Schwartz-J and LipkusJ
1 Imagine that we rolled a fair, six-sided die 1,000 times. Out of 1,000 rolls,how many times do you think the die would come up even (2, 4, or 6)?
2 *In a lottery, the chance of winning a 1,000 yen prize is 1%.What is your best guess about how many people would win a 1,000 yen prize if 1,000 people each buy a single ticket to this lottery?
3 *In a lottery, the chance of winning a car is 1 in 1,000. What percent of tickets to this lottery win a car?
4 Which of the following numbers represents the biggest risk of getting a disease? 1 in 100, 1 in 1000, 1 in 10
5 Which of the following numbers represents the biggest risk of getting a disease? 1%, 10%, 5%
6 If Person A’s risk of getting a disease is 1% in ten years, and person B’s risk is double that of A’s, what is B’s risk?
7 If Person A’s chance of getting a disease is 1 in 100 in ten years, and person B’s risk is double that of A’s, what is B’s risk?
8 If the chance of getting a disease is 10%, how many people would be expected toget the disease:A: Out of 100?
9 If the chance of getting a disease is 10%, how many people would be expected toget the disease:B: Out of 1000?
10 If the chance of getting a disease is 20 out of 100, this would be the same as having a ____% chance of getting the disease.
11 The chance of getting a viral infection is .0005. Out of 10,000 people, about how many of them are expected to get infected?
Convergent validity of the scales was indicated by significant positive correlations between the scores of each scale and TAIMI. As expected, the associations were stronger with TAIMI-num, the numeracy component of the TAIMI scale (Spearman’s ρ, with Schwartz-J, 0.42; Lipkus-J9, 0.46), than with TAIMI-all (Spearman’s ρ, with Schwartz-J, 0.29; Lipkus-J9, 0.33).
Performance on the numeracy scales
Respondents spent 4.9 min ± 0.3 (mean ± SD) completing the Lipkus-Jall, which included Schwartz-J. On average, the item-wise correct response rate was nearly 90 percent, which is much higher than that of the American sample reported in the original study  (Table 2). This high level of performance indicates a high level of basic numeracy among the Japanese public. In particular, in the comparison of risk section that was presented in the same format (items 4 and 5), more than 95% of responses were correct. However, there were two other items where nearly 30 percent of participants made mistakes. These were conversion of fractions to percentages (item 3), and conversion of percentages to frequencies (item 11), both of which included decimal numbers. This suggests that a significant proportion of Japanese people become confused when dealing with such data.
Total score for each numeracy measure
Mean ± SD
2.4 ± 0.8
8.0 ± 1.5
9.6 ± 1.8
Median ± IQR
3.0 ± 1.0
9.0 ± 1.0
10.0 ± 2.0
Numeracy and framing bias
To examine whether numeracy is an important consideration for medical risk communication in a Japanese sample, we examined the degree of framing bias for two respondent groups - those scoring low, and those scoring high in the Schwartz/Lipkus scales. A median split of the measures (grouping based on Schwartz-J, cut-off score ≤ 2; grouping based on Lipkus-J9, cut-off score ≤ 9) was made because, as in previous studies, the distribution of scores for both of these scales was skewed [12, 28].
Difference in framing bias between low and high performance groups for each numeracy measure
Numeracy score (Median ± IQR)
2 ± 1
7 ± 2
Difference in rated risk (mean ± SD)b
−0.22 ± 0.85
−0.02 ± 0.81
−0.22 ± 0.89
−0.00 ± 0.78
Because numeracy scores were influenced by education and gender, we examined whether these factors also influenced risk perception bias. We did not find significant effects for education (p > 0.5) or gender (p > 0.2) on the framing effect. This suggests that numeracy is a more important determinant of risk perception bias than the demographic characteristics that were examined in the current study.
In this study, we evaluated Japanese numeracy by translating and applying the Schwartz and Lipkus scales, the widely used health numeracy scales that focus on the understanding of basic math and probability. Translated versions of both scales showed certain reliability and validity, however, the Japanese sample’s high performance caused the score distributions to be negatively skewed, imposing limitations on the psychometric evaluations of the scales. In this section, we first discuss Japanese numeracy in light of our results. Then we address the validity and limits of Lipkus-J and Schwartz-J, and future directions for the application of health numeracy measures.
The current study suggests that basic understanding of math and probability is quite high among Japanese: correct response rates for Lipkus-J items were much higher than those found in the original US samples , and in more recent studies on probabilistic national German and US samples . This is consistent with the results of the Programme for International Student Assessment (PISA), where the national average math score for Japanese students has been surpassing those of both the US and Germany since its inception [34–37]. The relatively high attainment of math skills during school education, as assessed by PISA, might partly account for the generally high numeracy of the Japanese. The current result is also in line with the recent study assessing the numeracy skills of students at top universities in 15 counties . Although linking top-university level performance with that of the general population is not straightforward, Japan was second best in having the smallest proportion of respondents falling into the lowest quartile.
However, in spite of generally high numeracy, the performance of Japanese sample on the Schwartz-J and Lipkus-J tests still accounted for susceptibility to the framing effect, which can influence patients’ decisions regarding their medical options, such as acceptance of surgery (e.g. [12, 19, 38]; however, empirical results on framing effects in a clinical setting are mixed, reviewed in ). A number of previous studies using the original Schwartz and Lipkus scales have shown a numeracy effect on understanding and decision making based on medical information (reviewed in [5, 8–11]). Moreover, studies have been advancing for communicating quantitative risk information with consideration of patients’ numeracy, such as supplementing numerical data with visual or verbal aides, using natural frequencies rather than probabilities, or presenting risks with both negative and positive frames (reviewed in [10, 11, 40–43]). Considering our results and these earlier findings, such care would be called for when communicating medical information to those with low numeracy in Japan, and possibly in other countries where general math performance is deemed to be high.
Regarding instruments to identify those with low numeracy, both the Schwartz-J and Lipkus-J scales demonstrated certain reliability and validity, with Cronbach’s α being comparable with those of original scales, convergent validity being supported by their positive correlation with other health literacy and numeracy measure (TAIMI, ), criterion validity being suggested by their association with the susceptibility to framing bias, and content validity being ensured in the original scales. However, we also found a pronounced ceiling effect, which confounded the analysis we have applied, and limited the psychometric qualities of the scales.
Ceiling effects pose multiple psychometric limitations . First, they suggest that scales are less able to differentiate among those with high numeracy. Second, statistical methods applicable for data analysis become limited, as many popular methods assume a normal distribution, and possibly giving in erroneous results when this assumption is violated [44–46]. Non-parametric alternatives are not always sufficient. For example, in the current study, we had to use a median split, making it difficult to examine the relationship between numeracy scores and framing effects in depth.
A third limitation is that the means to evaluate the validity of the scale. For example, respondents’ performances can be confounded with other factors such as motivation . However, ensuring discriminant validly is not straightforward with data having a ceiling effect, as, for example, a weak correlation between motivation and numeracy scores might be due to the ceiling effect, rather than the variables being truly unrelated. Similarly, examining the relationships between measured ability with other closely related abilities such as working memory  would be confounded by the ceiling effect. Thus, use of Lipkus-J and Schwartz-J with high numeracy sample requires careful consideration of those limitations.
In fact, negative skew for the original Schwartz and Lipkus scales have been noted in a number of earlier studies [28–33], and the limitations mentioned above have been pointed out [16, 27]. In response to those concerns, new numeracy scales have recently been developed: the Berlin Numeracy Test (BNT, ) and the Abbreviated Numeracy Scale (ANS, ). While both scales were built on the works of Schwartz and Lipkus, they have a wider range of difficulty. As a result, they have better psychometric characteristics, especially when used with high-performance samples. Considering the generally high numeracy of Japanese, those new scales might be more suitable for assessing numeracy in Japan, and this should be explored.
Meanwhile, Lipkus-J and Schwartz-J could be useful for assessing those having low numeracy. In the above-mentioned studies that developed new numeracy scales, the effectiveness of the original Lipkus and Schwartz scales is indicated for assessing groups with low numeracy [16, 27]. In fact, positive skew was observed in some of the samples studied using BNT , where easier tests would work better. This is an important point to consider when clinical applications are in scope, because some patients are likely to be under physical and psychological stress, which might result in lower numeracy. For instance, a recent clinical study using the Lipkus scale found the numeracy of epilepsy patients to be significantly lower than healthy controls even though educational attainments were lower in the control group . This issue also bears on the test’s validity; where the psychometric characteristics of scales could differ across population groups or settings . Considering possible difference between patients and healthy groups, the use of the numeracy scales translated here, as well as the above-mentioned new scales, should be explored using patient samples so that more effective numeracy measures for the patient population can be discovered.
Finally, the possible influence of volunteer bias  should be noted when interpreting the current results. Although demographics of the sample matched those of the Japanese adult population, the test respondents were those who voluntarily agreed to participate in a survey concerning numbers. Therefore, the results could be biased towards those who are more interested in solving numerical problems, and not actually representative of the population. In fact, the average total score of TAIMI in the current study was 4.7, which is higher than that of 3.9 in the original report (Internet survey, n = 6047, ). This disparity might be due to differences in sample composition between Takahashi et al.’s work and ours (there were more females and elderly in their study, and no education levels were reported). However, it is also possible that the numeracy reported here is higher than average. This issue should be addressed in future random-selection population-based surveys.
The current study highlights the importance of considering health numeracy in Japan. As assessed by the Japanese versions of internationally used health numeracy scales, the basic understanding of math and probability by Japanese people was shown to be high, but still not sufficient for many to avoid framing bias. Thus, to improve numerical medical risk communication in Japan, it would be necessary to assess health numeracy, screening those with low numeracy to provide them with appropriate care. Although efforts have been made [16, 22], numeracy has not gained much attention in Japan. By evaluating the health numeracy of Japanese, and its measurement instruments, our study is a step towards improving medical risk communication.
Translation of the Schwartz and Lipkus scales
The Japanese version of the Lipkus scale (Lipkus-J), which includes the Schwartz scale (Schwartz-J), was prepared using forward and backward translation . Each translation process was conducted by independent professional translators. Two raters (a bilingual Japanese-English individual and a native English speaker) evaluated the concordance between the back-translated items and the originals. Forward-backward translation was repeated until both raters rated all the back-translated items as semantically concordant with the original. After the translation, some expressions (for example, a lottery prize in dollars) were changed to suit the Japanese context. Finally, the understandability of the resultant wording was checked by students, office workers, and researchers recruited at Jichi Medical University, and Obihiro University of Agriculture & Veterinary Medicine (n = 22), and minor changes in wording were made.
A set of questions in which subjects were asked to rate the risk level of a surgical procedure when risk information was presented in two different frames (survival rate, “991 in 1000 people survive this surgery”, and death rate, “9 in 1000 people die from this surgery”), were adopted from the Medical Data Interpretation test . Framing was manipulated within subjects, separating the two differently framed questions with 12 irrelevant ones. A four-point scale was used to rate risk level (1 = not risky, 2 = slightly risky, 3 = risky, 4 = very risky). The framing effect was evaluated by examining the difference between risk rating scores obtained for the two frames.
As mentioned in the Introduction, TAIMI  contains health numeracy items for Japanese adults. To examine how performance on TAIMI relates to that on the numeracy scales used in the current study, TAIMI was included in the survey.
Survey and participants
An online survey company (Cross Marketing, Tokyo, Japan) was contracted to collect responses (n = 300), and recruitment e-mails were sent to a participant pool maintained by a different online survey company (Research Panel, Tokyo, Japan, n > 1.4 million). Participants voluntarily agreed to complete the online survey. We created 20 blocks of subjects, each defined by gender, age group (20-29, 30-39, 40-49, 50-59, 60-69 years old), and education level (low attainment [high school or lower], high attainment [high school or higher]). The quota was set so that the sample composition roughly matched the Japanese adult population (Additional file 1: Table S1). Participants were recruited until the quota was filled. Students and medical professionals were excluded.
The survey included the Lipkus-J scale (which incorporates the Schwartz-J), TAIMI, measures for framing bias, and some other measures of health and risk attitudes (not reported here). The web page was designed in such a way that respondents could not proceed to the next question without completing the current one, so there were no non-response items in the survey. The survey was conducted in March 2011.
Item responses for Schwartz-J, Lipkus-J, and TAIMI were first dichotomized to be either correct or incorrect, and the percentage of individuals with the correct response was determined for each item. As in original scales, the total score was calculated as the number of correct items for each respondent.
An exploratory factor analysis with binary variables was conducted using Mplus version 6.12 . The method used employs tetrachoric correlation with weighted least squares means and variance adjusted (WLSMV) estimation method. This method accommodates dichotomous observation, and has been indicated to be robust to ceiling effects [53–56].
Numbers of factors of the new scales were determined by parallel analysis. In the analysis, random datasets for the same number of items and participants as actual observation were generated. Eigen values were extracted for each random dataset, and actual observation. Only those factors with eigen values greater than the average eigen values obtained from random datasets were deemed to be meaningful . Subsequently, exploratory factor analysis with number of factors determined by parallel analyses was performed to examine the factor loadings. Criteria for factor loadings were set to be .35 and above . The consistencies of scales were evaluated according to classical test theory, including Cronbach’s alpha , item-total correlation, and descriptive statistics of items.
Convergent validity of numeracy scales has been evaluated through correlation with existing measures for health numeracy . We examined correlations between TAIMI scores and Schwartz-J, and Lipkus-J scores. TAIMI has a two-factor structure where three items are especially relevant to numeracy . Therefore, we expected a positive correlation between performance on TAIMI and the scales translated in this study, especially for the numeracy items of TAIMI (TAIMI-num).
Because the assumption of a normal distribution was not satisfied for test statistics, we used non-parametric tests for examining the effects of demographic characteristics on test performance, and of numeracy levels on framing bias. The Wilcoxon signed rank test was used for pair-wise comparison between two conditions. The Mann-Whitney test was used to compare between two groups, and the Kruskal–Wallis test followed by a pair-wise Mann-Whitney test with Bonferroni correction was used to compare between three or more groups. Significance levels were set at p < 0.05. The program, IBM SPSS Statistics 19 was used for most statistical analysis, with M-plus version 6.12  was used for factor analysis.
The institutional ethics committee of the National Food Research Institute granted approval for the study, and permission was also obtained from management section of the Obihiro University of Agriculture & Veterinary Medicine. The methodology used in this study followed the principles of the Helsinki Declaration. Collection of on-line data complied with requirements specified in Japanese Industrial Standards “Personal information protection management systems - Requirements” (JIS Q 15001). Written (electrical) consent was obtained from all the participants.
We thank Drs. Y. Takahashi and T. Shinbo, the developers of TAIMI for their help in using TAIMI. We also thank Profs. H. Kanuka and S. Kawazu for their support in conducting the research, Mr. Kitazawa, Messrs. Sugimoto, Noguchi and Yamauchi for their assistance in data preparation, and ELCS for proofreading the manuscript. This work was supported in part by a grant from the Global COE Program from Japanese Ministry of Education, Science, Sports, Culture and Technology (MEXT), Programme for Promotion of Basic and Applied Researches for Innovations in Bio-oriented Industry (MO), Grant-in-Aid for Young Scientists (B) 20700779 (MO) and 23700921 (YK) from MEXT, Grant-in-Aid (B) 23300247 from MEXT, and grants from the Japan Science and Technology Agency, under the Strategic Promotion of Innovative Research and Development Program, and Comprehensive Research on Disability, Health and Welfare from Health and Labour Sciences Research Grants (ID). None of the funding bodies had any role in the study design, collection, analysis and interpretation of the data, writing of the paper, or in the decision to submit the manuscript for publication.
- Hellenthal N, Ellison L: How patients make treatment choices. Nat Clin Pract Urol. 2008, 5: 426-433.View ArticlePubMedGoogle Scholar
- Japanese Cabinet Office: Surveys for the Measures for the Aging Society.http://www8.cao.go.jp/kourei/ishiki/h19/kenko/zentai/index.html,
- Partridge JC, Martinez AM, Nishida H, Boo NY, Tan KW, Yeung CY, Lu JH, Yu VY: International comparison of care for very low birth weight infants: parents' perceptions of counseling and decision-making. Pediatrics. 2005, 116: e263-e271. 10.1542/peds.2004-2274.View ArticlePubMedGoogle Scholar
- Alden DL, Merz MY, Akashi J: Young adult preferences for physician decision-making style in Japan and the United States. Asia Pac J Public Health. 2012, 24: 173-184. 10.1177/1010539510365098.View ArticlePubMedGoogle Scholar
- Peters E: Beyond Comprehension: The role of numeracy in judgments and decisions. Curr Dir Psychol Sci. 2012, 21: 31-35. 10.1177/0963721411429960.View ArticleGoogle Scholar
- Nelson W, Reyna VF, Fagerlin A, Lipkus I, Peters E: Clinical implications of numeracy: theory and practice. Ann Behav Med. 2008, 35: 261-274. 10.1007/s12160-008-9037-8.View ArticlePubMedPubMed CentralGoogle Scholar
- Gaissmaier W, Gigerenzer G: Statistical illiteracy undermines informed shared decision making. Z Evid Fortbild Qual Gesundhwes. 2008, 102: 411-413. 10.1016/j.zefq.2008.08.013.View ArticlePubMedGoogle Scholar
- Reyna VF, Nelson WL, Han PK, Dieckmann NF: How numeracy influences risk comprehension and medical decision making. Psychol Bull. 2009, 135: 943-973.View ArticlePubMedPubMed CentralGoogle Scholar
- Lipkus IM, Peters E: Understanding the role of numeracy in health: proposed theoretical framework and practical insights. Health Educ Behav. 2009, 36: 1065-1081. 10.1177/1090198109341533.View ArticlePubMedPubMed CentralGoogle Scholar
- Fagerlin A, Ubel PA, Smith DM, Zikmund-Fisher BJ: Making numbers matter: present and future research in risk communication. Am J Health Behav. 2007, 31: S47-S56. 10.5993/AJHB.31.s1.7.View ArticlePubMedGoogle Scholar
- Garcia-Retamero R, Okan Y, Cokely ET: Using visual aids to improve communication of risks about health: a review. ScientificWorldJournal. in pressGoogle Scholar
- Choi H, Wong JB, Mendiratta A, Heiman GA, Hamberger MJ: Numeracy and framing bias in epilepsy. Epilepsy Behav. 2011, 20: 29-33. 10.1016/j.yebeh.2010.10.005.View ArticlePubMedGoogle Scholar
- Lipkus IM, Peters E, Kimmick G, Liotcheva V, Marcom P: Breast cancer patients' treatment expectations after exposure to the decision aid program adjuvant online: the influence of numeracy. Med Decis Making. 2010, 30: 464-473. 10.1177/0272989X09360371.View ArticlePubMedPubMed CentralGoogle Scholar
- Gardner PH, McMillan B, Raynor DK, Woolf E, Knapp P: The effect of numeracy on the comprehension of information about medicines in users of a patient information website. Patient Educ Couns. 2011, 83: 398-403. 10.1016/j.pec.2011.05.006.View ArticlePubMedGoogle Scholar
- Estrada CA, Martin-Hryniewicz M, Peek BT, Collins C, Byrd JC: Literacy and numeracy skills and anticoagulation control. Am J Med Sci. 2004, 328: 88-93. 10.1097/00000441-200408000-00004.View ArticlePubMedGoogle Scholar
- Cokely ET, Galesic M, Schulz E, Ghazal S, Garcia-Retamero R: Measuring risk literacy: the Berlin numeracy test. Judgm Decis Mak. 2012, 7: 25-47.Google Scholar
- Peters E, Baker DP, Dieckmann NF, Leon J, Collins J: Explaining the effect of education on health: a field study in Ghana. Psychol Sci. 2010, 21: 1369-1376. 10.1177/0956797610381506.View ArticlePubMedGoogle Scholar
- Liberali JM, Reyna VF, Furlan S, Stein LM, Pardo ST: Individual differences in numeracy and cognitive reflection, with implications for biases and fallacies in probability judgment. J Behav Decis Mak. 2012, 25: 361-381. 10.1002/bdm.752.View ArticlePubMedGoogle Scholar
- Garcia-Retamero R, Galesic M: How to reduce the effect of framing on messages about health. J Gen Intern Med. 2010, 25: 1323-1329. 10.1007/s11606-010-1484-9.View ArticlePubMedPubMed CentralGoogle Scholar
- Pachur T, Galesic M: Strategy selection in risky choice: The impact of numeracy, affect, and cross-cultural differences. J Behav Decis Mak. in pressGoogle Scholar
- Garcia-Retamero R, Galesic M: Communicating treatment risk reduction to people with low numeracy skills: a cross-cultural comparison. Am J Public Health. 2009, 99: 2196-2202. 10.2105/AJPH.2009.160234.View ArticlePubMedPubMed CentralGoogle Scholar
- Takahashi Y, Sakai M, Fukui T, Shimbo T: Measuring the ability to interpret medical information among the Japanese public and the relationship with inappropriate purchasing attitudes of health-related goods. Asia Pac J Public Health. 2011, 23: 386-398. 10.1177/1010539509344882.View ArticlePubMedGoogle Scholar
- Schwartz LM, Woloshin S, Black WC, Welch HG: The role of numeracy in understanding the benefit of screening mammography. Ann Intern Med. 1997, 127: 966-972.View ArticlePubMedGoogle Scholar
- Lipkus IM, Samsa G, Rimer BK: General performance on a numeracy scale among highly educated samples. Med Decis Making. 2001, 21: 37-44.View ArticlePubMedGoogle Scholar
- Steiner DL, Norman GR: Health measurement scales. A practical guide to their development and use. 2003, Oxford University Press, New York, 3Google Scholar
- Nunnally JC, Bernstein IH: Psychometric theory. 1994, McGraw-Hill, New York, 3Google Scholar
- Weller JA, Dieckmann NF, Tusler M, Mertz CK, Burns WJ, Peters E: Development and testing of an abbreviated numeracy scale: A rasch analysis approach. J Behav Decis Mak. in pressGoogle Scholar
- Peters E, Vastfjall D, Slovic P, Mertz CK, Mazzocco K, Dickert S: Numeracy and decision making. Psychol Sci. 2006, 17: 407-413. 10.1111/j.1467-9280.2006.01720.x.View ArticlePubMedGoogle Scholar
- Peters E, Dieckmann N, Dixon A, Hibbard JH, Mertz CK: Less is more in presenting quality information to consumers. Med Care Res Rev. 2007, 64: 169-190. 10.1177/10775587070640020301.View ArticlePubMedGoogle Scholar
- Peters E, Slovic P, Västfjäll D, Mertz CK: Intuitive numbers guide decisions. Judgm Decis Mak. 2008, 3: 619-635.Google Scholar
- Schapira MM, Walker CM, Sedivy SK: Evaluating existing measures of health numeracy using item response theory. Patient Educ Couns. 2009, 75: 308-314. 10.1016/j.pec.2009.03.035.View ArticlePubMedPubMed CentralGoogle Scholar
- Hanoch Y, Miron-Shatz T, Cole H, Himmelstein M, Federman AD: Choice, numeracy, and physicians-in-training performance: the case of Medicare Part D. Health Psychol. 2010, 29: 454-459.View ArticlePubMedGoogle Scholar
- Galesic M, Garcia-Retamero R: Statistical numeracy for health: a cross-cultural comparison with probabilistic national samples. Arch Intern Med. 2010, 170: 462-468. 10.1001/archinternmed.2009.481.View ArticlePubMedGoogle Scholar
- OECD: PISA. 2000,http://www.oecd.org/pisa/pisaproducts/, Technical Report,Google Scholar
- OECD: PISA. 2003,http://www.oecd.org/pisa/pisaproducts/, Technical Report,Google Scholar
- OECD: PISA. 2006,http://www.oecd.org/pisa/pisaproducts/, Technical Report,Google Scholar
- OECD: PISA. 2009,http://www.oecd.org/pisa/pisaproducts/, Technical Report,View ArticleGoogle Scholar
- Okan Y, Rocio G, Cokely ET, Maldonado A: Individual differences in graph literacy: Overcoming denominator neglect in risk comprehension. J Behav Decis Mak. 2011, 25: 390-401.View ArticleGoogle Scholar
- O'Keefe DJ, Jensen JD: The relative persuasiveness of gain-framed and loss-framed messages for encouraging disease prevention behaviors: a meta-analytic review. J Health Commun. 2007, 12: 623-644. 10.1080/10810730701615198.View ArticlePubMedGoogle Scholar
- Ancker JS, Senathirajah Y, Kukafka R, Starren JB: Design features of graphs in health risk communication: a systematic review. J Am Med Inform Assoc. 2006, 13: 608-618. 10.1197/jamia.M2115.View ArticlePubMedPubMed CentralGoogle Scholar
- Lipkus IM: Numeric, verbal, and visual formats of conveying health risks: suggested best practices and future recommendations. Med Decis Making. 2007, 27: 696-713. 10.1177/0272989X07307271.View ArticlePubMedGoogle Scholar
- Kurz-Milcke E, Gigerenzer G, Martignon L: Transparency in risk communication: graphical and analog tools. Ann N Y Acad Sci. 2008, 1128: 18-28. 10.1196/annals.1399.004.View ArticlePubMedGoogle Scholar
- Hanoch Y, Pachur T: Nurses as information providers: facilitating understanding and communication of statistical information. Nurse Educ Today. 2004, 24: 236-243. 10.1016/j.nedt.2004.01.004.View ArticlePubMedGoogle Scholar
- Uttl B: Measurement of individual differences: lessons from memory assessment in research and clinical practice. Psychol Sci. 2005, 16: 460-467.PubMedGoogle Scholar
- Muthén B: Moments of the censored and truncated bivariate normal distribution. Br J Math Stat Psychol. 1990, 43: 131-143. 10.1111/j.2044-8317.1990.tb00930.x.View ArticleGoogle Scholar
- Sheng Y, Sheng Z: Is coefficient alpha robust to non-normal data?. Front Psychol. 2012, 3: 1-13.View ArticleGoogle Scholar
- Duckworth AL, Quinn PD, Lynam DR, Loeber R, Stouthamer-Loeber M: Role of test motivation in intelligence testing. Proc Natl Acad Sci U S A. 2011, 108: 7716-7720. 10.1073/pnas.1018601108.View ArticlePubMedPubMed CentralGoogle Scholar
- Cokely ET, Kelley CM: Cognitive abilities and superior decision making under risk: A protocol analysis and process model evaluation. Judgm Decis Mak. 2009, 4: 20-33.Google Scholar
- Messick S: Validity of psychological assessment: Validation of inferences from persons' responses and performances as scientific inquiry into score meaning. Am Psychol. 1995, 50: 741-749.View ArticleGoogle Scholar
- Heiman GW: Research methods in psychology. 2002, Houghton Mifflin, Boston & New York, 3Google Scholar
- Schwartz LM, Woloshin S, Welch HG: Can patients interpret health information? An assessment of the medical data interpretation test. Med Decis Making. 2005, 25: 290-300. 10.1177/0272989X05276860.View ArticlePubMedGoogle Scholar
- Mplus user's guide. 1998–2010,http://www.statmodel.com/ugexcerpts.shtml, 6,
- Beauducel A, Herzberg PY: On the performance of maximum likelihood versus means and variance adjusted weighted least squares estimation in CFA. Structural Equation Modeling: A Multidisciplinary Journal. 2006, 13: 186-203. 10.1207/s15328007sem1302_2.View ArticleGoogle Scholar
- Flora DB, Curran PJ: An empirical evaluation of alternative methods of estimation for confirmatory factor analysis with ordinal data. Psychol Methods. 2004, 9: 466-491.View ArticlePubMedPubMed CentralGoogle Scholar
- Muthén B: Dichotomous factor analysis of symptom data. Latent Variable Models for Dichotomous Outcomes: Analysis of Data from the Epidemiological Catchment Area Program. Edited by: Eaton WW, Bohrnstedt GW. 1989, Sage Periodicals Press, Newbury Park, CA, 19-65. Sociological methods & researchGoogle Scholar
- Muthén BO, du Toit SHC, Spisic D: Robust inference using weighted least squares and quadratic estimating equations in latent variable modeling with categorical and continuous outcomes.http://pages.gseis.ucla.edu/faculty/muthen/articles/Article_075.pdf,
- Bernstein IH, Rush AJ, Carmody TJ, Woo A, Trivedi MH: Clinical vs. self-report versions of the quick inventory of depressive symptomatology in a public sector sample. J Psychiatr Res. 2007, 41: 239-246. 10.1016/j.jpsychires.2006.04.001.View ArticlePubMedGoogle Scholar
- Comrey AL, Lee HB: A First Course in Factor Analysis. 1992, Lawrence Erlbaum Associates, Hillsdale, NJ, 2Google Scholar
- Cronbach LJ: Coefficient alpha and the internal structure of tests. Psychometrika. 1951, 16: 297-334. 10.1007/BF02310555.View ArticleGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1472-6947/12/104/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.