Utilization of the PICO framework to improve searching PubMed for clinical questions
© Schardt et al. 2007
Received: 15 February 2007
Accepted: 15 June 2007
Published: 15 June 2007
Skip to main content
© Schardt et al. 2007
Received: 15 February 2007
Accepted: 15 June 2007
Published: 15 June 2007
Supporting 21st century health care and the practice of evidence-based medicine (EBM) requires ubiquitous access to clinical information and to knowledge-based resources to answer clinical questions. Many questions go unanswered, however, due to lack of skills in formulating questions, crafting effective search strategies, and accessing databases to identify best levels of evidence.
This randomized trial was designed as a pilot study to measure the relevancy of search results using three different interfaces for the PubMed search system. Two of the search interfaces utilized a specific framework called PICO, which was designed to focus clinical questions and to prompt for publication type or type of question asked. The third interface was the standard PubMed interface readily available on the Web. Study subjects were recruited from interns and residents on an inpatient general medicine rotation at an academic medical center in the US. Thirty-one subjects were randomized to one of the three interfaces, given 3 clinical questions, and asked to search PubMed for a set of relevant articles that would provide an answer for each question. The success of the search results was determined by a precision score, which compared the number of relevant or gold standard articles retrieved in a result set to the total number of articles retrieved in that set.
Participants using the PICO templates (Protocol A or Protocol B) had higher precision scores for each question than the participants who used Protocol C, the standard PubMed Web interface. (Question 1: A = 35%, B = 28%, C = 20%; Question 2: A = 5%, B = 6%, C = 4%; Question 3: A = 1%, B = 0%, C = 0%) 95% confidence intervals were calculated for the precision for each question using a lower boundary of zero. However, the 95% confidence limits were overlapping, suggesting no statistical difference between the groups.
Due to the small number of searches for each arm, this pilot study could not demonstrate a statistically significant difference between the search protocols. However there was a trend towards higher precision that needs to be investigated in a larger study to determine if PICO can improve the relevancy of search results.
Practicing evidence-based medicine (EBM) requires integration of clinical experience, the best available research evidence, and the values and preferences of the patient into the clinical decision-making process . The steps in practicing EBM are centered on the patient and involve asking well-focused questions, searching for the best available evidence, appraising that evidence for validity, and then applying the results to the care of the patient. Supporting 21st century health care and the practice of EBM requires ubiquitous access to clinical information and to knowledge-based resources. Clinicians and educators currently utilize a variety of resources and interfaces to search the biomedical literature to answer clinical questions. PubMed is a service of the U.S. National Library of Medicine (NLM) that provides access to over 16 million citations from MEDLINE and other life science journals dating back to the 1950s . Since 1997, PubMed has been freely available to physicians, researchers, and the public. The information obtained from literature searches in PubMed can have a significant impact on patient care and clinical outcomes. Crowley et al reported on a study of 625 clinical questions asked by residents during an in-hospital general medicine rotation. Seventy-seven percent of the answers to these questions came from MEDLINE and the information from the articles changed patient management 47% of the time . Klein et al. showed that conducting a MEDLINE search early in the hospitalization of a patient could significantly lower costs, charges, and lengths of stay . Westbrook et al reported that the use of an online information retrieval system improved the quality of clinicians' answers to clinical questions by 21% . The literature also reports that many clinical questions go unanswered due to difficulties formulating a relevant question , forgetting the question , lack of access to information resources, and lack of skills in searching .
Formulating a well-focused question is the first and arguably the most important step in the EBM process. Without a well-focused question, it can be very difficult and time consuming to identify appropriate resources and search for relevant evidence. Practitioners of EBM often use a specialized framework, called PICO, to form the question and facilitate the literature search. PICO stands for Patient problem, Intervention, Comparison, and Outcome.  The PICO framework can be expanded to PICOTT, adding information about the Type of question being asked (therapy, diagnosis, prognosis, harm, etc.) and the best Type of study design for that particular question. Using this framework helps the clinician articulate the important parts of the clinical question most applicable to the patient and facilitates the searching process by identifying the key concepts for an effective search strategy. [10, 11]
PubMed includes a feature called Clinical Queries, which helps identify citations with appropriate study design by linking the type of question (therapy, diagnosis, etiology and prognosis) to a stored search strategy that retrieves the appropriate research methodology. The Clinical Queries are based on extensive research by the Hedges Study Team at McMaster University  and have been shown to improve searching results. Combining the PICO framework with the PubMed Clinical Queries has the potential to improve the efficiency of literature searching. Bergus et al reported that questions with at least a defined intervention (I) and outcome (O) were more likely to be answered than questions with one or none of these parameters . The objectives of this pilot study were to measure the relevancy or precision of PubMed search results when using a PICO search framework, to test feasibility of the study design and to identify possible trends.
Precision of search results
16.40% – 53.60%
15.84% – 40.16%
1.42% – 8.58%
2.42% – 9.58%-
1.50% – 6.50%
-0.43% – 2.43%
-0.14% – 2.14%
The success of the search was measured by comparing the number of relevant citations retrieved to the total number of article retrieved in the final set. The research team identified the relevant articles for each clinical question. The criteria for an article being included as relevant was that it addressed the specific clinical question, including patient, intervention and outcome, and that it was of the best study methodology based on the type of question. For example, a therapy question needed to be answered by a randomized controlled trial, systematic review, or meta-analysis, while a prognosis question required a prospective cohort study. Two researchers conducted PubMed searches for each question. These two and a third researcher selected the relevant articles from the pooled results of the searches. A fourth researcher also reviewed the results and reconciled any disagreement among the other reviewers.
Participants using the handheld devices wrote down their IP address, the time, and the number of citations in the set that best addressed the clinical question. This information was matched with data collected by the project server at NLM and was used to verify the final result set. Participants using the PC workstation were asked to save their final set results to the study account in MyNCBI. Screen captures, a means of saving the image of the search, were made of their complete search history to back up their saved strategies. For all participants, the search terms, date and time of the search, and the Unique Identifiers for the citations retrieved were collected by the system transactions logs and stored on the NLM project server or in MyNCBI. There were no time constraints on any of the participants. The DUMC Institutional Review Board approved the study method and all participants signed an informed consent form.
The primary outcome measurement was the precision of the final result set selected to answer the clinical question. Precision was defined as the ratio of relevant citations retrieved to the total number of citations retrieved in the set. The higher the precision the more efficient the search, as more of what is retrieved is also relevant. The measurement of precision is appropriate in the clinical setting, where it is often desirable to find a few good articles, as opposed to a research setting, where the measurement of recall, finding all possible relevant articles, is more important. We also examined the number of terms and filters used in the search strategy and user satisfaction with the search interface.
Baseline characteristics of study participants
Protocol A N = 10
Protocol B N = 10
Protocol C N = 10
Previous EBM training
Previous searching training
How often do you search MEDLINE?
Monthly or less
In addition to quantitative comparisons, this pilot study provided an opportunity for more open-ended insight into how practitioners search. Some searches within each question (Question 1 = 3/30, Question 2 = 9/30, and Question 3 = 27/30) did not retrieve any relevant citations. A qualitative assessment of search strategies that produced no acceptable results revealed three common types of errors: ambiguous mapping of subject headings (MeSH); selecting the wrong publication type; and limiting search queries to just words in the title. The error that accounted for the largest number of non-productive searches was related to the MEDLINE indexing structure or MeSH (Medical Subject Headings). In question three, most searchers used the phrase "African Americans", which maps to the MeSH "African Americans" and retrieves 29077 citations [searched 4/18/07]. However, the common word "blacks" maps to the broader MeSH term "African continental ancestry group" and retrieves 48195 citations [searched 4/18/07]. Most of the relevant citations used either the word "blacks" or were indexed to the broader MeSH term "African continental ancestry group." The second most common error was related to selecting the wrong study design or Clinical Query for the type of question being searched. This was a common problem for Protocol A, which only listed therapy study designs. The third error affected Protocol C and involved limiting search terms to the title field. While searchers often select their articles based on relevant words in the title, a problem arises when more than one word or phrase is appropriate to the topic. For example, a "peg" is also called a "percutaneous endoscopic gastrostomy tube" or "feeding tube." Limiting the search to the word "peg" in the title eliminates mapping to appropriate subject headings (MeSH) such as intubation, gastrointestinal and therefore may exclude relevant articles on the topic that use alternative terminology. Understanding these errors can help in teaching effective searching and in developing better search systems.
Perceptions of ease of use and time spent searching were approximately the same across all three protocols, as were the numbers of terms used in each search, regardless of protocol used. However, of the 30 searches performed using Protocol B (PubMed/PICO/Clinical Queries), 25 (83%) actually incorporated the Clinical Queries into the strategy, as opposed to only 2 (7%) of the searches using Protocol C (PubMed/Web). Both Protocol A and B facilitated the use of publication types and the Clinical Queries by prompting the searcher to consider these elements in the strategy. While these elements were also available in Protocol C, they are either behind the Limits tab or listed in the PubMed Services menu.
Effective searching requires a series of steps to lead the practitioner from the clinical question or bedside to informed decision making. In March of 2005, over 68,000,000 searches were conducted in PubMed, from seven million unique IP addresses.  This study attempted to show that targeted interfaces structured to help clinicians search PubMed using the PICO format could affect the relevancy of the results, which ultimately could improve patient care. While this study was intended to look only at the actual search process using the intermediate measure of precision, examining the impact this can have on clinical care requires further research. Even though the standard PubMed search screen provides access to the Clinical Queries function, searchers in Protocol C used it infrequently. Placing the Clinical Queries option directly on the search screen, as in Protocol B, increased the likelihood of their use. This study has several limitations. First, as this study was designed as a pilot for a larger study, the sample size was small and may not have been adequately powered to show a significant difference. Second, we provided the subjects with well-focused questions, which might have been unrealistic in their simplicity and transfer to the PICO template. In the clinical setting the work of generating questions from clinical situations and translating them into the PICO format may be much more difficult. The success of the search may be much more dependent on asking a good question than on the search framework. Third, we recruited our subjects from a busy general medicine rotation, where EBM is taught and residents are expected to search the literature to address clinical questions. These residents may be more skilled in database searching than at other institutions and therefore generalizing the results to a less experienced population may not be appropriate. Finally, searching PubMed from a handheld may have resulted in some frustration with the searching process, especially if participants were not familiar with the specific devices. Although all residents within the Internal Medicine Residency Program had been given PDAs, none had previously used the device to search PubMed. While this study was not designed to address the technical issues of using handheld devices for searching PubMed, we did test the effectiveness of the search template, which NLM had designed specifically for use on the PDA. Using a handheld device and a PICO template to search the biomedical literature can address some of the barriers of time and access to resources for answering clinical questions.
Searches performed on a PICO-formatted screen retrieved a higher percentage of relevant citations than searches performed on the standard PubMed search interface. However, due to the small number of searches for each arm, this pilot study could not demonstrate a statistically significant difference between the search protocols. There was a suggestion of a trend towards higher precision that needs to be investigated in a larger study to determine if PICO can improve the relevancy of search results.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.