BMC Medical Informatics and Decision Making

Background: The outcome of patients with bacteraemia is influenced by the initial selection of adequate antimicrobial therapy. The objective of our study was to clarify the influence of different crude data correction methods on a) microbial spectrum and ranking of pathogens, and b) cumulative antimicrobial susceptibility pattern of blood culture isolates obtained from patients from intensive care units (ICUs) using a computer based tool, MONI. Methods: Analysis of 13 ICUs over a period of 7 years yielded 1427 microorganisms from positive results. Three different data correction methods were applied. Raw data method (RDM): Data without further correction, including all positive blood culture results. Duplicate-free method (DFM): Correction of raw data for consecutive patient's results yielding same microorganism with similar antibiogram within a two-week period. Contaminant-free method (CFM): Bacteraemia caused by possible contaminants was only assumed as true bloodstream infection, if an organism of the same species was isolated from > 2 sets of blood cultures within 5 days. Results: Our study demonstrates that different approaches towards raw data correction – none (RDM), duplicate-free (DFM), and a contaminant-free method (CFM) – show different results in analysis of positive blood cultures. Regarding the spectrum of microorganisms, RDM and DFM yielded almost similar results in ranking of microorganisms, whereas using the CFM resulted in a clinically and epidemiologically more plausible spectrum. Conclusion: For possible skin contaminants, the proportion of microorganisms in terms of number of episodes is most influenced by the CFM, followed by the DFM. However, with exception of fusidic acid for gram-positive organisms, none of the evaluated correction methods would have changed advice for empiric therapy on the selected ICUs.


Background
Nosocomial infections are estimated to affect 6-12% of hospitalized patients [1].Of these infections, bacteraemia and fungaemia have the most significant effect on mortality.The outcome of patients with bacteraemia is influenced by the initial selection of adequate antimicrobial therapy [2].Generally, selecting the antibiotic of choice for the treatment of an infection is a multi-factorial process, which includes site of infection, intrinsic activity of the drug according to microbiological susceptibility testing results, pharmacokinetics of the drug at the site of infection, and potential side effects.However, before considering these factors, the choice of agent is mainly dependant on knowledge of the organisms likely to be involved.Contrarily, in clinical settings, microbiological verification of an infection and susceptibility of the causative pathogen are usually not available at the time of clinical diagnosis of an infection.In order not to endanger the patient, a calculated (empirical) antimicrobial therapy often has to be started without exact knowledge of the causative pathogen and its antimicrobial susceptibility profile.Furthermore, very little changes occur in the antimicrobial management even at the time when antimicrobial susceptibility results are available to physicians [3].
To assist physicians at an early stage with the empiric antibiotic choice for treating blood stream infections, a close liaison with the clinical microbiologists is important.By regularly monitoring and analyzing blood culture results and by calculating the most frequently isolated microorganisms together with their cumulative susceptibility profiles, the clinical microbiologist can narrow the plausible cause of bacteraemia and susceptibility to antibiotics.Therefore, analysis of occurrence of pathogens and their cumulative susceptibility profiles is widely used in hospitals, and recent attempts on how to standardize these tasks were only developed within the last years [4].However, applying these definitions manually on large datasets is time consuming, error-prone, and therefore, needs development of expensive software tools.
For blood cultures, there are two concerns that demand correction of crude data before generating statistics on the frequency of occurrence of pathogens and their cumulative antibiotic susceptibility profile.One refers to the fact that in course of treatment and monitoring of the patient, repetitive isolates are obtained, and it is generally believed that by omission of these duplicates the final result will not be biased by multiple cultures of one identical organism during a single infectious episode.The second consideration is the difficulty of interpretation of microbiological test results in distinguishing true episodes of infection from possible contamination of specimens [5].In clinical practice, merging several other laboratory results together with the clinical aspect of the affected patient makes decision on this subject.Looking at microbiologic data alone often results in overestimation of the real size of the situation, and this is especially true for blood culture results.Several studies have found Staphylococcus epidermidis to be the most common microorganism isolated from blood cultures, accounting for 30-60% of all episodes [6][7][8][9].Although these findings were explained partly by the increasing use of intravascular devices, which can serve as portal of entry to the bloodstream, they may be as well due to the retrospective nature of many studies and the lack of criteria for differentiation between contaminated blood cultures and true bacteraemia.Two studies [5,10] could demonstrate that a simplified surveillance definition for nosocomial bloodstream infections based on microbiology data alone yielded comparable result to the Centers for Diseases Control and Prevention's (CDC's) definition for primary bloodstream infection [11] for possible skin contaminant isolates with an agreement rate of 75% [5] and 91% [10], respectively.
Based on these considerations, the objective of our study was to clarify the influence of different crude data correction methods on a) microbial spectrum and ranking of pathogens, and b) cumulative antimicrobial susceptibility pattern of blood culture isolates obtained from patients from intensive care units (ICUs).

Methods
The Vienna General Hospital is a 2,140 beds tertiary care referral institution that serves as the teaching facility of the Medical University of Vienna.The hospital cares for about 90,000 inpatients annually.Daily, more than 500 patient samples are sent to the Division of Clinical Microbiology for further processing.After analyzing samples, a clinical microbiologist verifies the results, and findings are downloaded to the hospital's central laboratory database, stored and then forwarded to the sample sender.Simultaneously, the results are downloaded to the MONI (Monitoring of Nosocomial Infections) database, which has a relational, Oracle-based architecture located on a separate server.MONI is a database and surveillance system designed to monitor and detect nosocomial infections [12][13][14][15].Currently, the system offers three categories of applications: a) administration tools, which enable to control database settings and adjusting database parameters; b) a database query tool (FlexScan), which allows to extract records in accordance with manually selected restrictions (for example sender, microorganism, sample, antibiotic resistance profile), or to conduct automatic queries based on knowledge-based rules, resulting in stratified standard analysis of proportions of microorganisms and cumulative antimicrobial susceptibility profiles; c) an automatic surveillance tools using rules for monitoring of alert organisms (e.g.methicillin-resistant Staphylococcus aureus) and antibiotic resistance profiles (MONI/ ALERT), monitoring of cross infections (MONI/CROSS), and monitoring of frequencies and trends (MONI/TREND).The hospital's infection control team analyzes the intelligent alarms generated by MONI, and reacts to them by initiating necessary countermeasures.
No ethical approval was needed for this study, as data acquisition followed Austrian federal law on data safety, and data pre-processing and analysing occurred anonymously.

Data acquisition and processing
Data acquisition was performed using the system's database query tool FlexScan.Since preliminary reports of blood culture results can introduce considerable redundancy into a database of microbiology results, only the final reports from each sample are downloaded to the MONI database.For purpose of this study, three separate queries were conducted, one using the programs duplicate result filter (duplicate-free method -DFM), the other without any data correction (raw data method -RDM).For the purpose of evaluating a new data correction rule, a third query was performed using definitions published by Yokoe et al. [10] (contaminant-free method -CFM).
Following the selection of a time period (1 January 1998 -31 December 2004), a sampling material (blood culture), and the wards under study (13 ICUs: 7 surgical, 5 medical, and 1 neuro-surgical), the program was started.After performing the queries, results were stored in a dynamic comma separated (*.csv) file, and analyzed for more detail using a standard spreadsheet application (MS Excel 2000, Microsoft Corporation, Redmond, Virginia).Data from each patient included the following variables: patient identification number, family name, first name, date of birth, patient's age at sample collection, protocol number of the blood culture report, date of sampling, sender's department and ward, sampling material (blood culture), identified organism, and antimicrobial susceptibility pattern in terms of resistant, susceptible, or intermediate susceptible.

Definition of data correction rules Raw data method (RDM)
Raw data were defined as data acquired by MONI without any further correction.Hence, they included all positive blood culture results obtained from patients of 13 ICUs from 1 January 1998 -31 December 2004 without any further manipulation.

Duplicate-free method (DFM)
(figure 1) The duplicate-free method was defined as correction of raw data by elimination of duplicate results.A duplicate result was defined as any result presenting the same microorganism (genus and species) from the same material (in this study blood culture) with a similar antimicrobial susceptibility pattern (at least 85% similarity) obtained from the same patient within a two-week period starting from the last positive culture with the identical organism.Intermediate susceptibility results were also considered as resistant results.The accepted differences of not more than 15% in antibiotic susceptibility was related most to the clinical practice of testing additional antibiotics or to omit re-testing of previously tested compounds.Hence, in most cases the difference pertained more to missing or additionally tested compounds.Only the first result from a patient -representing one episode of bacteraemia -was enrolled for further analysis.
Contaminant-free method (CFM) (figure 2) Considering the problem of contamination with skin organisms, we applied the definitions published earlier [10].Possible skin contaminants were defined as organisms, which are part of the normal skin flora, including coagulase-negative staphylococci, Corynebacterium species, alpha-or non-hemolytic streptococci [5], Bacillus species, Propionibacterium acnes, micrococci, and Neisseria species other than N. gonorrhoeae and N. meningitidis.All coagulase-negative staphylococci were differentiated on the species level, and not on the genus level alone.Other bacteria and fungi were regarded as obligate pathogens (e.g.Salmonella typhi, Staphylococcus aureus) and therefore always considered as true cause of bacteraemia, whereby only the first isolate within a 14-day period was counted as one episode.Bacteraemia caused by a possible skin contaminant organism was assumed as true if an organism of the same species with a similar antimicrobial susceptibility pattern (at least 85% similarity) was isolated from two or more sets of blood cultures obtained from the same patient within 5 days starting from the last positive culture with the identical organism.In this case, this was counted as a single episode of bacteraemia.If only one isolate was recorded within this period, the blood culture was regarded to be contaminated and was excluded from further analysis.

Calculation and interpretation of antimicrobial susceptibility pattern
Because a gram stain is usually the first available information which can be provided to a physician, susceptibility results calculated by each method were summarized for organisms grouped according to their gram stain result.Based on empirical considerations, we excluded an antimicrobial substance as possible choice for the calculated antibiotic treatment of a bacteraemia, if its resistance reached ≥ 10% against a given microorganism.For all isolates, antimicrobial susceptibility testing was performed by Kirby-Bauer disk-diffusion test according to the Clinical and Laboratory Standards Institute (CLSI, before 2005 known as the National Committee for Clinical Laboratory Standards, NCCLS) guidelines [16], if appropriate.Results of cumulative antimicrobial susceptibility were presented as percentage of resistance against a tested antimicrobial substance.

Statistical analysis
Differences of proportions between RDM, DFM and CFM were calculated by applying the χ 2 test, which gives the probability that an observed difference between two means or proportions is caused by chance.A P-value of less than 0.05 was considered significant.Calculations were performed using Epi Info 2000 v 1.1.2a(Centers for Disease Control and Prevention, Atlanta, GA 30333, USA).

Microbial spectrum and ranking of microorganisms
Analysis from 13 ICUs (01/1998 -12/2004) yielded 1427 microorganisms from positive blood culture results.The spectrum of microorganisms, calculated by RDM, DFM, and CFM, respectively, is shown in Table 1.By means of all data correction methods, Staphylococcus epidermidis and Staphylococcus aureus were the most common organisms.However, regarding their proportion expressed as percentage of all episodes, there are significant differences.Table 2 and Table 3 show the difference of proportions obtained by the different data correction methods as well as the results of significance testing for possible skin contaminants and obligate pathogens, respectively.Although definitions for DFM and CFM did not differ for obligate pathogens, due to the different size of denominators statistically significant differences with respect to proportion of episodes could be observed.While there were differences in proportions for all microorganisms, no statistically significant difference was found for RDM vs. DFM.
Among the 333 microorganisms representing 23.3% of all positive blood cultures summarized as "others" in Table 1

Results of antimicrobial susceptibility testing and appropriateness of antibiotics for empirical therapy
Figure 3 shows differences in the antimicrobial resistance profile of gram-positive, Figure 4 of gram-negative organisms.Considering an empirical cut-off level of ≥ 20% of resistant organisms to an antibiotic as limit for appropriateness of this agent for empirical therapy of infection caused by gram-negative organisms, no data correction method would have yielded a different recommendation for empirical therapy.However, for gram-positive organisms, only the application of the CFM changed signifi-Flowchart of the "if/then" rules of contaminant-free method (CFM) Figure 2 Flowchart of the "if/then" rules of contaminant-free method (CFM).

Discussion
For small hospitals, a manually conducted review and analysis of microbiological data is achievable, but limited by the small numbers of isolates and therefore ensuing a decreased reliability of results.At large institutions, there is often an enormous pool of microbiological data, yielding more accurate estimations on the prevalence of pathogens, but a manually conducted review is unattainable.
A large volume of clinical data demands the use of computer systems, which analyze data in accordance with programmed "if/then" rules derived from models used by medical experts.Naturally, direct transfer of human medical decision-making behaviour to "if/then" rules for artificial intelligence-based computer programs is not always possible, because most of the necessary information is not easily available in electronic form (for example patient temperature, abdominal discomfort, patient "looks" ill).Also, because of the heterogeneity of used laboratory and medical information systems even within the same hospital, merging of information obtained from different sources is often futile.Therefore, the CFM, originally designed as rapid method for surveillance of cases of infection, is an interesting method, although its use strongly depends on the availability of computer systems, since complex "if/then" rules for data correction of large datasets are highly time and concentration consuming and consecutive human errors never can be ruled out.
However, because of the requirement of at least two sets of positive blood cultures within 5 days, the prevalence of true bacteraemia caused by possible skin contaminants may be underestimated by the CFM if the practice of obtaining only one set of blood cultures is common.This situation usually does not arise at ICUs, since in this setting often more than one set of blood culture is drawn on consecutive days.The rationale for this practice is monitoring for success of therapy, and not to miss additional infections by multiple resistant pathogens during antimicrobial therapy.
On the other hand, the CFM enables to study those microorganisms classified as "true" pathogens, and hence being clinically and epidemiologically more relevant.Also, this method gives better estimation of the proportion of the organism isolated from blood cultures.Our study showed   a percentage of 13.4% of Staphylococcus epidermidis using CFM, which is a very plausible estimation on the real proportion of infections caused by this organism according to previous results of prospective studies on catheter-related infections at ICUs [17].Regarding susceptibility pattern and implications on empiric antibiotic therapy, the CFM showed only a difference to DFM or RDM for fusidic acid.
With regard to the associations of empiric therapy and antimicrobial susceptibilities, the difference for the CFM method for fusidic acid, while an interesting observation, is not relevant to the topic of the study question.

Conclusion
Our study demonstrates that different approaches towards raw data correction -none (RDM), duplicate-free (DFM), and a contaminant-free method (CFM) -show different results in analysis of positive blood cultures.
Regarding the spectrum of microorganisms, RDM and DFM yielded almost similar results in ranking of microor-  ganisms, whereas using the CFM resulted in a clinically and epidemiologically more plausible spectrum.For possible skin contaminants, the proportion of microorganisms in terms of number of episodes is most influenced by the CFM, followed by the DFM.However, with exception of fusidic acid for gram-positive organisms, none of the evaluated correction methods would have changed advice for empiric therapy on the selected ICUs.

Table 1 : Comparison of the spectrum and ranking of microorganisms identified from blood cultures in 13 ICUs (1998-2004), analyzed by different data pre-processing methods.
RDM = raw data method; b DFM = duplicate-free method; c CFM = contaminant-free method. a

Table 2 : Difference of proportions according to applied data correction method for possible skin contaminants.
2= Chi-square test; A p-value less than 0.05 (*) is considered to indicate a statistically significant difference.

Table 3 : Difference of proportions according to applied data correction method for obligate pathogens.
Publish with Bio Med Central and every scientist can read your work free of charge "BioMed Central will be the most significant development for disseminating the results of biomedical researc h in our lifetime."BioMedcentral BMC Medical Informatics and Decision Making 2009, 9:27 http://www.biomedcentral.com/1472-6947/9/27 available free of charge to the entire biomedical community peer reviewed and published immediately upon acceptance cited in PubMed and archived on PubMed Central yours -you keep the copyright Submit your manuscript here: http://www.biomedcentral.com/info/publishing_adv.asp