- Research article
- Open Access
- Open Peer Review
Interactive decision support in hepatic surgery
BMC Medical Informatics and Decision Making volume 2, Article number: 5 (2002)
Hepatic surgery is characterized by complicated operations with a significant peri- and postoperative risk for the patient. We developed a web-based, high-granular research database for comprehensive documentation of all relevant variables to evaluate new surgical techniques.
To integrate this research system into the clinical setting, we designed an interactive decision support component. The objective is to provide relevant information for the surgeon and the patient to assess preoperatively the risk of a specific surgical procedure.
Based on five established predictors of patient outcomes, the risk assessment tool searches for similar cases in the database and aggregates the information to estimate the risk for an individual patient.
The physician can verify the analysis and exclude manually non-matching cases according to his expertise. The analysis is visualized by means of a Kaplan-Meier plot.
To evaluate the decision support component we analyzed data on 165 patients diagnosed with hepatocellular carcinoma (period 1996–2000). The similarity search provides a two-peak distribution indicating there are groups of similar patients and singular cases which are quite different to the average. The results of the risk estimation are consistent with the observed survival data, but must be interpreted with caution because of the limited number of matching reference cases.
Critical issues for the decision support system are clinical integration, a transparent and reliable knowledge base and user feedback.
In the case of liver neoplasms surgery is highly complicated and difficult; it takes sometimes 10 hours to operate malignant liver neoplasms with a perioperative fatality rate as high as three to four percent, which is much higher than the usual operative risk.
Since six years there has been a research database covering in detail the conducted surgical procedures, relevant medical parameters of the patient and the postoperative course.
The primary goals of this documentation in the past and present is to collect detailed data about patient characteristics and outcomes to determine what kind of patients do benefit from specific procedures based on survival and complication rates. There is a demand from the surgeons to extend the capabilities from a pure research database towards a clinically integrated decision support system.
The database is currently being used by five surgeons, one medical student, one person of the administrative staff, one statistician, one computer scientist and finally by one anesthesiologist. In the future two radiologists will also be involved. All these users are scattered among the hospital. In this context of a highly interdisciplinary complex research topic our main focus is set on the following issues:
• Is a knowledge base for this medical domain with an appropriate data structure and high-quality content feasible from a technical point of view?
• Does a decision support system for this medical domain provide clinically relevant information?
Concept of risk assessment tool
The basic idea of the risk assessment tool is to find similar cases to a given patient. The prognosis of the matching subjects is aggregated and taken as an estimate for the risk of the individual patient.
From a methodical point of view similarity search in complex, multidimensional records which are partially incomplete is a difficult problem . To make it even worse, the problem is characterized by a high number of variables in relation to the number of records (we do have 451 items and altogether 766 patients).
For this reason the risk assessment tool is based on selected parameters which are known predictors for outcome of hepatic liver tumor resection . A case is defined to be similar to the actual patient, if all predictive parameters correspond within a given level of tolerance.
The final decision on surgery is taken by the patient and the surgeon; thus both methodology and result of the analysis should be transparent and easy to understand. For this reason the risk is visualized as Kaplan-Meier plot , which is the established standard for visualizing survival data in medicine.
In addition, summarized data on all cases classified by the computer as similar to the actual patient is displayed to enable the physician to verify the analysis.
Cases which are inappropriate according to the physicians expertise can be excluded from the analysis in an interactive manner.
The software engineering approach was iterative. By means of regular user meetings and rapid prototyping after approximately 10 iteration cycles a suitable database structure was defined.
The following design objectives were identified:
• high-granular documentation to enable detailed statistical analysis (e.g. association between laboratory parameters and outcomes)
• reports on data quality (e.g. lost followup)
• access from many locations within the clinic
• different access rights for each user group (surgeons, anaesthetists, administrative staff, statistician)
Automatic generation of web programs
Data entry is performed with a standard web browser. A dedicated software tool (see [8, 9]) has been developed for rapid prototyping of ergonomic, highly adaptive web forms and management of data transformations. It enables to define a data structure (e.g. database table) interactively. A preview of the forms can be generated and presented to the clinical user. When the data structure is defined, all PERL  programs and database tables are generated from templates, i.e. no line of code is programmed manually. The function of the tool is similar to the UltraDev™ extension of Macromedia Dreamweaver™ http://www.macromedia.com/, but is adapted to the needs of medical databases (e.g. specific templates).
The general documentation workflow is as follows: After login the patient is selected from a current list provided by the legacy system. The most recent document for this patient is displayed. The user can navigate within the documents, create new ones or edit the current page. The structure of each document is described in XML format .
The data structures themselves are created and edited with an Intranet based modelling tool. For each item a set of attributes is defined: Type of item (text, pulldown menu, checkbox, radio button, textarea, date, time), default values, constraints and layout. Each item has a unique object ID to enable data transformations when the data structure is updated.
The Intranet tool provides the following functions:
• generation of custom data entry forms
• combination of free text and structured data entry
• individual layout for clinical reports
• data transformations
• user authorization and access control
• tools for data export (XML format)
Security and confidentiality of patient information is protected by means of individual logins and a firewall.
Hard- and software
Both the database program and the risk assessment tool are written in PERL (version 5.005_03)  running on a Linux machine  providing Apache web server (version 1.3)  and a PostgreSQL database (version 6.5.3) .
Frontend and structure of the database
The database itself consists of eight tables (demographics, medical history, volumetrics, surgical documentation, histology, laboratory values, complications, outcome) with an overall number of 451 items indicating a high level of granularity which is required for this complex research topic. Information on 766 patients are recorded. Caused by the high number of items missing values cannot be avoided; this fact must be taken into account by the decision support component.
Fig. 1 presents a small section from the surgical documentation; besides numerical and categorical items multimedia data like CT or MR scans can be stored. Altogether 70 items are recorded per operation.
The interactive decision support component
The decision support component is a server side web application and is accessed using a standard web browser. The application itself is written in PERL invoked by an Apache web server. The knowledge base is stored in a PostgreSQL database which is located on the same server as the application and webserver. Standard techniques such as SQL are used to query the database via the appropriate interfaces.
After invoking the program by the web client a form is presented (Fig. 2) which requires the user to provide demographic data of the patient for whom a suggestion is needed. Clinically relevant parameters must be specified, which have shown to be predictive for patient outcomes (see ); for this reason we selected diagnosis, type of planned resection, PHRR (partial hepatic resection rate ), prothrombin activity (Quick, a blood clotting parameter) and gamma-GT (gamma-Glutamyltranspeptidase, a liver enzyme).
Sometimes not all parameters for an individual patient are available, e.g. estimation of the PHRR is very time consuming. The similarity search also includes datasets which have missing values. It is possible to specify a range for each of the numerical parameters.
Because the number of documented cases is small in comparison to the quantity of possible parameter combinations, the algorithm was designed to avoid overlooking of similar cases (high sensitivity, low specificity). For example there are 35 different diagnoses resulting in approximately 22 cases under the assumption of equipartition of diagnoses within the 766 patients. Therefore all cases that are not contradictory to the query are considered similar.
After submitting the form to the system it connects to the database to retrieve the appropriate results based on the specified parameters and their ranges. The system then computes the data necessary for the Kaplan-Meier plot and generates a web page containing the plot and the underlying data (Fig. 3, Fig. 4).
The Kaplan-Meier plot presents the course of survival over time. It takes into account that the information per patient is limited with respect to time as long as the patient isn't dead. For instance, if a patient had surgery three years ago and is alive, this information can be used only until the third year of the plot. After then, this information is censored, because it is unknown, for how long this patient will be alive.
By simple clicking on a similar case the surgeon can go directly to the database and verify the source information. Then the physician decides, whether this case should be excluded from the analysis by selecting the 'exclude' checkbox. After exclusion of inappropriate cases the analysis can be recalculated. By means of this interactive technique the physician gets involved into the analysis and has the ability to verify and adjust it for an individual patient according to his expert knowledge.
To evaluate the decision support component we analyzed data provided by the clinical information system (CIS) concerning the period from January 1996 to September 2000.
According to the CIS, within this time frame 3269 surgical procedures from 744 patients were performed covering diagnoses of malign and benign liver tumors. 165 of these patients were assigned HCC (hepatocellular carcinoma) as main diagnosis. These 165 patients are partitioned into 93 with no liver resection (conservative therapy, e.g. chemoembolisation) and 72 with liver resection. 21 of the 72 patients were subject to liver transplantation. The remaining 51 patients with HCC and liver resection without transplantation were grouped into 23 covered by the research database and 28 not covered by the research database. Therefore it is evident that a substantial number of patients treated in our hospital is not covered by the research database.
We used a batch version of our decision support system to calculate the Kaplan-Meier estimation for all 165 patients from the CIS diagnosed with HCC. Fig. 5 shows the distribution of the number of cases rated to be similar to a given patient. It has two peaks indicating there are groups of similar patients and singular cases which are quite different from the average. This may be influenced by the small number of reference cases in the database (range: 1 to 25 matched cases), but can also be interpreted that there are patients with exclusive features which are very different from "the average patient".
In the group of 51 patients who underwent liver resection we analyzed perioperative fatality rate (within 30 days) to assess whether there is a shift over time (e.g. due to medical progress) which would bias the survival estimation, but we did not find a significant change. In this group the risk assessment tool estimated a mean 1 year survival rate of 78+/-5% (mean +/- S.D.), which is consistent with an observed fatality rate of 20% (10 of 51 patients died within the first year after operation). This must be interpreted very carefully, because of the limited number of reference cases. We could not evaluate the survival estimations in the group of 93 patients without liver resection, because we do not have reliable information about the followup status of these patients and for this reason we do not know the correct fatality rate in this group.
The problem of similarity search in medicine
The situation with the hepatic surgery dataset concerning number of items, number of records and data quality issues, i.e. many variables, limited number of patients and missing values, is typical for medical, patient-oriented research databases [16, 17]. For this reason data mining and visualization techniques for large databases  cannot be applied easily.
Care must be taken in defining "similar patients" to avoid selection bias, either by too imprecise or too strict criteria. We decided to apply an interactive approach: the computer presents a list of similar patients, but the medical expert can exclude certain cases from the analysis. This provides a synthesis of formalized computer rules and empirical medical expertise.
One might argue why we applied quite few parameter to the similarity search. From a technical point of view, very few variables can split our dataset in manageable units, e.g. there are as many as 35 different diagnoses resulting in approximately 22 cases per diagnosis under the theoretical assumption of equipartition within the 766 patients.
From a clinical viewpoint, the risk assessment tool must be fast and easy to use – for this reason we applied 5 parameters, which have shown to be predictive for patient outcomes (see ).
The research database provides a set of specific reports, e.g. the number of patients per diagnostic category or a list of patients with lost-followup (i.e. followup status is unknown for more than 6 months).
The new decision support component has also an impact on data quality, because when the list of similar cases is displayed it becomes evident, which parameters are implausible or missing. Given the direct link to the individual patient record missing data can be entered directly if it is available.
However, we did not verify whether all eligible patients were entered into the database, therefore a selection bias may occur.
By searching similar cases for very common or clinically relevant situations, targeted data monitoring is feasible, which can improve data quality in an efficient way and might lead to better medical decisions in the long run – it is hard to prove this hypothesis, but because this research database is the foundation for many clinical studies in this field better data quality will enable faster and more reliable scientific results.
The decision about whether or not an individual patient is eligible for a surgical procedure is essential for the routine work of any surgeon. A standalone research database is not suitable for supporting this difficult task. For this reason we developed a tool to extract and visualize information from the research database relevant for an individual patient. This aggregation enables specific insights, tailored to a specific patient, which are not available by simple database queries.
A stepwise decision support is provided:
First, the surgeon and the patient get assistance whether or not a surgical procedure is appropriate. The risk can be calculated specifically for each patient by established predictors of outcomes for this operation.
Second, the surgeon can select different procedure types (extent of liver resection) and analyze the risk.
Third, the clinical decision is stored into the database to enable statistical analysis for future improvements of the system.
The evaluation based on retrospective data of the clinical information system provides evidence, that the risk assessment tools delivers clinically relevant information. However, we did not quantify the potential clinical impact of our system; in this case, a controlled trial would be required.
So far surgeons have made decisions based on their personal expertise and intuition – we want to provide additional, objective information to facilitate this difficult task. The patient also benefits from this comprehensive information.
Visualization and aggregation of medical information
Our approach is visualization by means of the Kaplan-Meier plot which is – in the field of medicine – the established method for displaying survival data. The surgeon and furthermore the patient can get an idea of the risk of the surgery supporting both of them in their decision whether or not to perform the surgery. The benefit of the Kaplan-Meier plot compared to complex decisions trees  is obvious. A decision tree is often hard to understand especially for persons not familiar with this kind of representation. However, due to limitations in sample size and length of follow-up, the Kaplan-Meier plot must be interpreted carefully.
There is no special training needed to introduce the risk assessment tool. The surgeon isn't influenced in his opinion by a unary suggestion presented by the system. But his decision process is supported by data of documented surgeries. In addition Kaplan-Meier plots for certain patient groups can be generated for scientific purposes, i.e. without regard to an individual patient.
High-granular database design
One might argue why we store so many different variables (451 items per patient) in our database. For answering a specific research question (e.g. which of two surgical procedures is more efficient?) much less items were sufficient. But due to medical progress, for example the invention of the jet-cutter , new scientific questions arose in the past and very probably will arise in the future. Retrospective data collection is not only expensive, but also prone to errors and in many cases the data cannot be reconstructed precisely. For this reason we decided to implement a comprehensive medical record for liver resection patients, to prospectively document all potentially relevant items. An alternative approach would be a smaller data model, which would be limited to a specific research question, but could provide more complete data.
With the risk assessment tool we want to bridge the gap between a pure research and a clinical database and by this means improve the data quality.
Success factors for clinical decision support
There are not many, but some relevant examples for effective clinical decision support, e.g. the classical "Computer reminders" of McDonald  and recent applications concerning efficient use of antibiotics [21, 22]. We tried to implement the following common characteristics in our system:
• Clinical integration
Clinical decisions are and should be taken in the forseeable future by doctors and their patients, not by machines. For this reason the computer has the role of enabling the physician to find the correct conclusions. In this setting, any system must provide a comprehensible benefit for the clinical user. Without workflow integration, the data in the computer will not correspond to the medical reality around it and any system will not be able to provide clinically useful information.
• Transparent, up-to-date knowledge base
A "black box" approach is inappropriate in the medical setting, because many decisions are ambiguous. The medical evidence behind a specific statement must be available on demand, to enable the doctor to verify the rationale behind it. In our case, we provide the user the opportunity to view the full record of a case rated as similar to the actual patient. As a side effect of this approach errors in the knowledge base are detected spontaneously.
• User feedback
Neither physicians nor computer scientists know an appropriate data model for a specific medical domain in the beginning. From our experience, in a theoretical setting clinically adequate data structures cannot be defined, thus an iterative approach in software engineering is required. User feedback is important not only for fine tuning of data structures, but also for general acceptance, because the physicians get involved. By collecting the real decisions taken in a particular case we want to gather information on the clinical use of the system.
As stated earlier, a major limitation of medical decision support in general and specifically with regards to the hepatic surgery database is the limited number of fully documented patients. Even at a major university hospital only about 100–150 operations are performed per year. Given the many possible parameters which might influence patient outcomes and the progress in surgery, interinstitutional cooperation is required to build a comprehensive knowledge base. Internet technology can provide the technical platform for this collaboration. Once the knowledge base and the update mechanisms for the content are established, a public accessible decision support tool is feasible.
Rau HG, Schardey HM, Buttler E, Reuter C, Cohnert TU, Schildberg FW: A comparison of different techniques for liver resection: blunt dissection, ultrasonic aspirator and jet-cutter. Eur J Surg Oncol. 1995, 21: 183-7.
Schauer R, Rau H, Baretton G, Blasenbreu S, Meyer G, Schildberg FW: Prognostic factors in gallbladder carcinoma as a decision aid for reoperation. Langenbecks Arch Surg. 1997, 114: 1072-4.
Rau HG, Buttler ER, Baretton G, Schardey HM, Schildberg FW: Jet-cutting supported by high frequency current: new technique for hepatic surgery. World J Surg. 1997, 21: 254-9. 10.1007/s002689900225.
Rau HG, Buttler ER, Baretton G, Schardey HM, Schildberg FW: Jet-cutting Supported by High Frequency Current: New Technique for Hepatic Surgery. World J Surg. 1997, 21: 254-260. 10.1007/s002689900225.
Rau HG, Schauer R, Helmberger T, Holzknecht N, von Rückmann B, Meyer L, Buttler E, Kessler M, Zahlmann G, Schuhmann D, Schildberg FW: Impact of virtual reality imaging on hepatic liver tumor resection: calculation of risk. Langenbecks Arch Surg. 2000, 385: 162-170. 10.1007/s004230050260.
Kanth KV, Agrawal D, Singh AK: Dimensionality Reduction for Similarity Searching in Dynamic Databases. In: SIGMOD 1998. Edited by: Haas LM, Tiwary A. 1998, ACM Press, 166-176.
Altman DG: Practical Statistics for Medical Research. Chapman & Hall, London. 1991, 368-371.
Dugas M, Überla K: Intranet Based Clinical Workstations. In: Medical Informatics, Biostatistics and Epidemiology for Efficient Health Care and Medical Research. Edited by: Victor N. 1999, Urban und Vogel, Munich, 235-238.
Dugas M, Überla K: Intranet-Based Clinical Data Entry. AMIA Proc. 1999, 1051-
L. Wall, R. Schwartz: Programming PERL. O'Reilly & Associates Inc., Sebastopol, CA, USA. 1992
Extensible Markup Language (XML) [resource on World Wide Web]. 2000, [http://www.w3.org/XML]
SuSE Linux Home Page [resource on World Wide Web]. 2000, [http://www.suse.de]
The Apache Software Foundation Home Page [resource on World Wide Web]. 2001, [http://www.apache.org]
PostgreSQL [resource on World Wide Web]. 2000, [http://www.de.postgresql.org/]
Okamoto EKA, Yamanaka N: Prediction of safe limits of hepatectomy by combined volumetric and function measurements in patients with impaired hepatic function. Surgery. 1984, 95: 586-592.
Morik K, Imboff M, Brockhausen P, Joachims T, Gather U: Knowledge discovery and knowledge validation in intensive care. Artif Intell Med. 2000, 19: 225-249. 10.1016/S0933-3657(00)00047-6.
Lavrac N: Selected techniques for data mining in medicine. Artif Intell Med. 1999, 16: 3-23. 10.1016/S0933-3657(98)00062-1.
Keim DA: Visual Database Exploration Techniques. Int. Conference on Knowledge Discovery & Data mining, Newport Beach, CA 1997 (KDD'97). [resource on World Wide Web]. 2001, [http://www.informatik.uni-halle.de/~keim/PS/KDD97.pdf]
Ankerst M, Elsen Ch, Ester M, Kriegel HP: Visual Classification: An Interactive Approach to Decision Tree Construction. ACM SIGKDD Int. Conf. on Knowledge Discovery & Data Mining (KDD'99), San Diego, CA. 1999
McDonald CJ: Protocol-based computer reminders, the quality of care and the non-perfectibility of man. N Engl J Med. 1976, 295: 1351-1355.
Evans RS, Pestotnik SL, Classen DC, Clemmer TP, Weaver LK, Orme JF, Lloyd JF, Burke JP: A Computer-Assisted Management Program for Antibiotics and Other Antiinfective Agents. N Engl J Med. 1998, 338: 232-8. 10.1056/NEJM199801223380406.
Shojania K: Reducing vancomycin use utilizing a computer guideline: results of a randomized controlled trial. JAMIA. 1998, 5: 554-562.
Dugas M: Clinical applications of Intranet-Technology. In: New Technologies in Hospital Information Systems. Edited by: Dudeck J. 1997, IOS Press, Amsterdam, 115-118.
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1472-6947/2/5/prepub
We acknowledge the important contributions from Dr. Hornung, department of surgery and Dr. Sator, department of clinical chemistry, University Hospital of Munich, Germany. The CT scan was provided by Dr. Hundt, department of radiology, University Hospital of Munich, Germany.
Dr. Dugas: Database design and implementation, data analysis, preparation of the manuscript
Dr. Schauer, Mr. Volk, Dr. Rau: Clinical implementation, data management, review