Open Access

Prognosis of the individual course of disease - steps in developing a decision support tool for Multiple Sclerosis

  • M Daumer1, 2Email author,
  • A Neuhaus1,
  • C Lederer1, 2,
  • M Scholz2,
  • JS Wolinsky3,
  • M Heiderhoff4 and
  • the Sylvia Lawry Centre for Multiple Sclerosis Research
BMC Medical Informatics and Decision MakingBMC series – open, inclusive and trusted20077:11

DOI: 10.1186/1472-6947-7-11

Received: 03 November 2006

Accepted: 08 May 2007

Published: 08 May 2007

Abstract

Background

Multiple sclerosis is a chronic disease of uncertain aetiology. Variations in its disease course make it difficult to impossible to accurately determine the prognosis of individual patients. The Sylvia Lawry Centre for Multiple Sclerosis Research (SLCMSR) developed an "online analytical processing (OLAP)" tool that takes advantage of extant clinical trials data and allows one to model the near term future course of this chronic disease for an individual patient.

Results

For a given patient the most similar patients of the SLCMSR database are intelligently selected by a model-based matching algorithm integrated into an OLAP-tool to enable real time, web-based statistical analyses. The underlying database (last update April 2005) contains 1,059 patients derived from 30 placebo arms of controlled clinical trials. Demographic information on the entire database and the portion selected for comparison are displayed. The result of the statistical comparison is provided as a display of the course of Expanded Disability Status Scale (EDSS) for individuals in the database with regions of probable progression over time, along with their mean relapse rate. Kaplan-Meier curves for time to sustained progression in the EDSS and time to requirement of constant assistance to walk (EDSS 6) are also displayed. The software-application OLAP anticipates the input MS patient's course on the basis of baseline values and the known course of disease for similar patients who have been followed in clinical trials.

Conclusion

This simulation could be useful for physicians, researchers and other professionals who counsel patients on therapeutic options. The application can be modified for studying the natural history of other chronic diseases, if and when similar datasets on which the OLAP operates exist.

Background

Multiple sclerosis (MS) is an inflammatory disease of the central nervous system. Currently there is no cure for MS, but there are treatments for its relapsing forms that reduce the short and mid-range impact of the disease. It remains impossible to accurately predict the prognosis of individual patients, which could be important for patient counselling and in weighing therapeutic options. The Sylvia Lawry Centre for MS Research in Munich (SLCMSR) uses data from clinical trials and natural history studies to develop mathematical models for the course of the disease which might assist in the efforts to accurately determine short, mid and long-range prognosis. The "individual risk profile" project makes the Centre's large database available to health care professionals via access to a representative part of the database.

Implementation

The SLCMSR database consist of data from most natural history studies and the placebo arms of nearly all randomised controlled clinical trials in MS patients from the last decade. The data are anonymised, homogenized and pooled from the academic and corporate sources. As of April 2005, 45 separate data sets and 81,000 patient-years of data from 20,000 patients were included. The database is divided into so-called "open" and "closed" parts. The "open" database is accessible by researchers (and in the future possibly other professionals) at their discretion. The "closed" database is accessible only to authorized staff on behalf of the Validation Committee of the Centre as part of the validation policy of the SLCMSR [1].

An interdisciplinary team of neurologists, information technology specialists, and biostatisticians developed a Java-based OLAP tool using the statistical software package R 2.0.1 [2, 3]. The source code is available in Additional file 1. For validation purposes essential parts were independently re-programmed using SAS/INTRNET [4]. Remote users are able to trigger powerful statistical software programs running on the host computer using a simple graphical interface displayed in a standard browser. The results of the analysis are displayed in real time.

Registered OLAP users access the database via their internet browser, select values for patient characteristics of their interest and get displays of the observed disease course for placebo patients from the database with matching characteristics.

Results

The essential component of the OLAP is a matching algorithm. A given patient of interest is "identified" by a set of covariates which are widely accepted to be potential prognostic factors [57]: the number of relapses in the last 12 months, disease duration from diagnosis, age at disease onset, disability level as measured by the Expanded Disability Status Scale (EDSS) [8], and disease course [9] – either relapsing remitting, secondary progressive, primary progressive or clinically isolated syndrome [10, 11]. The matching algorithm then automatically selects the most similar subgroup of patients that are included in the banked database, with similarity defined by the covariates on a disease course specific outcome. The resulting display shows the disease course of all patients in the database that are similar to the patient of interest, and this information can be used to project a hypothesized outcome for the patient of interest, based on the behaviour of similar patients in the database.

This descriptive report is based on 1,059 patients from placebo arms of controlled clinical trials available from release 1.0 of the SLCMSR database (April 2005).

The patient baseline characteristics of the database are summarized in table 1. The data from patients in the dataset agree well with the literature reports concerning sex ratio, age at disease onset, duration of disease and relapse rate of patients with the same disease course as noted here [6, 7]. Demographic information of the selected subgroup is displayed in comparison to the requested characteristics and to demographics of all patients available with the same disease course (see figure 1). Output is provided either as a display of the change of EDSS for individuals in the database over time, regions of progression over time along with the mean relapse rate, or the Kaplan-Meier curve of time to sustained progression in EDSS and time to progression to EDSS 6 (the time of constant assistance to walk) (see figures 2 and 3).
https://static-content.springer.com/image/art%3A10.1186%2F1472-6947-7-11/MediaObjects/12911_2006_Article_142_Fig1_HTML.jpg
Figure 1

Screenshot from the OLAP-tool "Individual Risk Profile" displaying the demographics of the selected subgroup (blue). The green dot indicates the requested patient characteristics (RR disease course, baseline EDSS 2, age at onset 30 years, disease duration 36 months and 2 relapses in the last 12 months). In addition the distributions of all patients from the clinical trial data having the same course of the disease are displayed to give the user the impression on how representative the selected subgroup for the selected course is (gray).

https://static-content.springer.com/image/art%3A10.1186%2F1472-6947-7-11/MediaObjects/12911_2006_Article_142_Fig2_HTML.jpg
Figure 2

Screenshot from the OLAP-tool "Individual Risk Profile" displaying EDSS courses and regions of EDSS evolution. The interior band defines the middle 50% of EDSS courses. The other bands outline (from the inside to the outside) 75%, 90% and 95% of the individual disease courses. Additional information about the disease course is provided by the annual relapse rates. These are displayed as box plots and illustrate the disease activity.

https://static-content.springer.com/image/art%3A10.1186%2F1472-6947-7-11/MediaObjects/12911_2006_Article_142_Fig3_HTML.jpg
Figure 3

Screenshot from the OLAP-tool "Individual Risk Profile" displaying the Kaplan-Meier curves for the time to sustained progression (6 months confirmation period) and time to EDSS 6. For illustration purposes Kaplan-Meier curves are displayed for a SPMS population. In the background the curves for the population as defined in Figure 1 are displayed.

Table 1

Descriptive statistics of clinical trial, placebo arm MS patients derived from the open data set of the SLCMSR and included in "Individual Risk Profile" OLAP-tool, version 2.0.0, divided by the four different courses CIS clinically isolated syndrome, RR relapsing remitting, SP secondary progressive and PP primary progressive.

 

Disease subtype

Database patient characteristics

CIS

RR

SP

PP

No. of patients

124

460

455

20

Sex ratio (m/f)

0.38

0.41

0.69

0.82

Age at onset (Median and 5%- and 95%- percentile)

30 [19;40]

29 [17;41]

29 [18;45]

37 [19;57]

Baseline EDSS (Median and 5%- and 95%- percentile)

1.5 [0.0;5.0]

2.5 [1.0;5.5]

5.5 [3.0;6.5]

3.5 [1.5;6.0]

Duration (months) (Median and 5%- and 95%- percentile)

2 [0;104]

66 [7;225]

148 [40;341]

50 [11;190]

Relapses last 12 months:

no relapse

5

47

256

6

Relapses last 12 months:

1 relapse

115

131

97

4

Relapses last 12 months:

2 or more relapses

0

132

36

1

Relapses last 12 months:

unknown

4

150

66

9

The values in brackets are the 5%- and 95%- percentile.

Discussion

The advantages of OLAP-tools based on a unified database ("data warehousing") are widely applied in business and increasingly used in medical research and medical decision-making. In this report we describe a tool for investigating the course of MS based on specific clinical data and using that information to project disease course in patients of interest.

The predictive power of this tool is limited by restrictions of the composition of patients and duration of observations in the current release of the database. The clinical data are restricted to data from the placebo groups of randomized clinical trials, which may not reflect disease outcomes in patients not involved in clinical trials and certainly do not reflect the impact of daily disease management of patients in non-research clinical settings. Additionally, the observation period of the included patients is limited to a maximum of 3 years (median follow-up 2.2 years) of observation, thus limiting the ability to accurately project longer term outcomes for patients with what is a life-long disease.

A comparison of the tool with population based prevalence cohorts [57] is the next step and this will test the reliability of the tool. In the current version the individual datasets consist of the standard variables (course, age at onset, baseline EDSS, disease duration, and number of relapses in the last 12 months). Although available, quantitative MRI variables were not included due to limitations in availability in clinical practise and their relatively poor additional predictive value for near-term clinical outcomes compared to the other clinical variables [12]. Implementing more refined models especially for the development of disease of progressive patients might result in more valid course predictions. Including long-term observational data from natural history studies that cover 20–30 years would help to improve the core data and likely the predictive ability [13, 14] of the tool as well, if potentially important predictors for the long-term disease evolution such as time to reach certain EDSS levels (e.g. EDSS = 3, 4 and 6) and time-point of reaching the secondary progressive phase of the disease will be included.

In addition, the utility of the OLAP tool as a predictor of disease course is limited by the patient characteristics in the parent dataset. For instance, there are only a small number of patients with a primary progressive course of MS in the SLCMSR dataset, and this subtype cannot be sufficiently analysed yet. The advantage of the OLAP tool (compared with a purely model based prediction), however, is that this limitation is transparent to the user (figure 1).

Conclusion

The "individual risk profile" project is based on a comprehensive database and the programming of an OLAP-tool. Our initial intention is to pilot the use of the database amongst neurologists and other researchers experienced in MS with an interest in clinical trials. This phase should provide informed feedback on the utility of the tool for clinical or research use. Updated versions are planned with expanded databases in particular inclusion of long-term natural history data and a broader spectrum of predictors.

Most developmental steps of this project as well as principles of the creation of the underlying database are not disease-specific. It will be of interest to apply the methods and software developed at the SLCMSR for studying MS to other chronic diseases like heart failure, chronic obstructive pulmonary disease, or diabetes mellitus.

Availability and requirements

Project name: OLAP Individual Risk Profile

Project home page: https://​www.​slcmsr.​net/​public/​

Operating systems: Web based application

Programming language: R, Java

Other requirements: CSS1/UTF-8 compliant browser is required (e.g. Internet Explorer 5.5 or higher, Firefox 1.0 or higher)

License: Free, anyone may use the service for non-commercial purposes.

Any restrictions to use by non-academics: No.

Declarations

Acknowledgements

The SLCMSR is indebted to all its data donors (listed at http://​www.​slcmsr.​org). We thank S. Heiss and C. Harboeck, for SAS programming and S. Reingold, P. O'Connor, C. Polman, G. Cutter, D. Miller, J. Noseworthy, L. Kappos, F. Lublin for valuable comments. In honor of the singular contribution to the establishment of the SLC and bettering the lives of people with MS around the world the core resource is called “the Ian McDonald MS database”.

Authors’ Affiliations

(1)
Sylvia Lawry Centre for Multiple Sclerosis Research
(2)
Trium Analysis Online GmbH
(3)
University of Texas Health Science Center
(4)
University Hospital Heidelberg, Department of General Practice and Health Services Research

References

  1. Schach S, Daumer M, Neiß A: Maintaining high quality of statistical evaluations based on the SLCMSR data base. Validation Policy of the Sylvia Lawry Centre. 2005
  2. R Development Core Team: R: A language and environment for statistical computing. 2004, R Foundation for Statistical Computing, Vienna, Austria, ISBN 3-900051-07-0, [http://​www.​R-project.​org]
  3. Gentleman R, Ihaka R: Lexical Scope and Statistical Computing. JCGS. 2000, 9 (3): 491-508.
  4. SAS/INTRNET Software. [http://​support.​sas.​com/​rnd/​web/​intrnet/​index.​html]
  5. Pittock SJ, Mayr WT, McClelland RL, Jorgensen NW, Weigand SD, Noseworthy JH, Weinshenker BG, Rodriguez M: Change in MS-related disability in a population-based cohort: A 10-year follow-up study. Neurology. 2004, 62: 51-59. 10.1159/000080499.View ArticlePubMed
  6. Ebers GC: Natural history of multiple sclerosis. Multiple sclerosis. Edited by: Compston A. 1998, London: Churchill Livingstone, 191-221. 3
  7. Confavreux C, Vukusic S, Moreau T, Adeleine P: Relapses and progression of disability in multiple sclerosis. New Engl J Med. 2000, 343: 1430-1438. 10.1056/NEJM200011163432001.View ArticlePubMed
  8. Kurtzke JF: Rating neurologic impairment in multiple sclerosis: an expanded disability status scale (EDSS). Neurology. 1983, 33: 1444-1452.View ArticlePubMed
  9. Lublin FD, Reingold SC: Defining the clinical course of multiple sclerosis: results of an international survey. National Multiple Sclerosis Society (USA) Advisory Committee on Clinical Trials of New Agents in Multiple Sclerosis. Neurology. 1996, 46: 907-911.View ArticlePubMed
  10. Comi G, Filippi M, Barkhof F, Durelli L, Edan G, Fernandez O, Hartung H, Seeldrayers P, Sorensen PS, Rovaris M, Martinelli V, Hommes OR, Early Treatment of Multiple Sclerosis Study Group: Effect of early interferon treatment on conversion to definite multiple sclerosis: a randomised study. Lancet. 2001, 357: 1576-1582. 10.1016/S0140-6736(00)04725-5.View ArticlePubMed
  11. Jacobs LD, Beck RW, Simon JH, Kinkel RP, Brownscheidle CM, Murray TJ, Simonian NA, Slasor PJ, Sandrock AW: Intramuscular interferon beta-1a therapy initiated during a first demyelinating event in multiple sclerosis. CHAMPS Study Group. N Engl J Med. 2000, 343: 898-904. 10.1056/NEJM200009283431301.View ArticlePubMed
  12. Daumer M, Hapfelmeier A, Neuhaus A, Ebers G: The additional predictive value of magnetic resonance imaging for the prediction of future relapses if relapse history is available. Multiple Sclerosis. 2006, 12 (Suppl 1): 46-47.
  13. Quade D: Nonparametric Analysis of Covariance by Matching. Biometrics. 1982, 38: 597-611. 10.2307/2530042.View ArticlePubMed
  14. Bergamaschi R: Prognosis of multiple sclerosis: clinical factors predicting the late evolution for an early treatment decision. Expert Rev Neurother. 2006, 6: 357-364. 10.1586/14737175.6.3.357.View ArticlePubMed
  15. Pre-publication history

    1. The pre-publication history for this paper can be accessed here:http://​www.​biomedcentral.​com/​1472-6947/​7/​11/​prepub

Copyright

© Daumer et al; licensee BioMed Central Ltd. 2007

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://​creativecommons.​org/​licenses/​by/​2.​0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.