Can multiple SNP testing in BRCA2 and BRCA1 female carriers be used to improve risk prediction models in conjunction with clinical assessment?

Prosperi, Mattia CF; Ingham, Sarah L; Howell, Anthony; Lalloo, Fiona; Buchan, Iain E; Evans, Dafydd Gareth

doi:10.1186/1472-6947-14-87

Research article
Open access
Published: 01 October 2014

Mattia CF Prosperi¹, Sarah L Ingham¹, Anthony Howell², Fiona Lalloo³, Iain E Buchan¹ & Dafydd Gareth Evans²,³

BMC Medical Informatics and Decision Making volume 14, Article number: 87 (2014)

Abstract

Background

Several single nucleotide polymorphisms (SNPs) at different loci have been associated with breast cancer susceptibility, accounting for around 10% of the familial component. Recent studies have found direct associations between specific SNPs and breast cancer in BRCA1/2 mutation carriers. Our aim was to determine whether validated susceptibility SNP scores improve the predictive ability of risk models in comparison/conjunction to other clinical/demographic information.

Methods

Female BRCA1/2 carriers were identified from the Manchester genetic database, and included in the study regardless of breast cancer status or age. DNA was extracted from blood samples provided by these women and used for gene and SNP profiling. Estimates of survival were examined with Kaplan-Meier curves. Multivariable Cox proportional hazards models were fit in the separate BRCA datasets and in menopausal stages screening different combinations of clinical/demographic/genetic variables. Nonlinear random survival forests were also fit to identify relevant interactions. Models were compared using Harrell’s concordance index (1 - c-index).

Results

548 female BRCA1 mutation carriers and 523 BRCA2 carriers were identified from the database. Median Kaplan-Meier estimate of survival was 46.0 years (44.9-48.1) for BRCA1 carriers and 48.9 (47.3-50.4) for BRCA2. By fitting Cox models and random survival forests, including both a genetic SNP score and clinical/demographic variables, average 1 - c-index values were 0.221 (st.dev. 0.019) for BRCA1 carriers and 0.215 (st.dev. 0.018) for BRCA2 carriers.

Conclusions

Random survival forests did not yield higher performance compared to Cox proportional hazards. We found improvement in prediction performance when coupling the genetic SNP score with clinical/demographic markers, which warrants further investigation.

Background

BRCA1 and BRCA2 are major susceptibility genes that confer high lifetime risks for both breast and ovarian cancer. Deleterious mutations in these autosomal dominant cancer genes account for approximately 15-20% of the familial component of breast cancer [1–3]. The variable penetrance exhibited by these BRCA mutations suggests that other genetic factors are present [4], and several studies have now identified a large number of breast cancer susceptibility alleles [5–7]. Until recently, genome-wide association studies had identified 19 common variants at 18 loci that are associated with breast cancer susceptibility [5, 7], though the risk attributed to each of these single nucleotide polymorphisms (SNPs) is often modest and remains largely unexplained [6]. More recent studies into these polymorphisms have found direct associations between specific SNPs and breast cancer in BRCA1/2 mutation carriers; the TOX3, FGFR2, MAP3K, LSP1, 2q35, SLC4A7, 1p11.2, 5p12 and 6q25.1 loci have all been associated with increased risk of breast cancer for BRCA2 mutation carriers [6, 7]. Antoniou et al. [6] further determined that TOX3, 2q35 and 6q25.1 were polymorphisms that increased risk for BRCA1 mutation carriers. However, a recent study by Ingham et al. [8] found that the 18 validated breast cancer susceptibility SNPs do not differentiate the risks of breast cancer in those with BRCA1 mutations.

Some genetic modifiers may themselves influence breast cancer risk factors rather than being directly associated with breast cancer, such as the genetic component associated with high mammographic density [4, 9]. A recent study by Mitchell et al., looking at mammographic density in 206 BRCA1 and BRCA2 carriers compared to non-carriers, found a significant association between increased breast cancer risk and increasing density in BRCA1/2 carriers [9].

Alongside risk factors with a genetic component, several hormonal risk factors are thought to be associated with breast cancer both among the general population and among those with hereditary breast cancer [10]. Changes in breast mitotic/apoptotic activity have been correlated with alterations in hormone levels across the menstrual cycle, and reduced levels of oestrogen and progesterone have been linked to a reduced risk of breast cancer [11, 12]. Some debate surrounds the association of these factors with breast cancer among BRCA1/2 carriers, however, with some studies finding an association only in BRCA1 mutation carriers [13] and others finding no association [12]. Modifiable factors, such as body mass index (BMI), are also thought to influence the risk of breast cancer. Obesity has a well-documented association with breast cancer in the general population, due to the influence of biological pathways [14], and postmenopausal weight gain has been associated with increased risk among BRCA carriers [15].

At present, several personalised risk prediction models have been developed using familial, demographic, clinical, laboratory and genetic information domains, with a few combinations thereof [8, 16–19]; examples include the Gail, BOADICEA and IBIS methods [20]. More specific studies include surveys on gene expression markers [21] and the use of machine learning for predicting recurrence or re-defining subtypes [22, 23].

The aim of this study was to determine whether validated susceptibility SNPs improve the predictive ability of risk models in conjunction and comparison to demographic and clinical information.

Methods

Study population

Patients included in this study were BRCA1 and BRCA2 female pathogenic mutation carriers ascertained from the Genetic Medicine department, St Mary’s Hospital, Manchester, UK. This clinic is one of the largest specialist genetics departments within the UK, and all families with a history of breast or ovarian cancer within the North West region are referred. Patients were included in this study regardless of breast cancer status or age. Dates of birth were taken from the information collected at time of family referral to the genetics department. Cases of breast cancer were confirmed by means of hospital records or the North West Cancer Intelligence Service. Dates of last follow-up were either date of breast cancer diagnosis or date the woman was last in contact with the genetics department or other NHS service or date of death.

Ethics statement

This research has been performed in accordance with the Declaration of Helsinki. The NHS Health Research Authority, National Health Research Ethics Committee North West, Greater Manchester Central (Barlow House, 4 Minshull Street, Manchester, M1 3DZ), reviewed this study and gave ethical approval; the Research Ethics Committee reference number is 10/H1008/24, dated 11th July 2013. Written informed consent was obtained from all study participants (none minor at the time of enrolment).

DNA testing

DNA was extracted from blood samples provided by women attending the genetic clinics. Sanger sequencing and multiplex ligation-dependent probe amplification analysis were used for gene and SNP profiling; BRCA1 and BRCA2 mutations were identified, as well as the presence of any of the 18 tested breast cancer SNPs. Overall breast cancer SNP risk scores were calculated for each woman using the methods described by Ingham et al. [8].

Statistical models

The study population was stratified by BRCA type (1 or 2) and menopausal stage (ovulating vs. menopause). Incidence of breast cancer was calculated for the strata, as well as Kaplan-Meier [24] estimates of survival. Main-effect multivariable Cox proportional hazards (CPH) [25] models were fit in the separate BRCA data sets and then in the menopausal stages. The end-point was the time to cancer, censored by the current age (or loss to follow-up, or death from other causes). The proportional hazards assumption was tested via weighted residuals [26]. Variables included in the analyses were (see Table 1): year of birth; Manchester score [27] (transformed using the inverse hyperbolic sine); BMI; parity; age of menarche; age of menopause; age of first full-term pregnancy; oral contraception usage; time of diagnosis of an ovarian cancer followed up by oophorectomy (if any); time of mastectomy (if any); SNPs rs614367, rs704010, rs713588, rs889312, rs909116, rs1011970, rs1156287, rs1562430, rs2981579, rs3757318, rs3803662, rs4973768, rs8009944, rs9790879, rs10995190, rs11249433, rs13387042, rs10931936; and the genetic predisposition score (GPS), calculated on the mentioned SNPs according to Ingham et al. [8]. Missing values were preliminarily analysed by means of univariable CPH, comparing the Akaike information criterion (AIC) [28] and coefficient p-values of models with median/mode imputation vs. stratification into quartiles and addition of a category for those values which were missing.
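As an illustration of the Kaplan-Meier estimate mentioned above, the following is a minimal sketch in plain Python. The study itself was analysed in R; the function names `kaplan_meier` and `median_survival` and the toy ages are hypothetical and are not taken from the study data. Time is measured as age, and an event is a breast cancer diagnosis.

```python
from collections import Counter

def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct event time.

    times  -- age at breast cancer diagnosis or at censoring
    events -- 1 if breast cancer was diagnosed at that age, 0 if censored
    """
    cases = Counter(t for t, e in zip(times, events) if e)
    leaving = Counter(times)  # both events and censorings leave the risk set
    at_risk = len(times)
    surv, curve = 1.0, []
    for t in sorted(leaving):
        d = cases.get(t, 0)
        if d:
            # Product-limit step: multiply by the fraction surviving time t
            surv *= 1.0 - d / at_risk
            curve.append((t, surv))
        at_risk -= leaving[t]
    return curve

def median_survival(curve):
    """Smallest time at which S(t) drops to 0.5 or below (None if it never does)."""
    for t, s in curve:
        if s <= 0.5:
            return t
    return None

# Toy data (hypothetical ages, not from the study)
ages = [35, 38, 41, 41, 46, 50, 52, 60]
status = [1, 0, 1, 1, 0, 1, 0, 1]
curve = kaplan_meier(ages, status)
print(curve, median_survival(curve))
```

The median reported by such a procedure is the first age at which the estimated cancer-free probability reaches one half, which is how the median Kaplan-Meier estimates in the Results can be read.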
The following CPH models were fit for each population stratum, where the clinical/demographic set comprises year of birth, Manchester score, BMI, parity, age of menarche, age of menopause, age of first full-term pregnancy, oral contraception usage, oophorectomy and mastectomy: (i) GPS; (ii) GPS + clinical/demographic set; (iii) SNPs; (iv) SNPs + clinical/demographic set; (v) clinical/demographic set; (vi) all variables. CPH models (ii), (iii), (iv) and (vi) were feature-selected using a forward/backward stepwise heuristic driven by AIC [29]. Nonlinear random survival forests (RSF) [30] were also fit on all variables to identify putative variable interactions (333 trees, choosing the log-rank splitting rule). Table 1 summarises which variables were used for each model. CPH and RSF were compared using the complementary value of Harrell’s concordance index (1 - c-index) [31] and the area under the receiver operating characteristic curve (AUROC) [32], under a bootstrap-based (100 resampled sets, using the out-of-bag predictions) method of extra-sample error estimation [33].
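The evaluation scheme described above can be sketched as follows, assuming a simplified setting. This is not the authors' R code: `harrell_c` is a basic implementation of Harrell's concordance index that ignores tied times, `fit_sign_model` is a hypothetical stand-in for a fitted Cox or forest model, and the data are synthetic. The sketch only shows how 1 - c-index can be averaged over the out-of-bag subjects of 100 bootstrap resamples.

```python
import math
import random
import statistics

def harrell_c(times, events, risk):
    """Harrell's concordance index; returns None if no pair is usable."""
    concordant, usable = 0.0, 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a censored subject cannot be the earlier member of a pair
        for j in range(n):
            if times[i] < times[j]:
                usable += 1
                if risk[i] > risk[j]:
                    concordant += 1.0   # higher risk failed first
                elif risk[i] == risk[j]:
                    concordant += 0.5   # tied risk scores count as half
    return concordant / usable if usable else None

def fit_sign_model(times, events, x):
    """Hypothetical stand-in for model fitting: orient x so that higher = riskier."""
    c = harrell_c(times, events, x)
    sign = 1.0 if c is None or c >= 0.5 else -1.0
    return lambda value: sign * value

def oob_one_minus_c(times, events, x, fit, n_boot=100, seed=0):
    """Mean and st.dev. of 1 - c-index over out-of-bag subjects of bootstrap sets."""
    rng = random.Random(seed)
    n = len(times)
    errors = []
    for _ in range(n_boot):
        boot = [rng.randrange(n) for _ in range(n)]  # resample with replacement
        in_bag = set(boot)
        oob = [i for i in range(n) if i not in in_bag]
        model = fit([times[i] for i in boot],
                    [events[i] for i in boot],
                    [x[i] for i in boot])
        c = harrell_c([times[i] for i in oob],
                      [events[i] for i in oob],
                      [model(x[i]) for i in oob])
        if c is not None:
            errors.append(1.0 - c)
    return statistics.mean(errors), statistics.stdev(errors)

# Synthetic data: larger x means a higher hazard; censoring is independent
rng = random.Random(1)
xs, ts, es = [], [], []
for _ in range(80):
    x = rng.uniform(-1.0, 1.0)
    t_event = rng.expovariate(math.exp(2.0 * x))
    t_cens = rng.expovariate(0.5)
    xs.append(x)
    ts.append(min(t_event, t_cens))
    es.append(1 if t_event <= t_cens else 0)

mean_err, sd_err = oob_one_minus_c(ts, es, xs, fit_sign_model)
print(f"1 - c-index: {mean_err:.3f} (st.dev. {sd_err:.3f})")
```

A value of 0.5 corresponds to a model that ranks subjects no better than chance, and lower values of 1 - c-index indicate better discrimination.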

Table 1 List of variables used in the study (for both BRCA1 and BRCA2 populations), data types, and variable inclusion in Cox proportional hazards models (i) to (vi)

All analyses were carried out using the R software [34].

Results

The BRCA1 population included 548 subjects, whilst the BRCA2 population 523. Table 2 shows population characteristics stratified by BRCA type and menopausal stage.

Table 2 Characteristics of the study population

Incidence of breast cancer for all BRCA1 carriers was 321 events per 23,649 person-years of follow-up (PYFU), i.e. 0.014 (95% confidence interval, CI, 0.012-0.015). It was 92/9,872 (0.009, 95% CI 0.008-0.011) and 88/3,770 (0.023, 95% CI 0.019-0.029) for the menopause and ovulating strata, respectively. The median (95% CI) Kaplan-Meier estimate of survival time to breast cancer was 46.0 (44.9-48.1) years in the whole BRCA1 population, 53.7 (52.0-60.7) for the menopause stratum, and 35.5 (32.9-38.3) for the ovulating population (p < 0.0001, log-rank test). Women diagnosed with an ovarian cancer who underwent an oophorectomy had a higher survival probability than those who did not (p < 0.0001, log-rank test). At age 50 years, the probability (95% CI) of survival was 0.82 (0.70-0.96) for those who had oophorectomy (71 women, 12 breast cancer events), versus 0.34 (0.30-0.39) for the others. At age 60 it was 0.59 (0.40-0.86) versus 0.19 (0.15-0.24). There was one case of breast cancer after risk-reducing mastectomy (out of 49 women operated).
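The incidence figures above follow from dividing events by person-years. The paper does not state how the confidence intervals were obtained; the sketch below assumes a Poisson count with a normal approximation on the log scale, which reproduces the reported BRCA1 intervals to three decimals. The function name `incidence_rate_ci` is hypothetical.

```python
import math

def incidence_rate_ci(events, person_years, z=1.96):
    """Incidence rate with an approximate 95% CI (assumed method, see lead-in)."""
    rate = events / person_years
    # Under a Poisson model the s.e. of log(rate) is roughly 1 / sqrt(events)
    half_width = z / math.sqrt(events)
    return rate, rate * math.exp(-half_width), rate * math.exp(half_width)

# Event counts and person-years as reported for the BRCA1 population
for label, d, py in [("all BRCA1", 321, 23649),
                     ("menopause", 92, 9872),
                     ("ovulating", 88, 3770)]:
    rate, lo, hi = incidence_rate_ci(d, py)
    print(f"{label}: {rate:.3f} ({lo:.3f}-{hi:.3f})")
```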

Incidence of breast cancer for all BRCA2 carriers was 323 events per 23,796 person-years of follow-up (PYFU), i.e. 0.014 (95% confidence interval, CI, 0.012-0.015). It was 105/10,120 (0.010, 95% CI 0.008-0.012) and 72/3,265 (0.022, 95% CI 0.017-0.028) for the menopause and ovulating strata, respectively. The median (95% CI) Kaplan-Meier estimate of survival time was 48.9 (47.3-50.4) years in the whole BRCA2 population, 56.3 (52.3-58.7) for the menopause stratum, and 36.8 (34.9-41.4) for the ovulating population (p < 0.0001, log-rank test). Women who underwent an oophorectomy had a higher survival probability than those who did not (p < 0.0001, log-rank test). At age 50 years, the probability (95% CI) of survival was 0.88 (0.82-0.95) for those who had oophorectomy (102 women, 23 breast cancer events), versus 0.34 (0.30-0.40) for the others. At age 60 it was 0.70 (0.59-0.83) versus 0.11 (0.07-0.15). As in the BRCA1 population, there was only one case of breast cancer after risk-reducing mastectomy (out of 17 women operated). Figure 1 shows Kaplan-Meier graphs for the whole BRCA1/2 population, for the menopausal stage strata, and for those who had/had not oophorectomy after the diagnosis of an ovarian cancer.

When applying models (i) through (vi) and RSF on the whole BRCA1 population, using the out-of-bag estimator, average (st. dev.) 1 - c-index values were (see Table 3), respectively, 0.468 (0.037), 0.221 (0.019), 0.504 (0.026), 0.238 (0.019), 0.222 (0.019), 0.236 (0.018), 0.243 (0.019). When applying models (i) through (vi) and RSF on the whole BRCA2 population, using the out-of-bag estimator, average (st. dev.) 1 - c-index values were, respectively, 0.417 (0.021), 0.215 (0.018), 0.469 (0.028), 0.241 (0.019), 0.217 (0.018), 0.232 (0.019), 0.230 (0.019). The best model was therefore (ii), including GPS and clinical/demographic variables. The hypothesis of a lower difference in mean with respect to model (ii) for all other models could be rejected, except for models (i) and (iii), which included only genetic variables (all p < 0.0001 for both BRCA1 and BRCA2, Student’s t-test corrected for sample overlap from multiple validation). Notably, a re-calibrated SNP score, i.e. models (iii) and (iv), did not perform as well as the GPS. Consistent results were obtained by looking at the AUROC in the 1st, 2nd and 3rd quartiles of observation times. The AUROC estimation was performed on a smaller out-of-bag sample (333 out-of-bag instances) for computational reasons. Figures 2 and 3 show c-index/AUROC graphs for the BRCA1/2 sets based on the out-of-bag estimator. Similar figures were obtained when stratifying for the menopausal stage (data not shown).

Table 3 Average (st. dev.) 1 - c-index performance results of Cox proportional hazards and random survival forest models as estimated by collating out-of-bag distributions from 100 bootstrap runs

Tables 4 and 5 report relative hazards (RH) obtained by fitting Cox model (ii) on the BRCA1 and BRCA2 populations, overall and stratified by menopausal stage. There was a calendar year of birth effect, increasing the risk of cancer for both BRCA1/2 carrier cohorts (RH ranging from 1.06 to 1.08, p < 0.0005 across all strata). The Manchester score had a protective effect in the BRCA1 menopause stratum (RH = 0.35, p = 0.0006) and showed the same trend in the whole BRCA1 population (RH = 0.8, p = 0.1), but neither the RH directions nor the significance levels were consistent across all strata. The GPS had a protective effect in the whole BRCA1 population and in the ovulating stratum (RH 0.76/0.58, p < 0.015), and was associated with a higher hazard of breast cancer in the whole BRCA2 population (RH = 1.33, p = 0.035).

Table 4 Multivariable Cox regression fit on BRCA1 data set, overall and stratified by menopausal stage, with covariate set based on model (ii)

Table 5 Multivariable Cox regression fit on BRCA2 data set, overall and stratified by menopausal stage, with covariate set based on model (ii)

The ovulating stratum (i.e. “not yet” in the menopausal stage, as in Tables 4 and 5) had a higher hazard of breast cancer as compared to the first age quartile of the menopausal stage stratum (i.e. women entering the menopausal stage at ~40 years old). An early age of menopause (first age quartile, ~40 years old) was associated with a higher hazard of breast cancer as compared to an older age of menopause (yet a lower hazard than the ovulating stratum), consistently across all BRCA1/2 carrier types, in the whole population and in the menopausal stage stratum. Note that menopause may occur within the same year in which chemotherapy was initiated upon breast cancer diagnosis, resulting de facto in competing events (as the age at menopause was recorded to the nearest year). Women who had oophorectomy had a lower hazard as compared to those who had not (mastectomy could not be properly assessed due to the low number of events).

Finally, when fitting model (vi), i.e. feature-selected Cox regression using a forward/backward stepwise heuristic driven by AIC, for both BRCA1/2 sets only the year of birth, all the menopausal age stages (along with ovulating stratum), and the oophorectomy variables were selected in the final model (RH were in line with those obtained from other models).

Discussion

In this study we applied a robust model selection framework, composed of linear and non-linear statistical techniques for survival analysis, to test the predictive ability of existing risk scores for breast cancer in a population of BRCA1/2 carriers. We aimed to improve over the current state of the art, which ranges from models based on early genotyping and familial assessment to the most recent SNP scoring, by combining clinical/demographic information with high-resolution genetics. We also assessed the incidence and the determinants of breast cancer in the study population, and stratified the analyses by menopausal status.

RSF did not yield higher performance as compared to CPH, even if for some of the data sets the proportional hazard assumption was not met. Interestingly, the re-calibration of GPS via the inclusion of SNPs in a CPH did not produce a better model fit (in terms of c-index or AUROC) than using the original GPS in a CPH. In our case, the c-index estimation through out-of-bag distributions may be a conservative choice, but robust to over-training.

This study further highlights the predictive ability of GPS for BRCA2, showing an increased RH of 1.33 (1.1-1.61) in the whole population, although not significant at the 0.05 level in the menopausal/ovulating stage strata. For BRCA1, instead, the effect of GPS was protective (RH = 0.76, p = 0.01) in the whole BRCA1 population and in the ovulating stage stratum (also protective but not significant at the 0.05 level in the menopausal stratum). Previous findings of Ingham et al. [8] already pointed out the predictive ability of the 18-SNP GPS in BRCA2 but not BRCA1 carriers. This significant association of GPS, however, was not supported when fitting the stepwise models, which retained only the year of birth, the menopausal stage and the oophorectomy variables (across all carrier types and strata). The age cohort and oophorectomy had been previously associated with increased and decreased risk of breast cancer, respectively [35, 36]. We found that later ages of menopause had a lower hazard of breast cancer as compared to the first age quartile (~40 years old), which seems in contradiction with previous results by Tyrer et al. [18], and that being in the ovulating stratum had a higher hazard than experiencing early menopause. This is likely a model artefact, because menopause may be induced right after the initiation of chemotherapy (i.e. competing events), and the menopause age is given to the nearest year. In any case, as women entering the menopausal stage early may be subject to treatment for preserving fertility, this warrants further investigation including a number of potential confounders.

One limitation of this study is the use of the c-index as a measure of model performance, which presents a series of flaws [37–39], although our results were confirmed using the AUROC estimator. Alternative measures have been presented, such as prediction error curves [40], which may be employed as additional indicators. Another limitation is that we did not fit the Cox models using time-updated covariates (as for menopausal stage or age of menarche, for instance); this may dilute their effect across all time, instead of calculating the hazard on specific time intervals.
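To make the time-updated covariate idea concrete, one common approach is to split each woman's follow-up into (start, stop] intervals at the age her covariate changes, so that a Cox model in counting-process form can use the value in force during each interval. The sketch below illustrates this for menopausal status only; the function name `split_at_menopause` and the row layout are hypothetical, and this was not done in the study.

```python
def split_at_menopause(entry_age, exit_age, event, menopause_age=None):
    """Return counting-process rows (start, stop, event, postmenopausal).

    entry_age / exit_age -- start and end of follow-up, in years of age
    event                -- 1 if breast cancer was diagnosed at exit_age, else 0
    menopause_age        -- age at menopause, or None if still ovulating at exit
    """
    if menopause_age is None or menopause_age >= exit_age:
        # Never postmenopausal during follow-up: one premenopausal interval
        return [(entry_age, exit_age, event, 0)]
    if menopause_age <= entry_age:
        # Postmenopausal for the whole follow-up
        return [(entry_age, exit_age, event, 1)]
    # Status changes during follow-up: the event can only fall in the last interval
    return [
        (entry_age, menopause_age, 0, 0),
        (menopause_age, exit_age, event, 1),
    ]

# A woman followed from birth, menopause at 48, diagnosed at 55 (toy values)
print(split_at_menopause(0, 55, 1, menopause_age=48))
```

With rows of this form, the hazard associated with menopausal status is estimated only from the person-time actually spent in each state, rather than from a single baseline value applied across all follow-up.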

Conclusions

We exploited model selection in machine learning towards the personalised diagnosis of breast cancer, incorporating different domains of information including genetics, clinical, and demographics. Given the improvement in prediction performance obtained by coupling a genetic predisposition score with clinical and demographic markers, further investigation for identifying both genetic and non-genetic factors (along with their interactions in terms of epigenetics) is warranted.

References

1. Mavaddat N, Antoniou AC, Easton DF, Garcia-Closas M: Genetic susceptibility to breast cancer. Molecular oncology. 2010, 4 (3): 174-191.
2. Couch FJ, Wang X, McGuffog L, Lee A, Olswold C, Kuchenbaecker KB, Soucy P, Fredericksen Z, Barrowdale D, Dennis J, Gaudet MM, Dicks E, Kosel M, Healey S, Sinilnikova OM, Lee A, Bacot F, Vincent D, Hogervorst FB, Peock S, Stoppa-Lyonnet D, Jakubowska A, Radice P, Schmutzler RK, Domchek SM, Piedmonte M, Singer CF, Friedman E, Thomassen M, kConFab Investigators, et al: Genome-wide association study identifies novel breast cancer susceptibility loci. Nature. 2007, 447 (7148): 1087-1093.
3. Stacey SN, Manolescu A, Sulem P, Rafnar T, Gudmundsson J, Gudjonsson SA, Masson G, Jakobsdottir M, Thorlacius S, Helgason A, Aben KK, Strobbe LJ, Albers-Akkers MT, Swinkels DW, Henderson BE, Kolonel LN, Le Marchand L, Millastre E, Andres R, Godino J, Garcia-Prats MD, Polo E, Tres A, Mouy M, Saemundsdottir J, Backman VM, Gudmundsson L, Kristjansson K, Bergthorsson JT, Kostic J, et al: Common variants on chromosomes 2q35 and 16q12 confer susceptibility to estrogen receptor-positive breast cancer. Nat Genet. 2007, 39 (7): 865-869.
4. Chenevix-Trench G, Milne RL, Antoniou AC, Couch FJ, Easton DF, Goldgar DE, Cimba: An international initiative to identify genetic modifiers of cancer risk in BRCA1 and BRCA2 mutation carriers: the Consortium of Investigators of Modifiers of BRCA1 and BRCA2 (CIMBA). Breast cancer research: BCR. 2007, 9 (2): 104.
5. Turnbull C, Ahmed S, Morrison J, Pernet D, Renwick A, Maranian M, Seal S, Ghoussaini M, Hines S, Healey CS, Hughes D, Warren-Perry M, Tapper W, Eccles D, Evans DG, Hooning M, Schutte M, van den Ouweland A, Houlston R, Ross G, Langford C, Pharoah PD, Stratton MR, Dunning AM, Rahman N, Easton DF, Breast Cancer Susceptibility Collaboration (UK): Genome-wide association study identifies five new breast cancer susceptibility loci. Nat Genet. 2010, 42 (6): 504-507.
6. Antoniou AC, Beesley J, McGuffog L, Sinilnikova OM, Healey S, Neuhausen SL, Ding YC, Rebbeck TR, Weitzel JN, Lynch HT, Isaacs C, Ganz PA, Tomlinson G, Olopade OI, Couch FJ, Wang X, Lindor NM, Pankratz VS, Radice P, Manoukian S, Peissel B, Zaffaroni D, Barile M, Viel A, Allavena A, Dall'Olio V, Peterlongo P, Szabo CI, Zikan M, Claes K: Common breast cancer susceptibility alleles and the risk of breast cancer for BRCA1 and BRCA2 mutation carriers: implications for risk prediction. Cancer Res. 2010, 70 (23): 9742-9754.
7. Antoniou AC, Kartsonaki C, Sinilnikova OM, Soucy P, McGuffog L, Healey S, Lee A, Peterlongo P, Manoukian S, Peissel B, Zaffaroni D, Cattaneo E, Barile M, Pensotti V, Pasini B, Dolcetti R, Giannini G, Putignano AL, Varesco L, Radice P, Mai PL, Greene MH, Andrulis IL, Glendon G, Ozcelik H, Thomassen M, Gerdes AM, Kruse TA, Birk Jensen U, Crüger DG, et al: Common alleles at 6q25.1 and 1p11.2 are associated with breast cancer risk for BRCA1 and BRCA2 mutation carriers. Hum Mol Genet. 2011, 20 (16): 3304-3321.
8. Ingham SL, Warwick J, Byers H, Lalloo F, Newman WG, Evans DG: Is multiple SNP testing in BRCA2 and BRCA1 female carriers ready for use in clinical practice? Results from a large Genetic Centre in the UK. Clin Genet. 2013, 84 (1): 37-42.
9. Mitchell G, Antoniou AC, Warren R, Peock S, Brown J, Davies R, Mattison J, Cook M, Warsi I, Evans DG, Eccles D, Douglas F, Paterson J, Hodgson S, Izatt L, Cole T, Burgess L, Eeles R, Easton DF: Mammographic density and breast cancer risk in BRCA1 and BRCA2 mutation carriers. Cancer Res. 2006, 66 (3): 1866-1872.
10. Chang-Claude J, Andrieu N, Rookus M, Brohet R, Antoniou AC, Peock S, Davidson R, Izatt L, Cole T, Noguès C, Luporsi E, Huiart L, Hoogerbrugge N, Van Leeuwen FE, Osorio A, Eyfjord J, Radice P, Goldgar DE, Easton DF, Epidemiological Study of Familial Breast Cancer (EMBRACE): Age at menarche and menopause and breast cancer risk in the international BRCA1/2 carrier cohort study. Cancer Epidemiol Biomarkers Prev. 2007, 16 (4): 740-746.
11. Ferguson DJ, Anderson TJ: Morphological evaluation of cell turnover in relation to the menstrual cycle in the “resting” human breast. Br J Cancer. 1981, 44 (2): 177-181.
12. Pike MC, Spicer DV, Dahmoush L, Press MF: Estrogens, progestogens, normal breast cell proliferation, and breast cancer risk. Epidemiol Rev. 1993, 15 (1): 17-35.
13. Kotsopoulos J, Lubinski J, Lynch HT, Neuhausen SL, Ghadirian P, Isaacs C, Weber B, Kim-Sing C, Foulkes WD, Gershoni-Baruch R, Ainsworth P, Friedman E, Daly M, Garber JE, Karlan B, Olopade OI, Tung N, Saal HM, Eisen A, Osborne M, Olsson H, Gilchrist D, Sun P, Narod SA: Age at menarche and the risk of breast cancer in BRCA1 and BRCA2 mutation carriers. Cancer causes & control: CCC. 2005, 16 (6): 667-674.
14. Guinan EM, Hussey J, McGarrigle SA, Healy LA, O’Sullivan JN, Bennett K, Connolly EM: A prospective investigation of predictive and modifiable risk factors for breast cancer in unaffected BRCA1 and BRCA2 gene carriers. BMC Cancer. 2013, 13: 138.
15. Manders P, Pijpe A, Hooning MJ, Kluijt I, Vasen HF, Hoogerbrugge N, van Asperen CJ, Meijers-Heijboer H, Ausems MG, van Os TA, Gomez-Garcia EB, Brohet RM, HEBON, van Leeuwen FE, Rookus MA: Body weight and risk of breast cancer in BRCA1/2 mutation carriers. Breast Cancer Res Treat. 2011, 126 (1): 193-202.
16. Gorfine M, Hsu L, Parmigiani G: Frailty models for familial risk with application to breast cancer. J Am Stat Assoc. 2013, 108 (504): 1205-1215.
17. Meads C, Ahmed I, Riley RD: A systematic review of breast cancer incidence risk prediction models with meta-analysis of their performance. Breast Cancer Res Treat. 2012, 132 (2): 365-377.
18. Tyrer J, Duffy SW, Cuzick J: A breast cancer prediction model incorporating familial and personal risk factors. Stat Med. 2004, 23 (7): 1111-1130.
19. Amir E, Freedman OC, Seruga B, Evans DG: Assessing women at high risk of breast cancer: a review of risk assessment models. J Natl Cancer Inst. 2010, 102 (10): 680-691.
20. Lee AJ, Cunningham AP, Kuchenbaecker KB, Mavaddat N, Easton DF, Antoniou AC, Consortium of Investigators of Modifiers of B, Breast Cancer Association C: BOADICEA breast cancer risk prediction model: updates to cancer incidences, tumour pathology and web interface. Br J Cancer. 2014, 110 (2): 535-545.
21. Yang X, Ai X, Cunningham JM: Computational prognostic indicators for breast cancer. Cancer Manag Res. 2014, 6: 301-312.
22. Faradmal J, Soltanian AR, Roshanaei G, Khodabakhshi R, Kasaeian A: Comparison of the performance of log-logistic regression and artificial neural networks for predicting breast cancer relapse. Asian Pacific journal of cancer prevention: APJCP. 2014, 15 (14): 5883-5888.
23. Jahid MJ, Huang TH, Ruan J: A personalized committee classification approach to improving prediction of breast cancer metastasis. Bioinformatics. 2014, 30 (13): 1858-1866.
24. Kaplan EL, Meier P: Nonparametric-estimation from incomplete observations. J Am Stat Assoc. 1958, 53 (282): 457-481.
25. Cox DR: Regression models and life-tables. J Roy Stat Soc B. 1972, 34 (2): 187-+.
26. Grambsch PM, Therneau TM: Proportional hazards tests and diagnostics based on weighted residuals. Biometrika. 1994, 81 (3): 515-526.
27. Evans DGR, Lalloo F, Wallace A, Rahman N: Update on the Manchester scoring system for BRCA1 and BRCA2 testing. J Med Genet. 2005, 42 (7): e39.
28. Akaike H: New look at statistical-model identification. Ieee T Automat Contr. 1974, Ac19 (6): 716-723.
29. Venables WN, Ripley BD: Modern Applied Statistics With S-PLUS. 1999, New York: Springer, 3rd edition.
30. Ishwaran H, Kogalur UB, Blackstone EH, Lauer MS: Random survival forests. Ann Appl Stat. 2008, 2 (3): 841-860.
31. Harrell FE, Califf RM, Pryor DB, Lee KL, Rosati RA: Evaluating the yield of medical tests. Jama-J Am Med Assoc. 1982, 247 (18): 2543-2546.
32. Heagerty PJ, Zheng YY: Survival model predictive accuracy and ROC curves. Biometrics. 2005, 61 (1): 92-105.
33. Hastie T, Tibshirani R, Friedman JH: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2009, New York, NY: Springer, 2nd edition.
34. R Core Team: R: A Language and Environment for Statistical Computing. 2013, Vienna, Austria: R Foundation for Statistical Computing.
35. Ocana-Riola R, Mayoral-Cortes JM, Navarro-Moreno E: Age-period-cohort effect on female breast cancer mortality in Southern Spain. Med Oncol. 2013, 30 (3): 671.
36. Valachis A, Nearchou AD, Lind P: Surgical management of breast cancer in BRCA-mutation carriers: a systematic review and meta-analysis. Breast Cancer Res Treat. 2014, 144 (3): 443-455.
37. Pencina MJ, D’Agostino RB: Overall C as a measure of discrimination in survival analysis: model specific population value and confidence interval estimation. Stat Med. 2004, 23 (13): 2109-2123.
38. Taktak AFG, Eleuteri A, Lake SP, Fisher AC: A web-based tool for the assessment of discrimination and calibration properties of prognostic models. Comput Biol Med. 2008, 38 (7): 785-791.
39. Gonen M, Heller G: Concordance probability and discriminatory power in proportional hazards regression. Biometrika. 2005, 92 (4): 965-970.
40. Mogensen UB, Ishwaran H, Gerds TA: Evaluating random forests for survival analysis using prediction error curves. J Stat Softw. 2012, 50 (11): 1-23.

Pre-publication history

The pre-publication history for this paper can be accessed here: http://www.biomedcentral.com/1472-6947/14/87/prepub

Acknowledgments

This study was funded by an NIHR programme grant (RP-PG-0707-10031). SNP testing was funded through the Genesis breast cancer prevention appeal. DGE receives support through the NIHR Manchester BRC and is an NIHR Clinical Senior Investigator. We also acknowledge the University of Manchester’s Health eResearch Centre (HeRC) funded by the MRC grant MR/K006665/1. The study sponsor(s) had no role in study design; collection, analysis and interpretation of data; writing of the report or in the decision to submit for publication.

Author information

Authors and Affiliations

Institute of Population Health, Centre for Health Informatics, University of Manchester, Manchester, UK
Mattia CF Prosperi, Sarah L Ingham & Iain E Buchan
Genesis Prevention Centre, University Hospital of South Manchester, Manchester, UK
Anthony Howell & Dafydd Gareth Evans
Department of Genetic Medicine, Manchester Academic Health Science Centre, St. Mary’s Hospital, University of Manchester, Manchester, UK
Fiona Lalloo & Dafydd Gareth Evans

Corresponding author

Correspondence to Mattia CF Prosperi.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

MCFP statistical modelling, manuscript writing; SI data preparation, statistical analysis, manuscript writing; IEB statistical review, manuscript review; DGE patient identification, statistical review, manuscript writing; AH and FL patient identification, manuscript writing. All authors read and approved the final manuscript.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

About this article

Cite this article

Prosperi, M.C., Ingham, S.L., Howell, A. et al. Can multiple SNP testing in BRCA2 and BRCA1 female carriers be used to improve risk prediction models in conjunction with clinical assessment?. BMC Med Inform Decis Mak 14, 87 (2014). https://doi.org/10.1186/1472-6947-14-87

Received: 03 June 2014
Accepted: 25 September 2014
Published: 01 October 2014
DOI: https://doi.org/10.1186/1472-6947-14-87
