TY - JOUR AU - Linder, J. A. AU - Haas, J. S. AU - Iyer, A. AU - Labuzetta, M. A. AU - Ibara, M. AU - Celeste, M. AU - Getty, G. AU - Bates, D. W. PY - 2010 DA - 2010// TI - Secondary use of electronic health record data: spontaneous triggered adverse drug event reporting JO - Pharmacoepidemiol Drug Saf VL - 19 UR - https://doi.org/10.1002/pds.2027 DO - 10.1002/pds.2027 ID - Linder2010 ER - TY - JOUR AU - Norén, G. N. AU - Hopstadius, J. AU - Bate, A. AU - Star, K. AU - Edwards, I. R. PY - 2009 DA - 2009// TI - Temporal pattern discovery in longitudinal electronic patient records JO - Data Min Knowl Discov VL - 20 UR - https://doi.org/10.1007/s10618-009-0152-3 DO - 10.1007/s10618-009-0152-3 ID - Norén2009 ER - TY - JOUR AU - Boockvar, K. S. AU - Livote, E. E. AU - Goldstein, N. AU - Nebeker, J. R. AU - Siu, A. AU - Fried, T. PY - 2010 DA - 2010// TI - Electronic health records and adverse drug events after patient transfer JO - Qual Saf Health Care VL - 19 ID - Boockvar2010 ER - TY - JOUR AU - Hurdle, J. F. AU - Haroldsen, S. C. AU - Hammer, A. AU - Spigle, C. AU - Fraser, A. M. AU - Mineau, G. P. AU - Courdy, S. J. PY - 2013 DA - 2013// TI - Identifying clinical/translational research cohorts: ascertainment via querying an integrated multi-source database JO - J Am Med Inform Assoc VL - 20 UR - https://doi.org/10.1136/amiajnl-2012-001050 DO - 10.1136/amiajnl-2012-001050 ID - Hurdle2013 ER - TY - JOUR AU - Vlug, A. AU - Van der Lei, J. AU - Mosseveld, B. AU - Van Wijk, M. AU - Van der Linden, P. AU - MC, S. AU - Van Bemmel, J. PY - 1999 DA - 1999// TI - Postmarketing surveillance based on electronic patient records: the IPCI project JO - Methods Inf Med VL - 38 ID - Vlug1999 ER - TY - JOUR AU - Liao, K. P. AU - Cai, T. AU - Gainer, V. AU - Goryachev, S. AU - Zeng-treitler, Q. AU - Raychaudhuri, S. AU - Szolovits, P. AU - Churchill, S. AU - Murphy, S. AU - Kohane, I. AU - Karlson, E. W. AU - Plenge, R. M. PY - 2010 DA - 2010// TI - Electronic medical records for discovery research in rheumatoid arthritis JO - Arthritis Care Res (Hoboken) VL - 62 UR - https://doi.org/10.1002/acr.20184 DO - 10.1002/acr.20184 ID - Liao2010 ER - TY - JOUR AU - Stanfill, M. H. AU - Williams, M. AU - Fenton, S. H. AU - Jenders, R. A. AU - Hersh, W. R. PY - 2010 DA - 2010// TI - A systematic literature review of automated clinical coding and classification systems JO - J Am Med Inform Assoc VL - 17 UR - https://doi.org/10.1136/jamia.2009.001024 DO - 10.1136/jamia.2009.001024 ID - Stanfill2010 ER - TY - JOUR AU - Chung, C. P. AU - Murray, K. T. AU - Stein, C. M. AU - Hall, K. AU - Ray, W. A. PY - 2010 DA - 2010// TI - A computer case definition for sudden cardiac death JO - Pharmacoepidemiol Drug Saf VL - 19 UR - https://doi.org/10.1002/pds.1888 DO - 10.1002/pds.1888 ID - Chung2010 ER - TY - JOUR AU - Cunningham, A. AU - Stein, C. M. AU - Chung, C. P. AU - Daugherty, J. R. AU - Smalley, W. E. AU - Ray, W. A. PY - 2011 DA - 2011// TI - An automated database case definition for serious bleeding related to oral anticoagulant use JO - Pharmacoepidemiol Drug Saf VL - 20 UR - https://doi.org/10.1002/pds.2109 DO - 10.1002/pds.2109 ID - Cunningham2011 ER - TY - JOUR AU - Singh, J. A. AU - Holmgren, A. R. AU - Noorbaloochi, S. PY - 2004 DA - 2004// TI - Accuracy of Veterans Administration databases for a diagnosis of rheumatoid arthritis JO - Arthritis Rheum VL - 51 UR - https://doi.org/10.1002/art.20827 DO - 10.1002/art.20827 ID - Singh2004 ER - TY - JOUR AU - Nicholson, A. AU - Tate, A. R. AU - Koeling, R. AU - Cassell, J. A. PY - 2011 DA - 2011// TI - What does validation of cases in electronic record databases mean? The potential contribution of free text JO - Pharmacoepidemiol Drug Saf VL - 20 UR - https://doi.org/10.1002/pds.2086 DO - 10.1002/pds.2086 ID - Nicholson2011 ER - TY - JOUR AU - Meystre, S. M. AU - Savova, G. K. AU - Kipper-Schuler, K. C. AU - Hurdle, J. F. PY - 2008 DA - 2008// TI - Extracting information from textual documents in the electronic health record: a review of recent research JO - Yearb Med Inform VL - 47 ID - Meystre2008 ER - TY - JOUR AU - Perlis, R. H. AU - Iosifescu, D. V. AU - Castro, V. M. AU - Murphy, S. N. AU - Gainer, V. S. AU - Minnier, J. AU - Cai, T. AU - Goryachev, S. AU - Zeng, Q. AU - Gallagher, P. J. AU - Fava, M. AU - Weilburg, J. B. AU - Churchill, S. E. AU - Kohane, I. S. AU - Smoller, J. W. PY - 2012 DA - 2012// TI - Using electronic medical records to enable large-scale studies in psychiatry: treatment resistant depression as a model JO - Psychol Med VL - 42 UR - https://doi.org/10.1017/S0033291711000997 DO - 10.1017/S0033291711000997 ID - Perlis2012 ER - TY - CHAP AU - Elkin, P. L. AU - Froehling, D. AU - Wahner-Roedler, D. AU - Trusko, B. AU - Welsh, G. AU - Ma, H. AU - Asatryan, A. X. AU - Tokars, J. I. AU - Rosenbloom, S. T. AU - Brown, S. H. PY - 2008 DA - 2008// TI - AMIA Annu Symp Proc BT - NLP-based identification of pneumonia cases from free-text radiological reports ID - Elkin2008 ER - TY - JOUR AU - Savova, G. K. AU - Fan, J. AU - Ye, Z. AU - Murphy, S. P. AU - Zheng, J. AU - Chute, C. G. AU - Kullo, I. J. PY - 2010 DA - 2010// TI - Discovering peripheral arterial disease cases from radiology notes using natural language processing division of biomedical statistics and informatics, 2 division of cardiovascular diseases JO - AMIA Annu Symp Proc VL - 2010 ID - Savova2010 ER - TY - JOUR AU - Pakhomov, S. AU - Weston, S. A. AU - Jacobsen, S. J. AU - Chute, C. G. AU - Meverden, R. AU - Roger, V. L. PY - 2007 DA - 2007// TI - Electronic medical records for clinical research: application to the identification of heart failure JO - Am J Manag Care VL - 13 ID - Pakhomov2007 ER - TY - JOUR AU - Friedlin, J. AU - Overhage, M. AU - Al-Haddad, M. a. AU - Waters, J. a. AU - Aguilar-Saavedra, J. J. R. AU - Kesterson, J. AU - Schmidt, M. PY - 2010 DA - 2010// TI - Comparing methods for identifying pancreatic cancer patients using electronic data sources JO - AMIA Annu Symp Proc VL - 2010 ID - Friedlin2010 ER - TY - JOUR AU - Roque, F. S. AU - Jensen, P. B. AU - Schmock, H. AU - Dalgaard, M. AU - Andreatta, M. AU - Hansen, T. AU - Søeby, K. AU - Bredkjær, S. AU - Juul, A. AU - Werge, T. AU - Jensen, L. J. AU - Brunak, S. PY - 2011 DA - 2011// TI - Using electronic patient records to discover disease correlations and stratify patient cohorts JO - PLoS Comput Biol VL - 7 UR - https://doi.org/10.1371/journal.pcbi.1002141 DO - 10.1371/journal.pcbi.1002141 ID - Roque2011 ER - TY - JOUR AU - Farkas, R. AU - Szarvas, G. PY - 2008 DA - 2008// TI - Automatic construction of rule-based ICD-9-CM coding systems JO - BMC Bioinforma VL - 9 UR - https://doi.org/10.1186/1471-2105-9-S3-S10 DO - 10.1186/1471-2105-9-S3-S10 ID - Farkas2008 ER - TY - JOUR AU - Persell, S. D. AU - Dunne, A. P. AU - Lloyd-Jones, D. M. AU - Baker, D. W. PY - 2009 DA - 2009// TI - Electronic health record-based cardiac risk assessment and identification of unmet preventive needs JO - Med Care VL - 47 UR - https://doi.org/10.1097/MLR.0b013e31818dce21 DO - 10.1097/MLR.0b013e31818dce21 ID - Persell2009 ER - TY - JOUR AU - Wang, Z. AU - Shah, A. D. AU - Tate, A. R. AU - Denaxas, S. AU - Shawe-Taylor, J. AU - Hemingway, H. PY - 2012 DA - 2012// TI - Extracting diagnoses and investigation results from unstructured text in electronic health records by semi-supervised machine learning JO - PLoS One VL - 7 UR - https://doi.org/10.1371/journal.pone.0030412 DO - 10.1371/journal.pone.0030412 ID - Wang2012 ER - TY - JOUR AU - Savova, G. K. AU - Ogren, P. V. AU - Duffy, P. H. AU - Buntrock, J. D. AU - Chute, C. G. PY - 2008 DA - 2008// TI - Mayo clinic NLP system for patient smoking status identification JO - J Am Med Inform Assoc VL - 15 UR - https://doi.org/10.1197/jamia.M2437 DO - 10.1197/jamia.M2437 ID - Savova2008 ER - TY - JOUR AU - Clark, C. AU - Good, K. AU - Jezierny, L. AU - Macpherson, M. AU - Wilson, B. AU - Chajewska, U. PY - 2007 DA - 2007// TI - Identifying smokers with a medical extraction system JO - J Am Med Inform Assoc VL - 15 UR - https://doi.org/10.1197/jamia.M2442 DO - 10.1197/jamia.M2442 ID - Clark2007 ER - TY - JOUR AU - Schuemie, M. J. AU - Sen, E. AU - ‘t Jong, G. W. AU - van Soest, E. M. AU - Sturkenboom, M. C. AU - Kors, J. A. PY - 2012 DA - 2012// TI - Automating classification of free-text electronic health records for epidemiological studies. JO - Pharmacoepidemiol Drug Saf VL - 21 UR - https://doi.org/10.1002/pds.3205 DO - 10.1002/pds.3205 ID - Schuemie2012 ER - TY - JOUR AU - Garcia, E. A. PY - 2009 DA - 2009// TI - Learning from imbalanced data JO - IEEE Trans Knowl Data Eng VL - 21 UR - https://doi.org/10.1109/TKDE.2008.239 DO - 10.1109/TKDE.2008.239 ID - Garcia2009 ER - TY - JOUR AU - Mease, D. AU - Wyner, A. J. PY - 2007 DA - 2007// TI - Boosted classification trees and class probability / quantile estimation JO - J Mach Learn Res VL - 8 ID - Mease2007 ER - TY - JOUR AU - Taft, L. M. AU - Evans, R. S. AU - Shyu, C. R. AU - Egger, M. J. AU - Chawla, N. AU - Mitchell, J. A. AU - Thornton, S. N. AU - Bray, B. AU - Varner, M. PY - 2009 DA - 2009// TI - Countering imbalanced datasets to improve adverse drug event predictive models in labor and delivery JO - J Biomed Inform VL - 42 UR - https://doi.org/10.1016/j.jbi.2008.09.001 DO - 10.1016/j.jbi.2008.09.001 ID - Taft2009 ER - TY - CHAP AU - Van Hulse, J. AU - Khoshgoftaar, T. M. AU - Napolitano, A. PY - 2009 DA - 2009// TI - An empirical comparison of repetitive undersampling techniques BT - 2009 IEEE International Conference on Information Reuse & Integration UR - https://doi.org/10.1109/IRI.2009.5211614 DO - 10.1109/IRI.2009.5211614 ID - Van Hulse2009 ER - TY - CHAP AU - Chawla, N. V. ED - Maimon, O. ED - Rokach, L. PY - 2010 DA - 2010// TI - Data Mining for Imbalanced Datasets: An Overview BT - Data Mining and Knowledge Discovery Handbook PB - Springer US CY - Boston, MA ID - Chawla2010 ER - TY - CHAP AU - Van Hulse, J. AU - Khoshgoftaar, T. M. AU - Napolitano, A. PY - 2007 DA - 2007// TI - Experimental perspectives on learning from imbalanced data BT - Proceedings of the 24th international conference on Machine learning - ICML ’07 PB - ACM Press CY - New York, New York, USA UR - https://doi.org/10.1145/1273496.1273614 DO - 10.1145/1273496.1273614 ID - Van Hulse2007 ER - TY - JOUR AU - Chawla, N. V. AU - Bowyer, K. W. AU - Hall, L. O. AU - Kegelmeyer, W. P. PY - 2002 DA - 2002// TI - SMOTE: synthetic minority over-sampling technique JO - Artif Intell VL - 16 ID - Chawla2002 ER - TY - CHAP AU - Drummond, C. AU - Holte, R. C. PY - 2003 DA - 2003// TI - C4.5, Class Imbalance, and Cost Sensitivity: Why Under-Sampling beats Over-Sampling BT - Workshop on Learning from Imbalanced Data Sets II (ICML 2003) ID - Drummond2003 ER - TY - CHAP AU - Japkowicz, N. PY - 2000 DA - 2000// TI - The Class Imbalance Problem: Significance and Strategies BT - Proceedings of the 2000 International Conference on Artificial Intelligence (ICAI) ID - Japkowicz2000 ER - TY - BOOK AU - Ling, C. X. AU - Sheng, V. S. PY - 2011 DA - 2011// TI - Cost-Sensitive Learning and the Class Imbalance Problem PB - Springer CY - In Encyclopedia of Machine Learning ID - Ling2011 ER - TY - JOUR AU - Wang, T. AU - Qin, Z. AU - Zhang, S. AU - Zhang, C. PY - 2012 DA - 2012// TI - Cost-sensitive classification with inadequate labeled data JO - Inf Syst VL - 37 UR - https://doi.org/10.1016/j.is.2011.10.009 DO - 10.1016/j.is.2011.10.009 ID - Wang2012 ER - TY - JOUR AU - Japkowicz, N. AU - Stephen, S. PY - 2002 DA - 2002// TI - The class imbalance problem: a systematic study JO - Intell Data Anal VL - 6 ID - Japkowicz2002 ER - TY - JOUR AU - Sun, Y. AU - Kamel, M. AU - Wong, A. AU - Wang, Y. PY - 2007 DA - 2007// TI - Cost-sensitive boosting for classification of imbalanced data JO - Pattern Recognit VL - 40 UR - https://doi.org/10.1016/j.patcog.2007.04.009 DO - 10.1016/j.patcog.2007.04.009 ID - Sun2007 ER - TY - JOUR AU - Zhou, Z. AU - Member, S. AU - Liu, X. PY - 2006 DA - 2006// TI - Training cost-sensitive neural networks with methods addressing the class imbalance problem JO - IEEE Trans Knowl Data Eng VL - 18 UR - https://doi.org/10.1109/TKDE.2006.17 DO - 10.1109/TKDE.2006.17 ID - Zhou2006 ER - TY - CHAP AU - McCarthy, K. AU - Zabar, B. AU - Weiss, G. PY - 2005 DA - 2005// TI - Does cost-sensitive learning beat sampling for classifying rare classes? BT - Proceedings of the 1st international workshop on Utility-based data mining - UBDM ’05 PB - ACM Press CY - New York, New York, USA UR - https://doi.org/10.1145/1089827.1089836 DO - 10.1145/1089827.1089836 ID - McCarthy2005 ER - TY - CHAP AU - Liu, X. AU - Zhou, Z. PY - 2006 DA - 2006// TI - The Influence of Class Imbalance on Cost-Sensitive Learning: An Empirical Study BT - Sixth International Conference on Data Mining (ICDM’06) UR - https://doi.org/10.1109/ICDM.2006.158 DO - 10.1109/ICDM.2006.158 ID - Liu2006 ER - TY - JOUR AU - Cohen, J. PY - 1960 DA - 1960// TI - A coefficient of agreement for nominal scales JO - Educ Psychol Meas VL - 20 UR - https://doi.org/10.1177/001316446002000104 DO - 10.1177/001316446002000104 ID - Cohen1960 ER - TY - JOUR AU - Chapman, W. W. AU - Bridewell, W. AU - Hanbury, P. AU - Cooper, G. F. AU - Buchanan, B. G. PY - 2001 DA - 2001// TI - A simple algorithm for identifying negated findings and diseases in discharge summaries JO - J Biomed Inform VL - 34 UR - https://doi.org/10.1006/jbin.2001.1029 DO - 10.1006/jbin.2001.1029 ID - Chapman2001 ER - TY - CHAP AU - Setiono, R. AU - Liu, H. PY - 1995 DA - 1995// TI - Chi2: feature selection and discretization of numeric attributes BT - Proceedings of 7th IEEE International Conference on Tools with Artificial Intelligence ID - Setiono1995 ER - TY - JOUR AU - Adler, W. AU - Brenning, A. AU - Potapov, S. AU - Schmid, M. AU - Lausen, B. PY - 2011 DA - 2011// TI - Ensemble classification of paired data JO - Comput Stat Data Anal VL - 55 UR - https://doi.org/10.1016/j.csda.2010.11.017 DO - 10.1016/j.csda.2010.11.017 ID - Adler2011 ER - TY - CHAP AU - Sun, Y. AU - Kamel, M. AU - Wang, Y. PY - 2006 DA - 2006// TI - Boosting for Learning Multiple Classes with Imbalanced Class Distribution BT - Sixth International Conference on Data Mining (ICDM’06) PB - IEEE Computer Society CY - Washington, DC, USA UR - https://doi.org/10.1109/ICDM.2006.29 DO - 10.1109/ICDM.2006.29 ID - Sun2006 ER - TY - CHAP AU - Akbani, R. AU - Kwek, S. AU - Japkowicz, N. PY - 2004 DA - 2004// TI - Applying Support Vector Machines to Imbalanced Datasets BT - In Proceedings of the 15th European Conference on Machine Learning (ECML} ID - Akbani2004 ER - TY - CHAP AU - Chen, C. AU - Liaw, A. AU - Breiman, L. PY - 2004 DA - 2004// TI - Discovery BT - Using Random Forest to Learn Imbalanced Data ID - Chen2004 ER - TY - CHAP AU - Domingos, P. PY - 1999 DA - 1999// TI - Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining - KDD ’99 BT - MetaCost: A General Method for Making Classifiers Cost-Sensitive PB - ACM Press CY - New York, New York, USA ID - Domingos1999 ER - TY - CHAP AU - Hall, M. AU - Frank, E. AU - Holmes, G. AU - Pfahringer, B. AU - Reutemann, P. AU - Witten, I. H. PY - 2009 DA - 2009// TI - The WEKA data mining software BT - ACM SIGKDD Explorations Newsletter, Volume 11 ID - Hall2009 ER - TY - JOUR AU - Salzberg, S. L. PY - 1994 DA - 1994// TI - C4.5: Programs for Machine Learning by J. Ross Quinlan. Morgan Kaufmann Publishers, Inc., 1993 JO - Mach Learn VL - 16 ID - Salzberg1994 ER - TY - CHAP AU - Chang, C. -. C. AU - Lin, C. -. J. PY - 2011 DA - 2011// TI - LIBSVM: a library for support vector machines BT - ACM Transactions on Intelligent Systems and Technology, Volume 2 ID - Chang2011 ER - TY - JOUR AU - Hsu, C. AU - Chang, C. AU - Lin, C. PY - 2010 DA - 2010// TI - A practical guide to support vector classification JO - Bioinformatics VL - 1 ID - Hsu2010 ER - TY - CHAP AU - Cohen, W. W. ED - Prieditis, A. ED - Morgan Kaufmann, R. S. PY - 1995 DA - 1995// TI - Fast Effective Rule Induction BT - Proceedings of the Twelfth International Conference on Machine Learning ID - Cohen1995 ER - TY - JOUR AU - Quinlan, J. R. PY - 1986 DA - 1986// TI - Induction of decision trees JO - Mach Learn VL - 1 ID - Quinlan1986 ER - TY - JOUR AU - Weiss, G. M. AU - Provost, F. PY - 2003 DA - 2003// TI - Learning when training data are costly: the effect of class distribution on tree induction JO - J Artif Intell Res VL - 19 ID - Weiss2003 ER - TY - CHAP AU - Chan, P. K. AU - Stolfo, S. J. PY - 1998 DA - 1998// TI - Toward Scalable Learning with Non-uniform Class and Cost Distributions: A Case Study in Credit Card Fraud Detection BT - Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, Volume 164 ID - Chan1998 ER - TY - CHAP AU - Ruch, P. AU - Baud, R. H. AU - Geiddbühler, A. AU - Lovis, C. AU - Rassinoux, A. M. AU - Rivière, A. PY - 2001 DA - 2001// TI - Proc AMIA Symp BT - Looking back or looking all around: comparing two spell checking strategies for documents edition in an electronic patient record ID - Ruch2001 ER - TY - JOUR AU - Harkema, H. AU - Dowling, J. N. AU - Thornblade, T. AU - Chapman, W. W. PY - 2009 DA - 2009// TI - ConText: an algorithm for determining negation, experiencer, and temporal status from clinical reports JO - J Biomed Inform VL - 42 UR - https://doi.org/10.1016/j.jbi.2009.05.002 DO - 10.1016/j.jbi.2009.05.002 ID - Harkema2009 ER - TY - CHAP AU - Apostolova, E. AU - Tomuro, N. AU - Demner-fushman, D. PY - 2011 DA - 2011// TI - Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2 BT - Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes PB - Association for Computational Linguistics CY - Portland, Oregon, USA ID - Apostolova2011 ER - TY - STD TI - The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1472-6947/13/30/prepub UR - http://www.biomedcentral.com/1472-6947/13/30/prepub ID - ref60 ER -