SNOMED CT – advances in concept mapping, retrieval, and ontological foundations. Selected contributions to the Semantic Mining Conference on SNOMED CT (SMCS 2006)
© Schulz and Klein; licensee BioMed Central Ltd. 2008
Published: 27 October 2008
Medical science and health care both possess a strong tradition of structuring their terminological knowledge through controlled vocabularies, such as nomenclatures, thesauri, or classification systems [1–6]. The availability of such resources mirrors the need for sharing a common understanding of the employed terms and the semantic relations holding between them. Community-wide efforts such as the Unified Medical Language System (UMLS) have reached a level of both coverage and depth today that is unmatched within many if not most other scientific disciplines.
The reason behind investing such a tremendous effort into those terminological systems is evident: They are intended to provide support for a wide range of different tasks, such as unambiguous, focused access to patient data for retrieval and decision support, the comparison of clinical cases, the retrieval of similar cases, clinical epidemiology and research, billing and accounting activities, the generation of health statistics from relevant and valid data, semantic interoperability between several (thematically overlapping) data sources, the quality management of medical services, as well as the structuring of scientific literature repositories and experimental databases.
So, the question may be raised whether the current biomedical terminological resources are already sufficient to properly serve the expressed needs. Basically, all terminology systems in current routine use rest on informal specifications. Their semantics are essentially rooted in a human understanding of natural language and implicit assumptions about the taxonomic, partonomic and other unspecified relations between terms. Interpreting such relations in light of a particular search, a decision support problem or, even more challenging, the drawing of ad hoc inferences often leads to strange and erroneous results. This is usually due to a lack of any rigid, formal semantics and ontological foundation underlying the respective terminological systems, an issue that has been increasingly addressed in recent years.
Whereas current biomedical terminology systems such as ICD, ICF, LOINC, CMPU, ICNP and many others are highly focused on quite well-delimited tasks and subdomains, the vision of a universal clinical terminology, covering a broad range of health-related domains and meeting the needs of all health professionals, has stimulated numerous health informatics research activities.
Is this vision now being materialized by the new health terminology SNOMED CT? 
During the last two decades, SNOMED (Systematized Nomenclature of Medicine) has grown from a pathology-centered vocabulary into a comprehensive, structured clinical terminology. Even though SNOMED CT is still rooted in a strong legacy, it is increasingly subscribing to principles of logic and ontology. This fact, together with its impressive number of terms in most areas of medicine and health care, has led to growing international interest. Nine countries have so far joined the International Health Terminology Standards Development Organization (IHTSDO), a non-profit association founded in 2007 and tasked with the development, quality assurance and distribution of SNOMED CT.
However, there are still only very few prototypical implementations of SNOMED CT in clinical settings, the feasibility of such a comprehensive terminology as the basis for the whole health delivery process is still subject to discussion, and several shortcomings, regarding both SNOMED CT architecture and content, still persist.
The papers in this supplement of BMC Medical Informatics and Decision Making are extended and updated contributions to the Semantic Mining Conference on SNOMED CT (SMCS 2006), organized by the European Union Network of Excellence "Semantic Interoperability and Data Mining in Biomedicine" in October 2006.
It was the first European forum on SNOMED CT for health policy makers, clinicians, nurses, system developers, computer scientists, terminologists and translators. A number of prominent invited speakers provided overviews of the current efforts and developments in the context of SNOMED CT and many scientific contributions illuminated ongoing research on SNOMED CT. Out of the 22 scientific papers and posters published in the proceedings of this conference, eight were selected for this special issue by the program committee due to their scientific excellence. This selection mirrors the different research strands on SNOMED CT and represents a broad range of countries, viz., The Netherlands, USA, France, Australia, Switzerland, Sweden, UK, Hungary, and Germany. We want to thank the reviewers for their in-depth work in reviewing first the conference submissions and then again the selected contributions. The following contributions are included in this special issue:
In their article Forty years of SNOMED: a literature review, Ronald Cornet and Nicolette de Keizer provide an overview of studies on SNOMED published over a period of 40 years. They found that most studies concern SNOMED in theory and only a minority provide an account of the use of SNOMED in practice. (This tendency is also clear among the papers in this special issue.)
A major challenge for the adoption of a worldwide terminology is the continued use of legacy terminologies tailored and optimized to meet specific coding and documentation requirements. A seamless migration from proprietary solutions to a common standard therefore requires high-quality rules for manual cross-terminology mapping, as described by Geraldine Wade and Trent Rosenbloom in their paper Experiences Mapping a Legacy Interface Terminology to SNOMED CT, which emphasizes the value of discoveries resulting from this mapping as important contributions to the refinement of SNOMED CT.
The relation between SNOMED CT and a legacy terminology is also addressed by Iulian Alecu, Cedric Bousquet, and Marie-Christine Jaulent in their paper A case report: Using SNOMED CT for grouping Adverse Drug Reactions Terms. The authors provide evidence that the logical structure of SNOMED CT can be employed to improve term grouping and retrieval in the WHO Adverse Reaction Terminology, important for clinical trials and medical care.
Another mapping experience is reported by Yefeng Wang, Jon Patrick, Graeme Miller, and Julie O'Hallaran in A Computational Linguistics Motivated Mapping of ICPC-2 PLUS to SNOMED CT. The authors compare different terminology mapping approaches including language engineering methods and also address the problem arising from the fact that one source concept maps to the coordination of two target concepts.
Computational linguistics approaches are also employed by Patrick Ruch, Julien Gobeill, Christian Lovis, and Antoine Geissbühler. In their contribution Automatic Medical Encoding with SNOMED Categories they present two information retrieval approaches that address both the retrieval of SNOMED CT concepts and the automated encoding of free text and report on a first evaluation of a prototype.
The encoding of clinical data using information models and archetypes is addressed by Erik Sundvall, Rahil Qamar, Mikael Nyström, Mattias Forss, Håkan Petersson, Hans Åhlfeldt, and Alan Rector. In their paper Integration of Tools for Binding Archetypes to SNOMED CT they present an approach that supports this task. They also discuss the yet unresolved problems of binding clinical information models to SNOMED CT and the control of post-coordination of concepts.
Two further contributions address the ontological foundations of SNOMED CT. In the first one, entitled Ontological analysis of SNOMED CT, Gergely Héja, György Surján, and Péter Varga perform an analysis of the structure of SNOMED CT based on the formal top-level ontology DOLCE. They present a typology of errors occurring when the SNOMED CT hierarchies are submitted to formal ontological scrutiny and provide suggestions of how to avoid these errors.
In the second one, Formal Representation of Complex SNOMED CT Expressions, Stefan Schulz, Kornél Markó, and Boontawee Suntisrivaraporn focus on the representation of complex events and procedures, highlight the limited expressiveness of SNOMED CT regarding negations and formally describe the ambiguities in the representation of complex concepts.
Due to the recently facilitated access to the SNOMED CT sources, research activities are increasing all over the world and the interchange between the SNOMED CT community and the academic world is strengthening. Evidence for this is given by the recent MEDINFO, MIE and AMIA conferences as well as by the recent AMIA KR-MED 2008 conference on "Representing and Sharing Knowledge using SNOMED CT".
We hope that you will enjoy this special issue and that it helps you keep track of some of the fascinating research in applied terminology, ontology and clinical terminologies.
The publishing of this supplement was supported by the EU Network of Excellence Semantic Interoperability and Data Mining in Biomedicine (NoE 507505).
This article has been published as part of BMC Medical Informatics and Decision Making Volume 8 Supplement 1, 2008: Selected contributions to the First European Conference on SNOMED CT. The full contents of the supplement are available online at http://www.biomedcentral.com/1472-6947/8?issue=S1.
- Côté Roger, Rothwell David, Beckett Ronald, Palotay James, Brochu Louise: The Systematised Nomenclature of Medicine: SNOMED International. 1993, Northfield, IL: College of American Pathologists.
- de Keizer F Nicolette, Abu-Hanna Ameen, Zwetsloot-Schonk JH: Understanding terminological systems. I: Terminology and typology. Methods of Information in Medicine. 2000, 39 (1): 16-21.
- ICD-10: International Statistical Classification of Diseases and Health Related Problems. 10th Revision. 1992, Geneva: World Health Organization.
- Rector Alan: Clinical terminology: Why is it so hard? Methods of Information in Medicine. 1999, 38: 239-252.
- MESH: Medical Subject Headings. 2008, Bethesda, MD: National Library of Medicine.
- Stenzhorn Holger, Boeker Martin, Schulz Stefan, Smith Barry: Adapting clinical ontologies in real-world environments. Journal of Universal Computer Science. 2008.
- UMLS: Unified Medical Language System. 2008, Bethesda, MD: National Library of Medicine.
- SNOMED: Clinical Terms. 2008, Copenhagen, Denmark: International Health Terminology Standards Development Organisation (IHTSDO), [http://www.ihtsdo.org]
- Schulz Stefan, Suntisrivaraporn Boontawee, Baader Franz: SNOMED CT's problem list: Ontologists' and logicians' therapy suggestions. MEDINFO 2007 – Proceedings of the 12th World Congress on Medical Informatics. Studies in Health Technology and Informatics. Edited by: Klaus A Kuhn, James R Warren, Tze-Yun Leong. 2007, San Francisco, CA, USA, Amsterdam: IOS Press, 2 (129): 802-806. September 7–11, 2004.
- Cornet Ronald, de Keizer Nicolette: Forty years of SNOMED: a literature review. BMC Medical Informatics and Decision Making. 2008, 8 (Suppl 1): S2.
- Wade Geraldine, Rosenbloom Trent: Experiences mapping a legacy interface terminology to SNOMED CT. BMC Medical Informatics and Decision Making. 2008, 8 (Suppl 1): S3.
- Alecu Iulian, Bousquet Cedric, Jaulent Marie-Christine: A case report: Using SNOMED CT for grouping adverse drug reactions terms. BMC Medical Informatics and Decision Making. 2008, 8 (Suppl 1): S4.
- Wang Yefeng, Patrick Jon, Miller Graeme, O'Hallaran J: A computational linguistics motivated mapping of ICPC-2 PLUS to SNOMED CT. BMC Medical Informatics and Decision Making. 2008, 8 (Suppl 1): S5.
- Ruch Patrick, Gobeill Julien, Lovis Christian, Geissbühler Antoine: Automatic medical encoding with SNOMED categories. BMC Medical Informatics and Decision Making. 2008, 8 (Suppl 1): S6.
- Sundvall Erik, Qamar Rahil, Nyström Mikael, Forss Mattias, Petersson Håkan, Åhlfeldt Hans, Rector Alan: Integration of tools for binding archetypes to SNOMED CT. BMC Medical Informatics and Decision Making. 2008, 8 (Suppl 1): S7.
- Ceusters Werner, Smith Barry, Kumar Anand, Dhaen Christoffel: Ontology-based error detection in SNOMED CT. MEDINFO 2004 – Proceedings of the 11th World Congress on Medical Informatics. Studies in Health Technology and Informatics. Edited by: Marius Fieschi, Enrico Coiera, Yu-Chan Jack Li. 2004, San Francisco, CA, USA, Amsterdam: IOS Press, 1 (107): 482-486. September 7–11, 2004.
- Cimino James, Zhu Xinxin: The practical impact of ontologies on biomedical informatics. IMIA Yearbook of Medical Informatics. 2006, Stuttgart: Schattauer, 124-135.
- Spackman Kent, Reynoso Guillermo: Examining SNOMED from the perspective of formal ontological principles: Some preliminary analysis and observations. KR-MED 2004 – Proceedings of the 1st International Workshop on Formal Biomedical Knowledge Representation, Collocated with the 9th International Conference on the Principles of Knowledge Representation and Reasoning (KR 2004), Whistler, B.C., Canada, June 1, 2004. Edited by: Udo Hahn, Stefan Schulz, Ronald Cornet. 2004, Bethesda, MD: American Medical Informatics Association (AMIA), 72-80. [http://CEUR-WS.org/Vol-102/]
- Héja Gergely, Surján György, Varga Péter: Ontological analysis of SNOMED CT. BMC Medical Informatics and Decision Making. 2008, 8 (Suppl 1): S8.
- Schulz Stefan, Markó Kornél, Suntisrivaraporn Boontawee: Formal representation of complex SNOMED CT expressions. BMC Medical Informatics and Decision Making. 2008, 8 (Suppl 1): S9.
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.