TY - JOUR AU - Bosch-Capblanch, Xavier PY - 2011 DA - 2011/05/19 TI - Harmonisation of variables names prior to conducting statistical analyses with multiple datasets: an automated approach JO - BMC Medical Informatics and Decision Making SP - 33 VL - 11 IS - 1 AB - Data requirements by governments, donors and the international community to measure health and development achievements have increased in the last decade. Datasets produced in surveys conducted in several countries and years are often combined to analyse time trends and geographical patterns of demographic and health related indicators. However, since not all datasets have the same structure, variables definitions and codes, they have to be harmonised prior to submitting them to the statistical analyses. Manually searching, renaming and recoding variables are extremely tedious and prone to errors tasks, overall when the number of datasets and variables are large. This article presents an automated approach to harmonise variables names across several datasets, which optimises the search of variables, minimises manual inputs and reduces the risk of error. SN - 1472-6947 UR - https://doi.org/10.1186/1472-6947-11-33 DO - 10.1186/1472-6947-11-33 ID - Bosch-Capblanch2011 ER -