Skip to main content

Table 3 Observation symbols currently supported by the Febrl package

From: Preparation of name and address data for record linkage using hidden Markov models

Symbol Description Usage Based on
LQ Locality qualifier words Addresses Look-up table
LN Locality (town, suburb) names Addresses Look-up table
TR Territory (state, region) names Addresses Lookup table
CR Country names Addresses Look-up table
IT Types of institution Addresses Look-up table
IN Names of institutions Addresses Look-up table
PA Type of postal address Addresses Look-up table
PC Postal (zip) codes Addresses Look-up table
UT Types of housing unit (eg flat, apartment) Addresses Look-up table
WN Wayfare names Addresses Look-up table
WT Wayfare types (eg street, road, avenue) Addresses Look-up table
TI Title words (eg Dr, Prof, Ms) Names Look-up table
SN Surnames Names Look-up table
GF Female given names Names Look-up table
GM Male given names Names Look-up table
PR Name prefixes Names Look-up table
SP Name qualifiers (eg aka, also known as) Names Look-up table
BO "baby of" and similar strings Names Look-up table
NE "Nee", "born as" or similar Names Look-up table
II One letter words (initials) Names Coded rule
ST Saint names (eg Saint George, San Angelo) Both Look-up table
CO Comma, semi-colon, colon Both Coded rule
SL Slash "/" and back-slash "\" Both Coded rule
N4 Numbers with four digits Addresses Coded rule
NU Other numbers Both Coded rule
AN Alphanumeric words Both Coded rule
VB Brackets, braces, quotes Both Coded rule
RU Rubbish Both Look-up table
UN Unknown (none of the above) Both Coded rule