Skip to main content

Table 4 The performance of CRF with all features for CTCF recognition

From: Enriching the international clinical nomenclature with Chinese daily used synonyms and concept recognition in physician notes

Round

Models

TP

FP

FN

P

R

F

Total

-

M0 (baseline)

9635

1111

1583

0.897

0.859

0.877

11,218

1st

M0 + F1

9654

1135

1564

0.895

0.861

0.877

11,218

1st

M0 + F2

9671

1127

1547

0.896

0.862

0.879

11,218

1st

M0 + F3

9664

1135

1554

0.895

0.862

0.878

11,218

1st

M0 + F4

9678

1101

1540

0.898

0.863

0.880

11,218

1st

M0 + F5 (M1)

9711

1057

1507

0.902

0.866

0.883

11,218

2nd

M1 + F1

9717

1066

1501

0.901

0.866

0.883

11,218

2nd

M1 + F2

9725

1083

1493

0.900

0.867

0.883

11,218

2nd

M1 + F3

9732

1082

1486

0.900

0.868

0.883

11,218

2nd

M1 + F4 (M2)

9725

1071

1493

0.901

0.867

0.884

11,218

3rd

M2 + F1 (M3)

9754

1062

1464

0.901

0.870

0.885

11,218

3rd

M2 + F2

9752

1080

1466

0.900

0.869

0.885

11,218

3rd

M2 + F3

9758

1094

1460

0.899

0.870

0.884

11,218

4th

M3 + F2 (M4)

9785

1069

1433

0.902

0.872

0.887

11,218

4th

M3 + F3

9765

1097

1453

0.899

0.871

0.885

11,218

  1. F1 the stop character feature, F2 the current word feature, F3 the current and context word feature, F4 the word POS tag feature, F5 the word associative feature