Table 3 Characteristics (counts of sentences, words, and entities, words per sentence, entities per sentence, and entity density) in five folds of the dataset and the pool of querying data

From: An active learning-enabled annotation system for clinical named entity recognition

 

| | Sentence count | Word count | Entity count | Words per sentence | Entities per sentence | Entity density (a) |
|---|---|---|---|---|---|---|
| Fold 1 | 4,085 | 44,403 | 5,395 | 10.87 | 1.32 | 0.25 |
| Fold 2 | 4,085 | 45,588 | 5,183 | 11.16 | 1.27 | 0.24 |
| Fold 3 | 4,084 | 45,355 | 5,201 | 11.11 | 1.27 | 0.24 |
| Fold 4 | 4,085 | 45,141 | 5,263 | 11.05 | 1.29 | 0.25 |
| Fold 5 | 4,084 | 44,834 | 5,177 | 10.98 | 1.27 | 0.24 |
| Pool (Fold 2 + 3 + 4 + 5) | 16,338 | 180,918 | 20,824 | 11.07 | 1.27 | 0.24 |

(a) Entity density is the number of words in entities divided by the total number of words.
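
The derived columns can be rechecked from the raw counts. The following is a minimal sketch, not the authors' code: the names `FOLDS`, `ratios`, `pool`, and `entity_density` are hypothetical, and the per-fold counts are copied from Table 3. The number of words inside entities is not reported in the table, so `entity_density` only illustrates the footnote's definition and is not applied to the table data.

```python
# Per-fold counts copied from Table 3: (sentences, words, entities).
FOLDS = {
    "Fold 1": (4085, 44403, 5395),
    "Fold 2": (4085, 45588, 5183),
    "Fold 3": (4084, 45355, 5201),
    "Fold 4": (4085, 45141, 5263),
    "Fold 5": (4084, 44834, 5177),
}


def ratios(sentences, words, entities):
    """Return (words per sentence, entities per sentence)."""
    return words / sentences, entities / sentences


def pool(names):
    """Sum the raw counts over the named folds."""
    sentences = sum(FOLDS[n][0] for n in names)
    words = sum(FOLDS[n][1] for n in names)
    entities = sum(FOLDS[n][2] for n in names)
    return sentences, words, entities


def entity_density(entity_word_count, total_word_count):
    """Footnote definition: words inside entities / total words.

    entity_word_count is not given in Table 3, so it must come
    from the annotated corpus itself.
    """
    return entity_word_count / total_word_count


pool_counts = pool(["Fold 2", "Fold 3", "Fold 4", "Fold 5"])
pool_wps, pool_eps = ratios(*pool_counts)
print(pool_counts, f"{pool_wps:.2f}", f"{pool_eps:.2f}")
# prints: (16338, 180918, 20824) 11.07 1.27
```

The pooled ratios are computed from the summed counts rather than by averaging the per-fold ratios, which matches the Pool row of the table.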