Skip to main content

Table 1 Comparison with other existing dataset

From: A Chinese telemedicine-dialogue dataset annotated for named entities

 

This dataset

IMCS-NER

Count of all named entities

63,560

74,698

Average length of entity

4.33

2.63

Count of total characters

1,700,392

1,621,161

Ratio of tagged characters to total ones

16.2%

12.1%

Average count of characters per consultation

713.55

589.04