BMC Medical Informatics and Decision Making

Table 1 Statistics of our fine-grained Chinese word segmentation and part-of-speech tagging Corpus for clinical text

From: A fine-grained Chinese word segmentation and part-of-speech tagging corpus for clinical text

Dataset	Notes	Sentences	words
Training	1440	6867	158,035
valid	180	813	19,290
test	180	857	21,472
total	1800	8537	198,797

Back to article page

ISSN: 1472-6947

Contact us

General enquiries: journalsubmissions@springernature.com