Skip to main content

Table 4 The open-source datasets of the four English and Chinese Medical NLP tasks

From: Transformers-sklearn: a toolkit for medical language understanding with transformer-based models

Name

NLP Task

Language

Domain

Metric

TrialClassification [20]

Classification

Chinese

Clinical Trial

Macro F1

BC5CDR [21]

NER

English

PubMed titles and abstracts

Macro F1

DiabetesNER [22]

NER

Chinese

Diabetes Papers

Macro F1

BIOSSES [23]

Regression

English

Biomedical

Pearson correlation