
Table 1 Distribution of the THYME corpus: counts of documents, temporal expressions, event expressions, and ER in the colon cancer and brain cancer sections, broken down by data split

From: A system for automatically extracting clinical events with temporal information

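| Data | Colon cancer: Train | Colon cancer: Dev | Colon cancer: Test | Brain cancer: Train | Brain cancer: Test |
| --- | --- | --- | --- | --- | --- |
| Document | 293 | 143 | 141 | 30 | 148 |
| Temporal expressions | 3833 | 2078 | 1952 | 350 | 1552 |
| Event expressions | 38,890 | 20,974 | 18,990 | 2557 | 11,510 |
| ER | 11,150 | 6163 | 5894 | 624 | 1759 |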