Skip to main content

Table 3 Data statistics

From: Deep learning approach to detection of colonoscopic information from unstructured reports

Data

For pre-trained word embedding

For training and test

Year

2000–2015

2011–2015

Number of documents

280,668

5,000

Number of sentences

4,193,814

81,666

Number of types of words

41,563

4,478