Table 2 A comparison of works on the corpus building of health and medical questions

From: Qcorp: an annotated classification corpus of Chinese health questions

| Corpus or Author name | Language | Asker | Corpus scale | Question sources | Disease covering | Annotated categories | Layers |
| --- | --- | --- | --- | --- | --- | --- | --- |
| NLM collected clinical questions [19] | En | P | 4,654 | Clinical settings (5 studies [20,21,22,23,24,25]) | Not limited | 64 | 4 |
| Patrick J [30] | En | P | 595 | Clinical settings | Not limited | 11 | 4 |
| Zhang Y [31] | En | C | 600 | 1 website | 23 subcategories | >50 | 5 |
| Roberts K [32] | En | C | 1,467 | 1 website | Genetic and rare diseases | 13 | 1 |
| Maroy S [34] | En | C | 1,279 | 6 websites | Cancer | 10 | 2 |
| Yin JW [37] | Cn | C | 1,600 | 1 health APP | Maternal and infant health | 8 | 1 |
| Zhang N [38] | Cn | C | 4,465 | 1 website, books, self-composed | Skin disease | 52 | 2 |
| Tang GY [39] | Cn | C | 1,688 | 4 websites | Hyperlipidemia | 241 | 1 |
| Our Qcorp | Cn | C | 5,000 | 5 websites | 6 broad sections | 29 | 2 |

Note: P refers to physician, and C refers to consumer.