Skip to main content

Table 2 A comparison of works on the corpus building of health and medical questions

From: Qcorp: an annotated classification corpus of Chinese health questions

Corpus or Author name Language Asker Corpus scale Question sources Disease covering Annotated categories Layers
NLM collected clinical questions [19] En P 4,654 Clinical settings (5 studies [20,21,22,23,24,25]) Not limited 64 4
Patrick J [30] En P 595 Clinical settings Not limited 11 4
Zhang Y [31] En C 600 1 website 23 subcategories >50 5
Roberts K [32] En C 1,467 1 website Genetic and rare diseases 13 1
Maroy S [34] En C 1,279 6 websites Cancer 10 2
Yin JW [37] Cn C 1,600 1 health APP Maternal and infant health 8 1
Zhang N [38] Cn C 4,465 1 website, books, self-composed Skin disease 52 2
Tang GY [39] Cn C 1,688 4 websites Hyperlipidemia 241 1
Our Qcorp Cn C 5,000 5 websites 6 broad sections 29 2
  1. P refers to physician, and C refers to consumer