From: An evaluation of GPT models for phenotype concept recognition
Document-level
Mention-level
Precision
Recall
F1
HPO-GS
0.63
0.29
0.4
0.6
0.19
BIOC-GS
0.55
0.42
0.48
0.5
0.37
0.43