From: Automatic de-identification of French electronic health records: a cost-effective approach exploiting distant supervision and deep learning models
Hyperparameter
Value
Attention heads
12
Batch size
64
Epochs
5
Hidden size
768
Hidden layers
Maximum Sequence Length
512
Parameters
179 M
Optimizer
Adam