Skip to main content

Table 1 Hyperparameters of the BERT-based model

From: Automatic de-identification of French electronic health records: a cost-effective approach exploiting distant supervision and deep learning models

Hyperparameter

Value

Attention heads

12

Batch size

64

Epochs

5

Hidden size

768

Hidden layers

12

Maximum Sequence Length

512

Parameters

179 M

Optimizer

Adam