LegalLMs collection: XLM-RoBERTa models with continued pretraining on the MultiLegalPile
This model was trained from scratch on an unknown dataset. It achieves the following results on the evaluation set:
More information needed
The hyperparameters used during training are not listed here. Training and validation loss at each logged checkpoint:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.6815 | 23.01 | 50000 | 0.5264 |
| 0.6655 | 47.0 | 100000 | 0.4623 |
| 0.5867 | 70.01 | 150000 | 0.4325 |
| 0.5706 | 94.0 | 200000 | 0.4186 |
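
The trend in the table is easier to read with a little arithmetic. The sketch below uses only the four rows listed above to compute how much the validation loss fell between consecutive checkpoints and to roughly estimate the number of optimizer steps per epoch. The function names are illustrative and not part of any library, and the steps-per-epoch figure is an approximation derived from the last logged row, not a documented setting.

```python
# Rows copied from the table above: (training loss, epoch, step, validation loss).
ROWS = [
    (0.6815, 23.01, 50000, 0.5264),
    (0.6655, 47.0, 100000, 0.4623),
    (0.5867, 70.01, 150000, 0.4325),
    (0.5706, 94.0, 200000, 0.4186),
]


def validation_loss_drops(rows):
    """Return the decrease in validation loss between consecutive checkpoints."""
    losses = [row[3] for row in rows]
    return [round(prev - cur, 4) for prev, cur in zip(losses, losses[1:])]


def approx_steps_per_epoch(rows):
    """Estimate optimizer steps per epoch from the last logged row (assumption:
    the step/epoch ratio is constant over the run)."""
    _, epoch, step, _ = rows[-1]
    return step / epoch


if __name__ == "__main__":
    print(validation_loss_drops(ROWS))
    print(round(approx_steps_per_epoch(ROWS)))
```

Each 50,000-step interval yields a smaller improvement than the one before it (0.0641, then 0.0298, then 0.0139), so the validation loss is flattening toward the end of the run.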