<!-- This model card has been generated automatically according to the information Keras had access to. You should probably proofread and complete it, then remove this comment. -->
pretrained-m-bert-100
This model is a fine-tuned version of on an unknown dataset. It achieves the following results on the evaluation set:
- Train Loss: 5.7003
- Validation Loss: 15.3566
- Epoch: 99
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- optimizer: {'name': 'Adam', 'learning_rate': 1e-04, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
- training_precision: float32
Training results
Train Loss | Validation Loss | Epoch |
---|---|---|
10.2669 | 10.9400 | 0 |
7.8880 | 10.8967 | 1 |
6.8580 | 11.5024 | 2 |
6.4321 | 11.5023 | 3 |
6.2235 | 11.2212 | 4 |
6.0038 | 11.3128 | 5 |
5.9881 | 11.3604 | 6 |
5.4409 | 11.6872 | 7 |
5.2113 | 11.5379 | 8 |
5.2660 | 12.0264 | 9 |
5.2330 | 11.7627 | 10 |
5.1121 | 12.2919 | 11 |
5.2126 | 12.6272 | 12 |
5.2086 | 11.3478 | 13 |
5.2459 | 12.2183 | 14 |
5.0035 | 11.7580 | 15 |
4.9613 | 12.4852 | 16 |
5.0312 | 12.4627 | 17 |
5.0073 | 13.6309 | 18 |
5.4284 | 12.7799 | 19 |
5.3100 | 12.6417 | 20 |
5.0765 | 12.7851 | 21 |
5.2276 | 13.3828 | 22 |
5.1986 | 12.7421 | 23 |
4.8935 | 12.8679 | 24 |
4.6959 | 12.9201 | 25 |
5.4161 | 13.4416 | 26 |
5.2459 | 14.0112 | 27 |
5.2781 | 13.2740 | 28 |
5.5104 | 12.8646 | 29 |
5.5024 | 13.7514 | 30 |
5.6284 | 13.7125 | 31 |
5.8452 | 13.6332 | 32 |
5.5767 | 13.8019 | 33 |
5.6444 | 13.4279 | 34 |
5.5551 | 13.2666 | 35 |
5.5421 | 13.5996 | 36 |
5.5246 | 13.1686 | 37 |
5.5233 | 13.3788 | 38 |
5.6011 | 13.4038 | 39 |
5.3695 | 13.5241 | 40 |
5.5061 | 13.6035 | 41 |
5.4534 | 13.8652 | 42 |
5.4222 | 13.4525 | 43 |
5.4408 | 13.6572 | 44 |
5.6683 | 13.7671 | 45 |
5.7137 | 14.1255 | 46 |
5.6777 | 14.4026 | 47 |
5.6776 | 14.3435 | 48 |
5.8337 | 14.3650 | 49 |
5.8583 | 14.2897 | 50 |
5.6849 | 14.6518 | 51 |
5.7112 | 14.5420 | 52 |
5.7281 | 13.9947 | 53 |
5.9154 | 14.3210 | 54 |
5.6742 | 13.8867 | 55 |
5.8674 | 14.2819 | 56 |
5.7128 | 14.5811 | 57 |
5.7091 | 14.2113 | 58 |
5.7479 | 14.4418 | 59 |
5.7632 | 13.9566 | 60 |
5.6443 | 14.1394 | 61 |
5.6794 | 14.5981 | 62 |
5.6450 | 14.5139 | 63 |
5.6935 | 14.3309 | 64 |
5.7443 | 14.3540 | 65 |
5.7014 | 14.7472 | 66 |
5.7407 | 14.4245 | 67 |
5.9023 | 14.4602 | 68 |
5.9222 | 14.6654 | 69 |
5.6813 | 14.3179 | 70 |
5.6505 | 14.1670 | 71 |
5.8407 | 14.2520 | 72 |
5.6683 | 14.1696 | 73 |
5.6880 | 15.1198 | 74 |
5.8254 | 14.2783 | 75 |
5.7758 | 14.5934 | 76 |
5.7180 | 14.4779 | 77 |
5.7348 | 14.3955 | 78 |
5.6680 | 14.0637 | 79 |
5.7029 | 14.6120 | 80 |
5.7088 | 14.3396 | 81 |
5.7215 | 14.5878 | 82 |
5.5987 | 15.0465 | 83 |
5.7613 | 14.7521 | 84 |
5.7670 | 14.9828 | 85 |
5.7954 | 14.6714 | 86 |
5.6080 | 15.2686 | 87 |
5.7493 | 14.8772 | 88 |
5.6884 | 14.4567 | 89 |
5.6932 | 14.3316 | 90 |
5.7152 | 15.2725 | 91 |
5.6548 | 15.0855 | 92 |
5.6196 | 14.8487 | 93 |
5.7889 | 14.7169 | 94 |
5.5958 | 14.9320 | 95 |
5.7047 | 14.8829 | 96 |
5.5637 | 14.8704 | 97 |
5.6375 | 14.7917 | 98 |
5.7003 | 15.3566 | 99 |
Framework versions
- Transformers 4.27.0.dev0
- TensorFlow 2.9.2
- Datasets 2.9.0
- Tokenizers 0.13.2