ele-sage commited on
Commit
69116fe
·
verified ·
1 Parent(s): 371f899

distilbert-base-uncased-name-classifier

Browse files
README.md CHANGED
@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
21
 
22
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
23
  It achieves the following results on the evaluation set:
24
- - Loss: 0.0230
25
- - Accuracy: 0.9937
26
- - Precision: 0.9983
27
- - Recall: 0.9904
28
- - F1: 0.9943
29
 
30
  ## Model description
31
 
@@ -51,25 +51,25 @@ The following hyperparameters were used during training:
51
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
52
  - lr_scheduler_type: linear
53
  - lr_scheduler_warmup_steps: 1000
54
- - num_epochs: 1
55
 
56
  ### Training results
57
 
58
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 |
59
  |:-------------:|:------:|:-----:|:---------------:|:--------:|:---------:|:------:|:------:|
60
- | 0.0397 | 0.0718 | 2000 | 0.0405 | 0.9885 | 0.9981 | 0.9812 | 0.9896 |
61
- | 0.0324 | 0.1435 | 4000 | 0.0303 | 0.9914 | 0.9970 | 0.9875 | 0.9923 |
62
- | 0.031 | 0.2153 | 6000 | 0.0295 | 0.9914 | 0.9938 | 0.9907 | 0.9923 |
63
- | 0.0295 | 0.2870 | 8000 | 0.0271 | 0.9924 | 0.9970 | 0.9894 | 0.9932 |
64
- | 0.0275 | 0.3588 | 10000 | 0.0262 | 0.9926 | 0.9964 | 0.9904 | 0.9934 |
65
- | 0.0281 | 0.4305 | 12000 | 0.0256 | 0.9930 | 0.9981 | 0.9893 | 0.9937 |
66
- | 0.0244 | 0.5023 | 14000 | 0.0272 | 0.9926 | 0.9991 | 0.9876 | 0.9933 |
67
- | 0.0229 | 0.5740 | 16000 | 0.0254 | 0.9931 | 0.9970 | 0.9907 | 0.9938 |
68
- | 0.0264 | 0.6458 | 18000 | 0.0248 | 0.9932 | 0.9986 | 0.9892 | 0.9939 |
69
- | 0.0258 | 0.7175 | 20000 | 0.0237 | 0.9934 | 0.9983 | 0.9899 | 0.9941 |
70
- | 0.0236 | 0.7893 | 22000 | 0.0234 | 0.9936 | 0.9982 | 0.9903 | 0.9943 |
71
- | 0.0253 | 0.8610 | 24000 | 0.0231 | 0.9936 | 0.9979 | 0.9907 | 0.9943 |
72
- | 0.0248 | 0.9328 | 26000 | 0.0230 | 0.9937 | 0.9983 | 0.9904 | 0.9943 |
73
 
74
 
75
  ### Framework versions
 
21
 
22
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
23
  It achieves the following results on the evaluation set:
24
+ - Loss: 0.0225
25
+ - Accuracy: 0.9939
26
+ - Precision: 0.9979
27
+ - Recall: 0.9912
28
+ - F1: 0.9945
29
 
30
  ## Model description
31
 
 
51
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
52
  - lr_scheduler_type: linear
53
  - lr_scheduler_warmup_steps: 1000
54
+ - num_epochs: 2
55
 
56
  ### Training results
57
 
58
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 |
59
  |:-------------:|:------:|:-----:|:---------------:|:--------:|:---------:|:------:|:------:|
60
+ | 0.0323 | 0.1435 | 4000 | 0.0303 | 0.9915 | 0.9975 | 0.9872 | 0.9923 |
61
+ | 0.0297 | 0.2870 | 8000 | 0.0279 | 0.9923 | 0.9963 | 0.9899 | 0.9931 |
62
+ | 0.0283 | 0.4305 | 12000 | 0.0257 | 0.9929 | 0.9978 | 0.9895 | 0.9936 |
63
+ | 0.0229 | 0.5740 | 16000 | 0.0258 | 0.9932 | 0.9972 | 0.9905 | 0.9938 |
64
+ | 0.0263 | 0.7175 | 20000 | 0.0239 | 0.9934 | 0.9981 | 0.9901 | 0.9940 |
65
+ | 0.0256 | 0.8610 | 24000 | 0.0233 | 0.9935 | 0.9976 | 0.9908 | 0.9942 |
66
+ | 0.023 | 1.0046 | 28000 | 0.0233 | 0.9936 | 0.9976 | 0.9909 | 0.9943 |
67
+ | 0.0214 | 1.1481 | 32000 | 0.0231 | 0.9937 | 0.9986 | 0.9902 | 0.9944 |
68
+ | 0.0207 | 1.2916 | 36000 | 0.0232 | 0.9938 | 0.9984 | 0.9905 | 0.9944 |
69
+ | 0.0215 | 1.4351 | 40000 | 0.0229 | 0.9938 | 0.9978 | 0.9910 | 0.9944 |
70
+ | 0.0206 | 1.5786 | 44000 | 0.0232 | 0.9938 | 0.9976 | 0.9913 | 0.9944 |
71
+ | 0.0197 | 1.7221 | 48000 | 0.0229 | 0.9939 | 0.9978 | 0.9912 | 0.9945 |
72
+ | 0.0216 | 1.8656 | 52000 | 0.0225 | 0.9939 | 0.9979 | 0.9912 | 0.9945 |
73
 
74
 
75
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:348d244e9a1f1cf3da7e6668e29924033194e06b0e80c4ab90588d0f11ca9bd9
3
  size 267832560
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:678761693231623744a4d06b70588b00e157f059d628103ade400601d56c9801
3
  size 267832560
runs/Dec07_19-32-58_elesage-pc/events.out.tfevents.1765154092.elesage-pc.198534.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:efd47bc06a34b1ff5da63ddc4872f3d8d6556306058bd17a04def96b4ef73f54
3
+ size 71079
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4cc87171a4c10cd5c65823f0406598b1e6064512435b585c4d33fd491196ffc1
3
  size 5905
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:06e4a6182834d544587b3d84193f04b0a800c85c2b82b092c2cd1213a4173aa3
3
  size 5905