distilbert-base-uncased-name-classifier

Files changed (4)

README.md +19 -19
model.safetensors +1 -1
runs/Dec07_19-32-58_elesage-pc/events.out.tfevents.1765154092.elesage-pc.198534.0 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0230
-- Accuracy: 0.9937
-- Precision: 0.9983
-- Recall: 0.9904
-- F1: 0.9943
 ## Model description
@@ -51,25 +51,25 @@ The following hyperparameters were used during training:
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 1000
-- num_epochs: 1
 ### Training results
 | Training Loss | Epoch  | Step  | Validation Loss | Accuracy | Precision | Recall | F1     |
 |:-------------:|:------:|:-----:|:---------------:|:--------:|:---------:|:------:|:------:|
-| 0.0397        | 0.0718 | 2000  | 0.0405          | 0.9885   | 0.9981    | 0.9812 | 0.9896 |
-| 0.0324        | 0.1435 | 4000  | 0.0303          | 0.9914   | 0.9970    | 0.9875 | 0.9923 |
-| 0.031         | 0.2153 | 6000  | 0.0295          | 0.9914   | 0.9938    | 0.9907 | 0.9923 |
-| 0.0295        | 0.2870 | 8000  | 0.0271          | 0.9924   | 0.9970    | 0.9894 | 0.9932 |
-| 0.0275        | 0.3588 | 10000 | 0.0262          | 0.9926   | 0.9964    | 0.9904 | 0.9934 |
-| 0.0281        | 0.4305 | 12000 | 0.0256          | 0.9930   | 0.9981    | 0.9893 | 0.9937 |
-| 0.0244        | 0.5023 | 14000 | 0.0272          | 0.9926   | 0.9991    | 0.9876 | 0.9933 |
-| 0.0229        | 0.5740 | 16000 | 0.0254          | 0.9931   | 0.9970    | 0.9907 | 0.9938 |
-| 0.0264        | 0.6458 | 18000 | 0.0248          | 0.9932   | 0.9986    | 0.9892 | 0.9939 |
-| 0.0258        | 0.7175 | 20000 | 0.0237          | 0.9934   | 0.9983    | 0.9899 | 0.9941 |
-| 0.0236        | 0.7893 | 22000 | 0.0234          | 0.9936   | 0.9982    | 0.9903 | 0.9943 |
-| 0.0253        | 0.8610 | 24000 | 0.0231          | 0.9936   | 0.9979    | 0.9907 | 0.9943 |
-| 0.0248        | 0.9328 | 26000 | 0.0230          | 0.9937   | 0.9983    | 0.9904 | 0.9943 |
 ### Framework versions

 This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0225
+- Accuracy: 0.9939
+- Precision: 0.9979
+- Recall: 0.9912
+- F1: 0.9945
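
Reviewer note: the updated summary can be sanity-checked by recomputing F1 as the harmonic mean of the reported precision and recall. This is a minimal sketch, not part of the commit. The helper name `f1_score` is hypothetical, and because the inputs are already rounded to four decimals, the check can only confirm agreement to that precision.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (hypothetical helper for this check)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values copied from the updated evaluation summary in README.md.
reported = {"precision": 0.9979, "recall": 0.9912, "f1": 0.9945}

recomputed = f1_score(reported["precision"], reported["recall"])
print(f"recomputed F1 = {recomputed:.4f}, reported F1 = {reported['f1']:.4f}")
```

If the recomputed value ever differed from the reported one by more than rounding error, that would suggest the summary lines were edited by hand or taken from different evaluation steps.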
 ## Model description
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 1000
+- num_epochs: 2
 ### Training results
 | Training Loss | Epoch  | Step  | Validation Loss | Accuracy | Precision | Recall | F1     |
 |:-------------:|:------:|:-----:|:---------------:|:--------:|:---------:|:------:|:------:|
+| 0.0323        | 0.1435 | 4000  | 0.0303          | 0.9915   | 0.9975    | 0.9872 | 0.9923 |
+| 0.0297        | 0.2870 | 8000  | 0.0279          | 0.9923   | 0.9963    | 0.9899 | 0.9931 |
+| 0.0283        | 0.4305 | 12000 | 0.0257          | 0.9929   | 0.9978    | 0.9895 | 0.9936 |
+| 0.0229        | 0.5740 | 16000 | 0.0258          | 0.9932   | 0.9972    | 0.9905 | 0.9938 |
+| 0.0263        | 0.7175 | 20000 | 0.0239          | 0.9934   | 0.9981    | 0.9901 | 0.9940 |
+| 0.0256        | 0.8610 | 24000 | 0.0233          | 0.9935   | 0.9976    | 0.9908 | 0.9942 |
+| 0.023         | 1.0046 | 28000 | 0.0233          | 0.9936   | 0.9976    | 0.9909 | 0.9943 |
+| 0.0214        | 1.1481 | 32000 | 0.0231          | 0.9937   | 0.9986    | 0.9902 | 0.9944 |
+| 0.0207        | 1.2916 | 36000 | 0.0232          | 0.9938   | 0.9984    | 0.9905 | 0.9944 |
+| 0.0215        | 1.4351 | 40000 | 0.0229          | 0.9938   | 0.9978    | 0.9910 | 0.9944 |
+| 0.0206        | 1.5786 | 44000 | 0.0232          | 0.9938   | 0.9976    | 0.9913 | 0.9944 |
+| 0.0197        | 1.7221 | 48000 | 0.0229          | 0.9939   | 0.9978    | 0.9912 | 0.9945 |
+| 0.0216        | 1.8656 | 52000 | 0.0225          | 0.9939   | 0.9979    | 0.9912 | 0.9945 |
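
Reviewer note: the updated table stops at epoch 1.8656 rather than 2.0. One plausible reading, sketched below, is that evaluation runs every 4000 steps and step 52000 is the last multiple of 4000 that fits inside two epochs. The 4000-step interval is inferred from the table spacing, not read from `training_args.bin`, and all variable names are illustrative.

```python
# (epoch, step) pairs copied from the updated training-results table.
checkpoints = [(0.1435, 4000), (1.0046, 28000), (1.8656, 52000)]

# Each row gives an independent estimate of optimizer steps per epoch.
estimates = [step / epoch for epoch, step in checkpoints]
avg_steps_per_epoch = sum(estimates) / len(estimates)

num_epochs = 2        # from the updated hyperparameter list
eval_interval = 4000  # assumption: inferred from the spacing of the table rows

approx_total_steps = round(avg_steps_per_epoch * num_epochs)
last_eval_step = (approx_total_steps // eval_interval) * eval_interval

print(f"steps per epoch (approx.): {avg_steps_per_epoch:.0f}")
print(f"total steps (approx.): {approx_total_steps}")
print(f"last scheduled evaluation step: {last_eval_step}")
```

Under these assumptions the last scheduled evaluation lands on step 52000, which matches the final row of the table and the metrics quoted in the summary.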
 ### Framework versions

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:348d244e9a1f1cf3da7e6668e29924033194e06b0e80c4ab90588d0f11ca9bd9
 size 267832560

 version https://git-lfs.github.com/spec/v1
+oid sha256:678761693231623744a4d06b70588b00e157f059d628103ade400601d56c9801
 size 267832560

runs/Dec07_19-32-58_elesage-pc/events.out.tfevents.1765154092.elesage-pc.198534.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:efd47bc06a34b1ff5da63ddc4872f3d8d6556306058bd17a04def96b4ef73f54
+size 71079

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4cc87171a4c10cd5c65823f0406598b1e6064512435b585c4d33fd491196ffc1
 size 5905

 version https://git-lfs.github.com/spec/v1
+oid sha256:06e4a6182834d544587b3d84193f04b0a800c85c2b82b092c2cd1213a4173aa3
 size 5905
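
Reviewer note: the binary files in this commit appear only as Git LFS pointers, so the diff shows a changed `oid` with an unchanged `size`. The sketch below parses the two `training_args.bin` pointers from this diff and compares them. `parse_lfs_pointer` is a hypothetical helper written for this note, not part of Git LFS or the Hub tooling, and it assumes the simple `key value` line layout visible above.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its fields (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the training_args.bin hunk of this commit.
old = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:4cc87171a4c10cd5c65823f0406598b1e6064512435b585c4d33fd491196ffc1
size 5905
""")
new = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:06e4a6182834d544587b3d84193f04b0a800c85c2b82b092c2cd1213a4173aa3
size 5905
""")

content_changed = old["digest"] != new["digest"]
size_changed = old["size"] != new["size"]
print(f"content changed: {content_changed}, size changed: {size_changed}")
```

A changed digest with an identical byte size is consistent with the same serialized structure holding different values, such as `num_epochs` going from 1 to 2, although the pointer alone cannot confirm which fields differ.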