Update README.md
Browse files
README.md
CHANGED
|
@@ -1,5 +1,5 @@
|
|
| 1 |
---
|
| 2 |
-
license:
|
| 3 |
library_name: transformers
|
| 4 |
base_model: nbeerbower/Mistral-Small-Drummer-22B
|
| 5 |
datasets:
|
|
@@ -26,7 +26,8 @@ model-index:
|
|
| 26 |
value: 63.31
|
| 27 |
name: strict accuracy
|
| 28 |
source:
|
| 29 |
-
url:
|
|
|
|
| 30 |
name: Open LLM Leaderboard
|
| 31 |
- task:
|
| 32 |
type: text-generation
|
|
@@ -41,7 +42,8 @@ model-index:
|
|
| 41 |
value: 40.12
|
| 42 |
name: normalized accuracy
|
| 43 |
source:
|
| 44 |
-
url:
|
|
|
|
| 45 |
name: Open LLM Leaderboard
|
| 46 |
- task:
|
| 47 |
type: text-generation
|
|
@@ -56,7 +58,8 @@ model-index:
|
|
| 56 |
value: 16.69
|
| 57 |
name: exact match
|
| 58 |
source:
|
| 59 |
-
url:
|
|
|
|
| 60 |
name: Open LLM Leaderboard
|
| 61 |
- task:
|
| 62 |
type: text-generation
|
|
@@ -71,7 +74,8 @@ model-index:
|
|
| 71 |
value: 12.42
|
| 72 |
name: acc_norm
|
| 73 |
source:
|
| 74 |
-
url:
|
|
|
|
| 75 |
name: Open LLM Leaderboard
|
| 76 |
- task:
|
| 77 |
type: text-generation
|
|
@@ -86,7 +90,8 @@ model-index:
|
|
| 86 |
value: 9.8
|
| 87 |
name: acc_norm
|
| 88 |
source:
|
| 89 |
-
url:
|
|
|
|
| 90 |
name: Open LLM Leaderboard
|
| 91 |
- task:
|
| 92 |
type: text-generation
|
|
@@ -103,7 +108,8 @@ model-index:
|
|
| 103 |
value: 34.39
|
| 104 |
name: accuracy
|
| 105 |
source:
|
| 106 |
-
url:
|
|
|
|
| 107 |
name: Open LLM Leaderboard
|
| 108 |
---
|
| 109 |
|
|
@@ -111,6 +117,28 @@ model-index:
|
|
| 111 |
This model was converted to GGUF format from [`nbeerbower/Mistral-Small-Drummer-22B`](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
| 112 |
Refer to the [original model card](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) for more details on the model.
|
| 113 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 114 |
## Use with llama.cpp
|
| 115 |
Install llama.cpp through brew (works on Mac and Linux)
|
| 116 |
|
|
@@ -149,4 +177,4 @@ Step 3: Run inference through the main binary.
|
|
| 149 |
or
|
| 150 |
```
|
| 151 |
./llama-server --hf-repo Triangle104/Mistral-Small-Drummer-22B-Q8_0-GGUF --hf-file mistral-small-drummer-22b-q8_0.gguf -c 2048
|
| 152 |
-
```
|
|
|
|
| 1 |
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
library_name: transformers
|
| 4 |
base_model: nbeerbower/Mistral-Small-Drummer-22B
|
| 5 |
datasets:
|
|
|
|
| 26 |
value: 63.31
|
| 27 |
name: strict accuracy
|
| 28 |
source:
|
| 29 |
+
url: >-
|
| 30 |
+
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
|
| 31 |
name: Open LLM Leaderboard
|
| 32 |
- task:
|
| 33 |
type: text-generation
|
|
|
|
| 42 |
value: 40.12
|
| 43 |
name: normalized accuracy
|
| 44 |
source:
|
| 45 |
+
url: >-
|
| 46 |
+
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
|
| 47 |
name: Open LLM Leaderboard
|
| 48 |
- task:
|
| 49 |
type: text-generation
|
|
|
|
| 58 |
value: 16.69
|
| 59 |
name: exact match
|
| 60 |
source:
|
| 61 |
+
url: >-
|
| 62 |
+
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
|
| 63 |
name: Open LLM Leaderboard
|
| 64 |
- task:
|
| 65 |
type: text-generation
|
|
|
|
| 74 |
value: 12.42
|
| 75 |
name: acc_norm
|
| 76 |
source:
|
| 77 |
+
url: >-
|
| 78 |
+
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
|
| 79 |
name: Open LLM Leaderboard
|
| 80 |
- task:
|
| 81 |
type: text-generation
|
|
|
|
| 90 |
value: 9.8
|
| 91 |
name: acc_norm
|
| 92 |
source:
|
| 93 |
+
url: >-
|
| 94 |
+
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
|
| 95 |
name: Open LLM Leaderboard
|
| 96 |
- task:
|
| 97 |
type: text-generation
|
|
|
|
| 108 |
value: 34.39
|
| 109 |
name: accuracy
|
| 110 |
source:
|
| 111 |
+
url: >-
|
| 112 |
+
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
|
| 113 |
name: Open LLM Leaderboard
|
| 114 |
---
|
| 115 |
|
|
|
|
| 117 |
This model was converted to GGUF format from [`nbeerbower/Mistral-Small-Drummer-22B`](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
| 118 |
Refer to the [original model card](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) for more details on the model.
|
| 119 |
|
| 120 |
+
---
|
| 121 |
+
Model details:
|
| 122 |
+
-
|
| 123 |
+
mistralai/Mistral-Small-Instruct-2409 finetuned on jondurbin/gutenberg-dpo-v0.1 and nbeerbower/gutenberg2-dpo.
|
| 124 |
+
|
| 125 |
+
Method
|
| 126 |
+
ORPO tuned with 2xA40 on RunPod for 1 epoch.
|
| 127 |
+
|
| 128 |
+
learning_rate=4e-6,
|
| 129 |
+
lr_scheduler_type="linear",
|
| 130 |
+
beta=0.1,
|
| 131 |
+
per_device_train_batch_size=4,
|
| 132 |
+
per_device_eval_batch_size=4,
|
| 133 |
+
gradient_accumulation_steps=8,
|
| 134 |
+
optim="paged_adamw_8bit",
|
| 135 |
+
num_train_epochs=1,
|
| 136 |
+
|
| 137 |
+
Dataset was prepared using Mistral-Small Instruct format.
|
| 138 |
+
|
| 139 |
+
Fine-tune Llama 3 with ORPO
|
| 140 |
+
|
| 141 |
+
---
|
| 142 |
## Use with llama.cpp
|
| 143 |
Install llama.cpp through brew (works on Mac and Linux)
|
| 144 |
|
|
|
|
| 177 |
or
|
| 178 |
```
|
| 179 |
./llama-server --hf-repo Triangle104/Mistral-Small-Drummer-22B-Q8_0-GGUF --hf-file mistral-small-drummer-22b-q8_0.gguf -c 2048
|
| 180 |
+
```
|