Upload fine-tuned Phi-3 4-bit LoRA adapters
Browse files
README.md
CHANGED
|
@@ -147,3 +147,19 @@ try:
|
|
| 147 |
except json.JSONDecodeError as e:
|
| 148 |
print(f"\nGenerated output IS NOT valid JSON. Error: {e}")
|
| 149 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 147 |
except json.JSONDecodeError as e:
|
| 148 |
print(f"\nGenerated output IS NOT valid JSON. Error: {e}")
|
| 149 |
|
| 150 |
+
## Example Output
|
| 151 |
+
|
| 152 |
+
The model aims to generate a valid JSON object structured like the example below. Note that while the training prompt focused on specific keys (question, options, correct_option), the model might also generate related fields like explanation based on patterns learned from the training data.
|
| 153 |
+
|
| 154 |
+
```json
|
| 155 |
+
{
|
| 156 |
+
"question": "What is the maximum duration of a temporary ban from practising as a disciplinary sanction in the medical profession?",
|
| 157 |
+
"option_a": "1 year",
|
| 158 |
+
"option_b": "2 years",
|
| 159 |
+
"option_c": "3 years",
|
| 160 |
+
"option_d": "5 years",
|
| 161 |
+
"correct_option": "C",
|
| 162 |
+
"explanation": "The correct answer is C, which states that the maximum duration of a temporary ban from practising as a disciplinary sanction in the medical profession is 3 years. This information is explicitly stated in the text, which mentions that a temporary ban from practising may be imposed for a maximum of three years. The other options are incorrect because they either underestimate or overestimate the maximum duration of the ban."
|
| 163 |
+
}
|
| 164 |
+
```
|
| 165 |
+
|