qwen_arabic_medical

This model is a fine-tuned version of unsloth/qwen2.5-7b-instruct-unsloth-bnb-4bit on an unknown dataset. On the evaluation set it reaches a final validation loss of 1.6825 (step 2500, epoch 2.96).

Model description

More information needed

The following hyperparameters were used during training:

learning_rate: 0.0002
train_batch_size: 2
eval_batch_size: 1
seed: 3407
gradient_accumulation_steps: 4
total_train_batch_size: 8
optimizer: 8-bit AdamW (OptimizerNames.ADAMW_8BIT) with betas=(0.9, 0.999) and epsilon=1e-08; no additional optimizer arguments
lr_scheduler_type: linear
lr_scheduler_warmup_steps: 10
num_epochs: 3
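
As a sanity check on the values above, here is a minimal Python sketch of how the total batch size and the linear schedule fit together. It assumes a single training device (consistent with 2 x 4 = 8) and estimates the total step count from the first logged row (step 250 at epoch 0.2961). The function names are illustrative, not taken from the training code, and the schedule formula is the usual linear warmup followed by linear decay, not something the card confirms.

```python
def effective_batch_size(per_device: int, grad_accum: int, num_devices: int = 1) -> int:
    """Samples contributing to one optimizer step."""
    return per_device * grad_accum * num_devices


def linear_lr(step: int, base_lr: float = 2e-4, warmup_steps: int = 10,
              total_steps: int = 2533) -> float:
    """Linear warmup to base_lr, then linear decay to zero at total_steps."""
    if step < warmup_steps:
        return base_lr * (step / max(1, warmup_steps))
    remaining = max(0, total_steps - step)
    return base_lr * (remaining / max(1, total_steps - warmup_steps))


# Assumption: one device, so 2 * 4 * 1 matches the reported total of 8.
TOTAL_BATCH = effective_batch_size(per_device=2, grad_accum=4)

# Estimate, not a reported value: steps per epoch from the first logged row.
STEPS_PER_EPOCH = 250 / 0.2961
EST_TOTAL_STEPS = round(3 * STEPS_PER_EPOCH)

print("total batch size:", TOTAL_BATCH)
print("estimated optimizer steps for 3 epochs:", EST_TOTAL_STEPS)
for s in (0, 5, 10, 1250, 2500):
    print(s, linear_lr(s, total_steps=EST_TOTAL_STEPS))
```

With only 10 warmup steps out of roughly 2,500, the learning rate is at or near its peak for almost the entire first epoch and decays steadily afterwards.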

Training Loss	Epoch	Step	Validation Loss
1.7594	0.2961	250	1.7583
1.6629	0.5922	500	1.6873
1.6701	0.8884	750	1.6366
1.3644	1.1836	1000	1.6331
1.3127	1.4797	1250	1.6502
1.2920	1.7758	1500	1.6292
1.0995	2.0711	1750	1.6320
1.1124	2.3672	2000	1.6248
0.9545	2.6633	2250	1.6714
0.9728	2.9594	2500	1.6825
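
To read the table at a glance, the following Python sketch picks the logged step with the lowest validation loss and reports the final gap between training and validation loss. The rows are copied from the table; the helper name is illustrative, and the perplexity line assumes the reported loss is a mean per-token cross-entropy in nats, which the card does not state.

```python
import math

# (training_loss, epoch, step, validation_loss), copied from the table above
ROWS = [
    (1.7594, 0.2961, 250, 1.7583),
    (1.6629, 0.5922, 500, 1.6873),
    (1.6701, 0.8884, 750, 1.6366),
    (1.3644, 1.1836, 1000, 1.6331),
    (1.3127, 1.4797, 1250, 1.6502),
    (1.2920, 1.7758, 1500, 1.6292),
    (1.0995, 2.0711, 1750, 1.6320),
    (1.1124, 2.3672, 2000, 1.6248),
    (0.9545, 2.6633, 2250, 1.6714),
    (0.9728, 2.9594, 2500, 1.6825),
]


def best_checkpoint(rows):
    """Return the logged row with the lowest validation loss."""
    return min(rows, key=lambda r: r[3])


best = best_checkpoint(ROWS)
final = ROWS[-1]
final_gap = final[3] - final[0]

print(f"lowest validation loss: {best[3]} at step {best[2]} (epoch {best[1]})")
# Assumption: loss is mean token-level cross-entropy in nats.
print(f"perplexity at that step: {math.exp(best[3]):.2f}")
print(f"final validation minus training loss: {final_gap:.4f}")
```

Validation loss is lowest at step 2000 and rises over the last two evaluations while training loss keeps falling, which is consistent with overfitting during the third epoch. Whether a checkpoint from step 2000 was saved is not stated in the card.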