Qwen3-4B-MedCombined-RL

Qwen3-4B fine-tuned with RL on combined medical datasets (MedCalc-Bench, MedMCQA, MedCaseReasoning). LoRA weights properly merged.

Model Details

Usage

Please ask your administrator.

License

Apache 2.0

Downloads last month
-
Safetensors
Model size
4B params
Tensor type
BF16
·
Video Preview
loading

Model tree for nsk7153/Qwen3-4B-MedCombined-RL

Finetuned
(1395)
this model

Collection including nsk7153/Qwen3-4B-MedCombined-RL