PrimeRL Qwen Models
Collection
4 items • Updated
Qwen3-4B fine-tuned with reinforcement learning (RL) on a combination of three medical datasets: MedCalc-Bench, MedMCQA, and MedCaseReasoning. The LoRA adapter weights have been merged into the base model weights.
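The card does not describe how the merge was performed. As a rough illustration of what merging LoRA weights usually means, the sketch below folds a low-rank update into a weight matrix using the standard LoRA formula W' = W + (alpha / r) * B A. The matrices, `alpha`, `rank`, and all function names are toy values and hypothetical helpers chosen for illustration; none of them come from this model.

```python
# Toy illustration of a LoRA merge: W' = W + (alpha / rank) * (B @ A).
# All values and helper names here are hypothetical, not taken from the model.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def matvec(w, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) for row in w]

def merge_lora(w, lora_a, lora_b, alpha, rank):
    """Return the base weights with the scaled low-rank update folded in."""
    scale = alpha / rank
    delta = matmul(lora_b, lora_a)  # shape: out_features x in_features
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]

# Base weight (2x2), LoRA factors with rank 1: A is 1x2, B is 2x1.
w = [[1.0, 0.0], [0.0, 1.0]]
lora_a = [[1.0, 2.0]]
lora_b = [[0.5], [-1.0]]
alpha, rank = 2.0, 1

merged = merge_lora(w, lora_a, lora_b, alpha, rank)

# The merged matrix gives the same output as base + adapter applied separately.
x = [3.0, 4.0]
base_out = matvec(w, x)
adapter_out = [(alpha / rank) * v for v in matvec(lora_b, matvec(lora_a, x))]
unmerged_out = [b + a for b, a in zip(base_out, adapter_out)]
merged_out = matvec(merged, x)
```

After such a merge the adapter is no longer needed at inference time, so a merged checkpoint can typically be loaded like any ordinary model checkpoint.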
License: Apache 2.0
Base model: Qwen/Qwen3-4B-Instruct-2507