Model Card for VALOR-8B

This is the RL-tuned Qwen3-8B model from the paper: No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers

For further information please refer to the project webpage, paper, and repository.

Citation

If you use VALOR in your research, please consider citing our work:

BibTeX:

@misc{marsili2025labelsproblemtrainingvisual,
      title={No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers}, 
      author={Damiano Marsili and Georgia Gkioxari},
      year={2025},
      eprint={2512.08889},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2512.08889}, 
}
Downloads last month
23
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for glab-caltech/VALOR-8B

Base model

Qwen/Qwen3-8B-Base
Finetuned
Qwen/Qwen3-8B
Finetuned
(642)
this model
Quantizations
2 models

Dataset used to train glab-caltech/VALOR-8B