Model Card for VALOR-8B

This is the RL-tuned Qwen3-8B model from the paper: No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers

For further information please refer to the project webpage, paper, and repository.

Citation

If you use VALOR in your research, please consider citing our work:

BibTeX:

@misc{marsili2025labelsproblemtrainingvisual,
      title={No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers}, 
      author={Damiano Marsili and Georgia Gkioxari},
      year={2025},
      eprint={2512.08889},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2512.08889}, 
}

Downloads last month: 23

Safetensors

Model size

8B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for glab-caltech/VALOR-8B

Base model

Qwen/Qwen3-8B-Base

Finetuned

Qwen/Qwen3-8B

Finetuned

(642)

this model

Quantizations

2 models

glab-caltech
/

VALOR-8B

Model Card for VALOR-8B

Citation

Model tree for glab-caltech/VALOR-8B

Dataset used to train glab-caltech/VALOR-8B