Models

114

Active filters: reward-model

ilgee/Multiclass-Think-RM-8B

8B • Updated Nov 2, 2025 • 6

launch/ThinkPRM-7B

Text Generation • 8B • Updated May 17, 2025 • 41 • 1

mradermacher/ThinkPRM-7B-GGUF

8B • Updated Jul 11, 2025 • 262

mradermacher/ThinkPRM-7B-i1-GGUF

8B • Updated Jul 11, 2025 • 580

Huanghz/align2llava-7b-lora-question

Updated May 21, 2025 • 2

Huanghz/align2llava-7b-lora-answer

Updated May 21, 2025 • 1

nvidia/Qwen-2.5-Nemotron-32B-Reward

Text Classification • 32B • Updated Jun 26, 2025 • 134 • 2

nvidia/Qwen-3-Nemotron-32B-Reward

Text Classification • 32B • Updated Jun 26, 2025 • 4.54k • 19

zhuohaoyu/RewardAnything-8B-v1

Text Generation • 8B • Updated Jun 5, 2025 • 293 • 4

mradermacher/RewardAnything-8B-v1-GGUF

8B • Updated Jul 11, 2025 • 44

WisdomShell/RewardAnything-8B-v1

Text Generation • 8B • Updated Jun 5, 2025 • 174 • 22

Skywork/Skywork-Reward-V2-Qwen3-8B

Text Classification • 8B • Updated Jul 6, 2025 • 8.76k • 22

ContextualAI/ctx-bird-reward-250121

Text Generation • 33B • Updated Dec 2, 2025 • 19 • 5

Bifrost-AI/Qwen-3-Nemotron-32B-Reward-F16

Text Classification • 32B • Updated Jul 11, 2025

tensorblock/WisdomShell_RewardAnything-8B-v1-GGUF

Text Generation • 8B • Updated 16 days ago • 73

ulab-ai/sotopia-rl-qwen2.5-7B-rm

Feature Extraction • Updated Aug 7, 2025 • 1

ilgee/Binary-Think-RM-3B

3B • Updated Nov 2, 2025 • 1

gandhiraketla277/demo-lora-reward-model

Text Generation • Updated Aug 10, 2025

Schrieffer/Llama-SARM-4B

Reinforcement Learning • 5B • Updated Dec 11, 2025 • 12 • 1

ykorkmaz/rfm_no_failure

4B • Updated Aug 30, 2025 • 3

abraranwar/spur_metaworld

4B • Updated Aug 31, 2025

ykorkmaz/rfm_progress_only

4B • Updated Sep 1, 2025 • 3

kewu93/skywork-medarena-lora-v1

Updated Sep 18, 2025

kewu93/skywork-medarena-lora-v2

Text Classification • Updated Sep 18, 2025 • 1

nabeelshan/rlhf-gpt2-pipeline

Text Generation • Updated Sep 24, 2025

Schrieffer/Llama-SARM-4B-PostSAEPretrain

Feature Extraction • 5B • Updated Dec 11, 2025 • 2 • 1

dongboklee/gPRM-14B

Text Generation • Updated Oct 6, 2025 • 16 • 1

dongboklee/gPRM-14B-merged

Text Generation • 15B • Updated Oct 6, 2025 • 26 • 2

dongboklee/gORM-14B

Text Generation • Updated Oct 6, 2025

dongboklee/gORM-14B-merged

Text Generation • 15B • Updated Oct 6, 2025 • 51 • 1