Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

114

Full-text search

Active filters: reward-model

RobotsMali/reward-model

898k • Updated Jan 9 • 927

mradermacher/Binary-Think-RM-3B-GGUF

3B • Updated Nov 3, 2025 • 60 • 1

mradermacher/Binary-Think-RM-3B-i1-GGUF

3B • Updated Dec 7, 2025 • 94 • 1

sodeniZz/llm-course-hw2-reward-model

Text Classification • 0.1B • Updated Nov 15, 2025

friendshipkim/Qwen2.5-Math-1.5B-Scoring

2B • Updated Nov 15, 2025 • 1

DarianNLP/bert_sequel_beagles

0.1B • Updated Nov 18, 2025

friendshipkim/Qwen2.5-Math-1.5B-Scoring-Mean

2B • Updated Nov 16, 2025 • 34

mingpinDZJ/Shanzhi-M1

Text Generation • 33B • Updated Nov 22, 2025 • 5 • 3

bmbgsj/ProRAG_PRM

Text Classification • 8B • Updated 7 days ago • 11 • 1

AIPlans/Qwen3-0.6B-RM-hs2

Text Classification • 0.6B • Updated Dec 1, 2025 • 1 • 1

AmirhoseinGH/Gnosis-Qwen3-1.7B-Hybrid

Text Classification • 2B • Updated Jan 7 • 70

AmirhoseinGH/Gnosis-Qwen3-4B-Instruct-2507

Text Classification • 4B • Updated Jan 7 • 22

AmirhoseinGH/Gnosis-Qwen3-4B-Thinking-2507

Text Classification • 4B • Updated Jan 7 • 34

AmirhoseinGH/Gnosis-Qwen3-8B

Text Classification • 8B • Updated Jan 7 • 16

SQCU/brainrot-partition-BTRMplus

nvidia/Qwen2.5-CascadeRL-RM-72B

Text Generation • 71B • Updated Jan 1 • 175 • 11

OpenDILabCommunity/HUMOR-RM-Keye-VL

Image Classification • 9B • Updated 27 days ago • 32 • 1

xander2432/djpo-reward-model

Text Classification • Updated Jan 5

Aster2024/swift-ministral-8b-deepscaler

Reinforcement Learning • Updated 29 days ago • 27 • 2

newmindai/Muhakim

Text Generation • Updated 15 days ago • 26 • 4

phuongntc/Multi_EvalSumVietN_FullDoc

Summarization • Updated 19 days ago • 13

Sachinkry/qwen3-imdb-reward-0.6b

Text Classification • 0.6B • Updated 18 days ago • 31

Bhavkumar21/mmrb2-mj1-checkpoint-results

Updated 9 days ago

mradermacher/HER-RM-32B-i1-GGUF

33B • Updated 7 days ago • 7.28k