Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

93

Full-text search

Active filters: reward-model

dongboklee/dPRM-14B

Text Classification • Updated Oct 6 • 19

dongboklee/gORM-8B

Text Generation • Updated Oct 6 • 7

dongboklee/gPRM-8B-merged

Text Generation • 8B • Updated Oct 6 • 10

dongboklee/gORM-8B-merged

Text Generation • 8B • Updated Oct 6 • 15

dongboklee/dORM-8B

Text Classification • Updated Oct 6 • 165

dongboklee/gPRM-8B

Text Generation • Updated Oct 6

dongboklee/dPRM-8B

Text Classification • Updated Oct 6 • 166

mradermacher/Binary-Think-RM-8B-GGUF

8B • Updated Oct 13 • 116

mradermacher/Multiclass-Think-RM-8B-GGUF

8B • Updated Oct 13 • 170

ArtusDev/ilgee_Binary-Think-RM-8B-EXL3

mradermacher/Binary-Think-RM-8B-i1-GGUF

8B • Updated 6 days ago • 1.88k

mradermacher/Multiclass-Think-RM-8B-i1-GGUF

8B • Updated 6 days ago • 1.63k

ArtusDev/ilgee_Multiclass-Think-RM-8B-EXL3

Updated Oct 13 • 4

Panga-Azazia/reward-model-v1

1.29M • Updated Oct 21

Panga-Azazia/reward-model-v2

1.29M • Updated Oct 21

Panga-Azazia/reward-model-v3

Tabular Regression • 1.29M • Updated Oct 21 • 1

Panga-Azazia/reward-model-v4

1.29M • Updated Oct 22

Panga-Azazia/reward-model-v5

1.68M • Updated Oct 22 • 1

Panga-Azazia/reward-model-v6

2.08M • Updated Oct 22 • 1

Panga-Azazia/reward-model-v7

2.08M • Updated Oct 22

kp-forks/reward-model-deberta-v3-large-v2

Updated Feb 1, 2023 • 7

Panga-Azazia/reward-model-v8

2.88M • Updated Oct 23 • 7

Yuhan123/rm_cad_maj_vote_eval_acc_0_9065

Text Classification • 1B • Updated Oct 24 • 4

samhitha2601/llama3-gsm8k-critic

3B • Updated Oct 24 • 5

RobotsMali/reward-model

Tabular Regression • 908k • Updated Nov 11 • 72

Panga-Azazia/reward-model

898k • Updated 25 days ago • 67

mradermacher/Binary-Think-RM-3B-GGUF

3B • Updated Nov 3 • 143 • 1

mradermacher/Binary-Think-RM-3B-i1-GGUF

3B • Updated 6 days ago • 1.73k • 1

sodeniZz/llm-course-hw2-reward-model

Text Classification • 0.1B • Updated 28 days ago • 8

friendshipkim/Qwen2.5-Math-1.5B-Scoring

2B • Updated 29 days ago • 1.28k