Active filters: simpo
radm/forerunner-qwen32b-simpo-awq • Text Generation • 33B • Updated • 9
Text Generation • 8B • Updated • 3
AIR-hl/Qwen2.5-1.5B-SimPO • Text Generation • 2B • Updated • 3
yakazimir/simpo-exps_qwen05b • Text Generation • 0.5B • Updated • 47
Sean13/mistral-7b-instruct-v0.2-rsimpo-full • Text Generation • 7B • Updated
Boko99/llama3-instruct-simpo • Text Generation • 266k • Updated
Text Generation • 266k • Updated
Sean13/mistral-7b-instruct-v0.2-simpo-full • Text Generation • 7B • Updated • 1
Sean13/llama-8b-instruct-simpo-full • Text Generation • 8B • Updated
Sean13/llama-8b-instruct-rsimpo-full • Text Generation • 8B • Updated
Text Generation • 9B • Updated • 5
jz666/simpo-train-large-correct • Text Generation • 9B • Updated
jz666/simpo-train-largest-30-ppl-rejected • Text Generation • 9B • Updated
jz666/simpo-train-largest-30-ppl-chosen • Text Generation • 9B • Updated • 2
jz666/simpo-train-largest-30-abs-diff • Text Generation • 9B • Updated • 4
jz666/simpo-train-smallest-30-abs-diff • Text Generation • 9B • Updated
jz666/simpo-train-small-correct • Text Generation • 9B • Updated
jz666/simpo-train-small-wrong • Text Generation • 9B • Updated
jz666/simpo-train-filtered-full • Text Generation • 9B • Updated • 1
jz666/simpo-train-large-wrong • Text Generation • 9B • Updated • 2
jz666/gemma-2-9b-it-simpo-split-10-train_filtered_full • Text Generation • 9B • Updated
jz666/gemma-2-9b-it-dpo-train_filtered_full • Text Generation • 9B • Updated
Sean13/mistral-7b-instruct-v0.2-simpo-full-label_smoothing-0.1 • Text Generation • 266k • Updated
Sean13/llama-8b-instruct-simpo-full-label_smoothing-0.1 • Text Generation • 266k • Updated • 1
Text Generation • 3B • Updated • 357 • 1
mradermacher/Quanta-X-3B-GGUF • 3B • Updated • 451 • 1
mradermacher/Quanta-X-3B-i1-GGUF • 3B • Updated • 719 • 1