-
-
-
-
-
-
Inference Providers
Active filters:
kto, trl
ericlewis/SmolLM-1.7B-Instruct-KTO-V6
Text Generation
•
2B
•
Updated
ericlewis/SmolLM-1.7B-Instruct-KTO-V7
Text Generation
•
2B
•
Updated
qgallouedec/kto-aligned-model
Text Generation
•
2B
•
Updated
mahak1204/Mistral-2-7b-Instruct-v0.2-finetune-kto
Text Generation
•
7B
•
Updated
•
1
PaulD/llama3_false_positives_1207_KTO_top_model
Updated
PaulD/llama3_false_positives_0609_KTO_hp_screening
Updated
PaulD/llama3_false_positives_0609_KTO_hp_screening_seeds
Huertas97/smollm-gec-sftt-kto
Text Generation
•
0.1B
•
Updated
Text Generation
•
0.1B
•
Updated
•
1
CharlesLi/OpenELM-1_1B-KTO
Text Generation
•
1B
•
Updated
•
1
Text Generation
•
2B
•
Updated
•
1
PaulD/llama3_false_positives_1609_KTO_optimised_model
PaulD/llama3_false_positives_1010_KTO_hp_screening_seeds
johnpaulbin/llama3.2-3b-tokipona-v3-chat-v3
Updated
johnpaulbin/llama3.2-3b-tokipona-v3-chat-v3-Q8_0-GGUF
4B
•
Updated
•
1
Text Generation
•
0.5B
•
Updated
•
2
PaulD/llama3_false_positives_0411_KTO_hp_screening_seeds
PaulD/llama3_false_positives_0312_KTO_optimised_model
Text Generation
•
Updated
•
1
PaulD/llama3_false_positives_1101_KTO_optimised_model
chchen/Llama-3.1-8B-Instruct-KTO-100
Updated
chchen/Llama-3.1-8B-Instruct-KTO-200
Updated
chchen/Llama-3.1-8B-Instruct-KTO-300
chchen/Llama-3.1-8B-Instruct-KTO-400
Updated
chchen/Llama-3.1-8B-Instruct-KTO-500
Updated
chchen/Llama-3.1-8B-Instruct-KTO-600
Updated
chchen/Llama-3.1-8B-Instruct-KTO-700
Updated
chchen/Llama-3.1-8B-Instruct-KTO-800
Updated
chchen/Llama-3.1-8B-Instruct-KTO-900
Updated
chchen/Llama-3.1-8B-Instruct-KTO-1000
Updated