Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

215

Full-text search

Active filters: kto, trl

ericlewis/SmolLM-1.7B-Instruct-KTO-V6

Text Generation • 2B • Updated Aug 3, 2024

ericlewis/SmolLM-1.7B-Instruct-KTO-V7

Text Generation • 2B • Updated Aug 3, 2024

qgallouedec/kto-aligned-model

Text Generation • 2B • Updated Aug 22, 2024

mahak1204/Mistral-2-7b-Instruct-v0.2-finetune-kto

Text Generation • 7B • Updated Aug 26, 2024 • 1

PaulD/llama3_false_positives_1207_KTO_top_model

Updated Sep 6, 2024

PaulD/llama3_false_positives_0609_KTO_hp_screening

Updated Sep 6, 2024

PaulD/llama3_false_positives_0609_KTO_hp_screening_seeds

Updated Sep 18, 2024 • 1

Huertas97/smollm-gec-sftt-kto

Text Generation • 0.1B • Updated Sep 12, 2024

Huertas97/smollm-gec-kto

Text Generation • 0.1B • Updated Sep 12, 2024 • 1

CharlesLi/OpenELM-1_1B-KTO

Text Generation • 1B • Updated Sep 24, 2024 • 1

faaany/kto-aligned-model

Text Generation • 2B • Updated Sep 23, 2024 • 1

PaulD/llama3_false_positives_1609_KTO_optimised_model

Updated Oct 10, 2024 • 2

PaulD/llama3_false_positives_1010_KTO_hp_screening_seeds

Updated Oct 11, 2024 • 6

johnpaulbin/llama3.2-3b-tokipona-v3-chat-v3

Updated Mar 14, 2025

johnpaulbin/llama3.2-3b-tokipona-v3-chat-v3-Q8_0-GGUF

4B • Updated Oct 16, 2024 • 1

trl-lib/Qwen2-0.5B-KTO

Text Generation • 0.5B • Updated Oct 18, 2024 • 2

PaulD/llama3_false_positives_0411_KTO_hp_screening_seeds

Updated Nov 5, 2024 • 1

PaulD/llama3_false_positives_0312_KTO_optimised_model

Updated Dec 9, 2024 • 1

jeiku/controlkto

Text Generation • Updated Dec 13, 2024 • 1

PaulD/llama3_false_positives_1101_KTO_optimised_model

Updated Jan 12, 2025 • 1

chchen/Llama-3.1-8B-Instruct-KTO-100

Updated Jan 16, 2025

chchen/Llama-3.1-8B-Instruct-KTO-200

Updated Jan 16, 2025

chchen/Llama-3.1-8B-Instruct-KTO-300

Updated Jan 16, 2025 • 2

chchen/Llama-3.1-8B-Instruct-KTO-400

Updated Jan 16, 2025

chchen/Llama-3.1-8B-Instruct-KTO-500

Updated Jan 16, 2025

chchen/Llama-3.1-8B-Instruct-KTO-600

Updated Jan 16, 2025

chchen/Llama-3.1-8B-Instruct-KTO-700

Updated Jan 16, 2025

chchen/Llama-3.1-8B-Instruct-KTO-800

Updated Jan 16, 2025

chchen/Llama-3.1-8B-Instruct-KTO-900

Updated Jan 16, 2025

chchen/Llama-3.1-8B-Instruct-KTO-1000

Updated Jan 16, 2025