Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

141

Full-text search

Active filters: nvfp4

GadflyII/GLM-4.7-Flash-NVFP4

Text Generation • 18B • Updated 12 days ago • 203k • 53

nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4

Text Generation • Updated 19 days ago • 15.8k • 18

nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4

Text Generation • Updated 19 days ago • 12.6k • 24

llmat/Qwen3-30B-A3B-Instruct-2507-NVFP4

Text Generation • 17B • Updated Aug 27, 2025 • 107 • 2

nvidia/Qwen3-235B-A22B-Thinking-2507-NVFP4

Text Generation • 120B • Updated 1 day ago • 91 • 1

nvidia/Qwen3-235B-A22B-Instruct-2507-NVFP4

Text Generation • 120B • Updated 2 days ago • 73 • 1

GadflyII/MiniMax-M2.1-NVFP4

Text Generation • Updated 6 days ago • 3.95k • 4

Firworks/Hemlock-Coder-7B-nvfp4

5B • Updated 12 days ago • 21 • 1

apolloparty/Qwen3-4B-NVFP4A16

2B • Updated Jul 12, 2025 • 1

cortecs/Qwen3-8B-NVFP4A16

5B • Updated Nov 27, 2025 • 3

cortecs/Qwen3-8B-NVFP4

5B • Updated Nov 27, 2025 • 14

cortecs/Qwen3-8B-clean-sparse

6B • Updated Nov 27, 2025

cortecs/Qwen3-8B-clean-sparse-nvfp4a16

5B • Updated Nov 27, 2025

cortecs/Qwen3-8B-clean-sparse-finetuned-0.01-nvfp4a16

5B • Updated Nov 27, 2025 • 1

cortecs/Qwen3-8B-clean-sparse-finetuned-0.1-nvfp4a16

5B • Updated Nov 27, 2025 • 2

llmat/Mistral-Small-24B-Instruct-2501-NVFP4

Text Generation • 14B • Updated Aug 27, 2025 • 37

llmat/Qwen3-4B-Instruct-2507-NVFP4

Text Generation • 3B • Updated Aug 27, 2025 • 61 • 1

llmat/Qwen3-30B-A3B-NVFP4

Text Generation • 17B • Updated Aug 28, 2025 • 3

llmat/Qwen3-32B-NVFP4

Text Generation • 19B • Updated Aug 28, 2025 • 6

llmat/Qwen3-14B-NVFP4

Text Generation • 9B • Updated Aug 28, 2025 • 18

llmat/Qwen3-8B-NVFP4

Text Generation • 5B • Updated Aug 28, 2025 • 3

llmat/Qwen3-4B-NVFP4

Text Generation • 3B • Updated Aug 28, 2025 • 28

llmat/Qwen3-1.7B-NVFP4

Text Generation • 1B • Updated Aug 28, 2025 • 1

llmat/Qwen3-0.6B-NVFP4

Text Generation • 0.6B • Updated Aug 28, 2025 • 4

2imi9/gpt-oss-20B-NVFP4A16-BF16

Text Generation • 21B • Updated Dec 19, 2025 • 7.65k • 3

llmat/Apertus-8B-Instruct-2509-NVFP4

Text Generation • 5B • Updated Sep 3, 2025 • 2 • 1

mratsim/Seed-OSS-36B-Instruct-NVFP4

Text Generation • 21B • Updated Sep 14, 2025 • 86 • 4

mratsim/Wayfarer-Large-70B-NVFP4

Text Generation • 41B • Updated Oct 26, 2025 • 28 • 1

mratsim/Nova-70B-NVFP4

Text Generation • 41B • Updated Oct 26, 2025 • 16

mratsim/Anubis-70B-v1.1-NVFP4

Text Generation • 41B • Updated Oct 26, 2025 • 67 • 1