Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

30,700

Base only

Active filters: 8-bit

nvidia/Qwen3.6-35B-A3B-NVFP4

Text Generation • 19B • Updated 14 days ago • 5.02M • 363

nvidia/GLM-5.2-NVFP4

Text Generation • 381B • Updated about 10 hours ago • 6.46k • 98

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 5 days ago • 1.17M • • 5.08k

deepseek-ai/DeepSeek-V4-Flash

Text Generation • 158B • Updated 5 days ago • 2.03M • • 1.61k

deepseek-ai/DeepSeek-V4-Pro-DSpark

Text Generation • 889B • Updated about 4 hours ago • 42

nvidia/MiniMax-M3-NVFP4

Text Generation • 247B • Updated about 16 hours ago • 13.9k • 26

google/gemma-4-E2B-it-qat-mobile-transformers

Any-to-Any • 2B • Updated 22 days ago • 17.4k • 77

0xSero/GLM-5.2-504B

Text Generation • 290B • Updated 1 day ago • 9.21k • 19

nvidia/DeepSeek-V4-Flash-NVFP4

Text Generation • 167B • Updated 12 days ago • 223k • 46

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4

Text Generation • 335B • Updated 3 days ago • 411k • • 216

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 4.05M • • 4.92k

nvidia/Gemma-4-26B-A4B-NVFP4

Text Generation • 14B • Updated May 11 • 2.04M • 97

lukealonso/GLM-5.2-NVFP4

Text Generation • 432B • Updated 10 days ago • 65.9k • 26

PhalaCloud/GLM-5.2-W4AFP8

Text Generation • 392B • Updated 5 days ago • 11.2k • 22

deepseek-ai/DeepSeek-V4-Flash-DSpark

Text Generation • 165B • Updated about 5 hours ago • 12

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26, 2025 • 7.01M • • 4.73k

0xSero/DeepSeek-V4-Flash-180B

Text Generation • 102B • Updated 28 days ago • 4.88k • 29

mlx-community/gemma-4-12b-coder-fable5-composer2.5-8bit

Text Generation • 12B • Updated 6 days ago • 4.4k • 13

madeby561/GLM-5.2-NVFP4-REAP-504B-term

Text Generation • 290B • Updated 4 days ago • 1.16k • 13

unsloth/Qwen3.6-27B-NVFP4

Image-Text-to-Text • 19B • Updated 27 days ago • 1.11M • 94

XiaomiMiMo/MiMo-V2.5-Pro-FP4-DFlash

Text Generation • 554B • Updated 19 days ago • 46.1k • 134

sakamakismile/gemma-4-12B-coder-fable5-composer2.5-MTP-NVFP4

Text Generation • 7B • Updated 11 days ago • 4.2k • 46

DJLougen/Qwable-5-27B-Coder-NVFP4

Text Generation • 15B • Updated 4 days ago • 564 • 7

OpenYourMind/GLM-5.2-abliterated

432B • Updated 1 day ago • 7

microsoft/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated Dec 17, 2025 • 9.33k • 1.47k

nvidia/diffusiongemma-26B-A4B-it-NVFP4

Text Generation • 14B • Updated 16 days ago • 973k • 87

lovedheart/Qwen-AgentWorld-35B-A3B-NVFP4

Text Generation • 22B • Updated 3 days ago • 972 • 6

sakamakismile/Ornith-1.0-35B-NVFP4

Image-Text-to-Text • 20B • Updated 1 day ago • 408 • 6

unsloth/Qwen3.6-35B-A3B-NVFP4

Image-Text-to-Text • 22B • Updated 27 days ago • 168k • 44

0xSero/DeepSeek-V4-Flash-162B

Text Generation • 92B • Updated 28 days ago • 1.52k • 20