Active filters: nvfp4
GadflyII/GLM-4.7-Flash-NVFP4 • Text Generation • 18B • Updated • 203k • 53
nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4 • Text Generation • Updated • 15.8k • 18
nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4 • Text Generation • Updated • 12.6k • 24
llmat/Qwen3-30B-A3B-Instruct-2507-NVFP4 • Text Generation • 17B • Updated • 107 • 2
nvidia/Qwen3-235B-A22B-Thinking-2507-NVFP4 • Text Generation • 120B • Updated • 91 • 1
nvidia/Qwen3-235B-A22B-Instruct-2507-NVFP4 • Text Generation • 120B • Updated • 73 • 1
GadflyII/MiniMax-M2.1-NVFP4 • Text Generation • Updated • 3.95k • 4
Firworks/Hemlock-Coder-7B-nvfp4 • 5B • Updated • 21 • 1
apolloparty/Qwen3-4B-NVFP4A16 • 2B • Updated • 1
cortecs/Qwen3-8B-NVFP4A16 • 5B • Updated • 3
5B • Updated • 14
cortecs/Qwen3-8B-clean-sparse • 6B • Updated
cortecs/Qwen3-8B-clean-sparse-nvfp4a16 • 5B • Updated
cortecs/Qwen3-8B-clean-sparse-finetuned-0.01-nvfp4a16 • 5B • Updated • 1
cortecs/Qwen3-8B-clean-sparse-finetuned-0.1-nvfp4a16 • 5B • Updated • 2
llmat/Mistral-Small-24B-Instruct-2501-NVFP4 • Text Generation • 14B • Updated • 37
llmat/Qwen3-4B-Instruct-2507-NVFP4 • Text Generation • 3B • Updated • 61 • 1
llmat/Qwen3-30B-A3B-NVFP4 • Text Generation • 17B • Updated • 3
Text Generation • 19B • Updated • 6
Text Generation • 9B • Updated • 18
Text Generation • 5B • Updated • 3
Text Generation • 3B • Updated • 28
Text Generation • 1B • Updated • 1
Text Generation • 0.6B • Updated • 4
2imi9/gpt-oss-20B-NVFP4A16-BF16 • Text Generation • 21B • Updated • 7.65k • 3
llmat/Apertus-8B-Instruct-2509-NVFP4 • Text Generation • 5B • Updated • 2 • 1
mratsim/Seed-OSS-36B-Instruct-NVFP4 • Text Generation • 21B • Updated • 86 • 4
mratsim/Wayfarer-Large-70B-NVFP4 • Text Generation • 41B • Updated • 28 • 1
Text Generation • 41B • Updated • 16
mratsim/Anubis-70B-v1.1-NVFP4 • Text Generation • 41B • Updated • 67 • 1