M3.2-24B-Loki-V1.1-GGUF
GGUF model files for M3.2-24B-Loki-V1.1.
This repository contains GGUF models quantized using llama.cpp.
- Base Model:
M3.2-24B-Loki-V1.1 - Quantization Methods Processed in this Job:
Q8_0 - Importance Matrix Used: No
This specific upload is for the Q8_0 quantization.
- Downloads last month
- 16
Hardware compatibility
Log In to add your hardware
8-bit
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support