Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Narutoouz
/
GLM-4-9B-0414-4bit-DWQ
like
2
Text Generation
MLX
Safetensors
glm4
quantized
dwq
9B
apple-silicon
4-bit precision
optimization
verified-performance
long-context
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Use this model
main
GLM-4-9B-0414-4bit-DWQ
5.31 GB
1 contributor
History:
4 commits
Narutoouz
Fix model card to only include GLM-4-9B-0414-4bit-DWQ specific information and correct context lengths
78dc8ae
verified
9 months ago
.gitattributes
1.57 kB
Upload GLM-4-9B-0414-4bit-DWQ DWQ 4-bit quantized model with comprehensive documentation
9 months ago
README.md
6.44 kB
Fix model card to only include GLM-4-9B-0414-4bit-DWQ specific information and correct context lengths
9 months ago
benchmark_script.py
1.03 kB
Upload GLM-4-9B-0414-4bit-DWQ DWQ 4-bit quantized model with comprehensive documentation
9 months ago
chat_template.jinja
944 Bytes
Upload GLM-4-9B-0414-4bit-DWQ DWQ 4-bit quantized model with comprehensive documentation
9 months ago
config.json
907 Bytes
Upload GLM-4-9B-0414-4bit-DWQ DWQ 4-bit quantized model with comprehensive documentation
9 months ago
conversion_script.py
503 Bytes
Upload GLM-4-9B-0414-4bit-DWQ DWQ 4-bit quantized model with comprehensive documentation
9 months ago
generation_config.json
160 Bytes
Upload GLM-4-9B-0414-4bit-DWQ DWQ 4-bit quantized model with comprehensive documentation
9 months ago
model.safetensors
5.29 GB
xet
Upload GLM-4-9B-0414-4bit-DWQ DWQ 4-bit quantized model with comprehensive documentation
9 months ago
model.safetensors.index.json
72.3 kB
Upload GLM-4-9B-0414-4bit-DWQ DWQ 4-bit quantized model with comprehensive documentation
9 months ago
quantization_metadata.json
621 Bytes
Upload GLM-4-9B-0414-4bit-DWQ DWQ 4-bit quantized model with comprehensive documentation
9 months ago
special_tokens_map.json
596 Bytes
Upload GLM-4-9B-0414-4bit-DWQ DWQ 4-bit quantized model with comprehensive documentation
9 months ago
tokenizer.json
20 MB
xet
Upload GLM-4-9B-0414-4bit-DWQ DWQ 4-bit quantized model with comprehensive documentation
9 months ago
tokenizer_config.json
3.19 kB
Upload GLM-4-9B-0414-4bit-DWQ DWQ 4-bit quantized model with comprehensive documentation
9 months ago