Zen Reranker 8B GGUF

High-performance text reranking model based on Qwen3-Reranker-8B, optimized for efficient inference.

Downloads

Source URL
HuggingFace hf download zenlm/zen-reranker-8B-GGUF
Direct https://download.hanzo.ai/llm-models/zen-reranker-8B-Q4_K_M.gguf

Features

  • 100+ language support
  • Optimized for reranking search results
  • GGUF format for efficient CPU/GPU inference
  • Q4_K_M quantization (5.03 GB)

License

Apache 2.0 (inherited from Qwen3-Reranker)

Downloads last month
25
GGUF
Model size
8B params
Architecture
qwen3
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for zenlm/zen-reranker-8B-GGUF

Base model

Qwen/Qwen3-8B-Base
Quantized
(32)
this model