Deepseek V3 (All Versions) Collection Deepseek-V3-0324 and V3 - available in original, and Dynamic GGUF formats, with support for 2-8-bit quantized versions. • 7 items • Updated 4 days ago • 39
Gemma 3 Collection All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 55 items • Updated 4 days ago • 96
DeepSeek-V3.1 Collection DeepSeek's new 3.1 update to their V3 models! • 6 items • Updated 4 days ago • 7
DeepSeek R1 (All Versions) Collection DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 4 days ago • 261
Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 4 days ago • 240
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 59 items • Updated 4 days ago • 258
Ministral 3 Collection Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 28 items • Updated 3 days ago • 19
Skywork-R1V4 Collection Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch • 2 items • Updated 1 day ago • 3
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 4 days ago • 110
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated 4 days ago • 69
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published 9 days ago • 143
MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds Paper • 2508.14879 • Published Aug 20 • 68
RpR Models Collection RpR (RolePlay with Reasoning) models which are built on RPMax datasets with properly trained multi-turn reasoning. • 8 items • Updated Jun 25 • 16
Derestricted Collection Models abliterated using Norm-Preserving Biprojected Abliteration • 5 items • Updated 8 days ago • 11
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 137