deepseek-ai/DeepSeek-V3.2-Speciale Text Generation • 685B • Updated 25 days ago • 18.1k • 622
view article Article Rank-Stabilized LoRA: Unlocking the Potential of LoRA Fine-Tuning Feb 20, 2024 • 30
HuggingFaceH4/zephyr-7b-alpha Text Generation • 7B • Updated Oct 16, 2024 • 1.85k • • 1.12k
Running on CPU Upgrade Featured 2.69k The Smol Training Playbook 📚 2.69k The secrets to building world-class LLMs
mattshumer/Reflection-Llama-3.1-70B Text Generation • 71B • Updated Sep 24, 2024 • 463 • 1.71k
view article Article OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models Jul 18 • 50