The Ultra-Scale Playbook 🌌 • Running • 3.64k • The ultimate guide to training LLM on large GPU Clusters
Congliu/Chinese-DeepSeek-R1-Distill-data-110k Viewer • Updated Feb 21, 2025 • 110k • 373 • 718
meta-llama/Meta-Llama-3-8B-Instruct Text Generation • 8B • Updated Jun 18, 2025 • 1.45M • 4.35k