Running Featured 1.21k FineWeb: decanting the web for the finest text data at scale 🍷 1.21k Generate high-quality text data for LLMs using FineWeb
Running 3.55k The Ultra-Scale Playbook 🌌 3.55k The ultimate guide to training LLM on large GPU Clusters
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion Paper • 2406.19185 • Published Jun 27, 2024