Running 3.55k The Ultra-Scale Playbook ๐ 3.55k The ultimate guide to training LLM on large GPU Clusters
AdamLucek/roberta-llama3.1405B-twitter-sentiment Text Classification โข 0.1B โข Updated Aug 14, 2024 โข 14 โข 4