shisa-v2-research
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
Paper
• 2406.08464
• Published
• 71
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper
• 2406.20094
• Published
• 104
argilla/magpie-ultra-v1.0
Viewer
• Updated
• 3.22M • 686
• 50
Viewer
• Updated
• 1k • 1.91k
• 150
Viewer
• Updated
• 817 • 603
• 177
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Paper
• 2401.01335
• Published
• 68
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Paper
• 2404.03715
• Published
• 62
Self-Boosting Large Language Models with Synthetic Preference Data
Paper
• 2410.06961
• Published
• 16
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
Paper
• 2412.11605
• Published
• 18
Magpie-Align/Magpie-Reasoning-V1-150K-CoT-Deepseek-R1-Llama-70B
Viewer
• Updated
• 150k • 64
• 17
sbintuitions/modernbert-ja-130m
Fill-Mask
• 0.1B • Updated
• 8.05k
• 47
bespokelabs/Bespoke-Stratos-17k
Viewer
• Updated
• 16.7k • 6.18k
• 341
SymNoise: Advancing Language Model Fine-tuning with Symmetric Noise
Paper
• 2312.01523
• Published
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
Paper
• 2411.15124
• Published
• 67