KORMo-10B KORMo-10B models KORMo: Korean Open Reasoning Model for Everyone Paper • 2510.09426 • Published Oct 10 • 82 KORMo-Team/KORMo-10B-sft Text Generation • 11B • Updated Nov 4 • 2.54k • 117 KORMo-Team/KORMo-10B-base Text Generation • 11B • Updated Nov 7 • 455 • 34 KORMo-Team/KORMo-tokenizer Updated Oct 11 • 16
KORMo midtraining datasets The midtraining datasets for KORMo-10B were collected from diverse, publicly available source. princeton-nlp/prolong-data-64K Updated Oct 5, 2024 • 9.33k • 20 KORMo-Team/Cosmopedia-ko-synth Preview • Updated Oct 13 • 1.2k KORMo-Team/korean-web-collection Preview • Updated Sep 14 • 1.01k • 1 nvidia/Nemotron-Post-Training-Dataset-v1 Viewer • Updated Aug 25 • 25.7M • 11.3k • 165
KORMo pretraining datasets The pretraining datasets for KORMo-10B were collected from diverse, publicly available source. KORMo-Team/dclm-baseline-filtered Preview • Updated Sep 14 • 6.36k • 1 KORMo-Team/korean-web-collection Preview • Updated Sep 14 • 1.01k • 1 KORMo-Team/UltraFineWeb-filtered Preview • Updated Sep 28 • 2.06k • 1 HuggingFaceTB/stack-edu Viewer • Updated Mar 20 • 167M • 2.08k • 60
KORMo SFT datasets The SFT datasets for KORMo-10B were collected from diverse, publicly available source. nvidia/Nemotron-Post-Training-Dataset-v1 Viewer • Updated Aug 25 • 25.7M • 11.3k • 165 HuggingFaceTB/smoltalk2 Viewer • Updated Oct 31 • 8.61M • 9k • 127 KORMo-Team/IF-bilingual-sft Preview • Updated Oct 13 • 145 • 3 KORMo-Team/NemoPost-ko-synth-sft Preview • Updated Jul 10 • 302 • 1
KORMo-10B KORMo-10B models KORMo: Korean Open Reasoning Model for Everyone Paper • 2510.09426 • Published Oct 10 • 82 KORMo-Team/KORMo-10B-sft Text Generation • 11B • Updated Nov 4 • 2.54k • 117 KORMo-Team/KORMo-10B-base Text Generation • 11B • Updated Nov 7 • 455 • 34 KORMo-Team/KORMo-tokenizer Updated Oct 11 • 16
KORMo pretraining datasets The pretraining datasets for KORMo-10B were collected from diverse, publicly available source. KORMo-Team/dclm-baseline-filtered Preview • Updated Sep 14 • 6.36k • 1 KORMo-Team/korean-web-collection Preview • Updated Sep 14 • 1.01k • 1 KORMo-Team/UltraFineWeb-filtered Preview • Updated Sep 28 • 2.06k • 1 HuggingFaceTB/stack-edu Viewer • Updated Mar 20 • 167M • 2.08k • 60
KORMo midtraining datasets The midtraining datasets for KORMo-10B were collected from diverse, publicly available source. princeton-nlp/prolong-data-64K Updated Oct 5, 2024 • 9.33k • 20 KORMo-Team/Cosmopedia-ko-synth Preview • Updated Oct 13 • 1.2k KORMo-Team/korean-web-collection Preview • Updated Sep 14 • 1.01k • 1 nvidia/Nemotron-Post-Training-Dataset-v1 Viewer • Updated Aug 25 • 25.7M • 11.3k • 165
KORMo SFT datasets The SFT datasets for KORMo-10B were collected from diverse, publicly available source. nvidia/Nemotron-Post-Training-Dataset-v1 Viewer • Updated Aug 25 • 25.7M • 11.3k • 165 HuggingFaceTB/smoltalk2 Viewer • Updated Oct 31 • 8.61M • 9k • 127 KORMo-Team/IF-bilingual-sft Preview • Updated Oct 13 • 145 • 3 KORMo-Team/NemoPost-ko-synth-sft Preview • Updated Jul 10 • 302 • 1