SkillFactory: Self-Distillation For Learning Cognitive Behaviors Paper • 2512.04072 • Published 6 days ago • 3
SkillFactory: Self-Distillation For Learning Cognitive Behaviors Paper • 2512.04072 • Published 6 days ago • 3
SkillFactory: Self-Distillation For Learning Cognitive Behaviors Paper • 2512.04072 • Published 6 days ago • 3
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3arg_OLMO_RLONLY-RL Viewer • Updated 7 days ago • 11.5k • 14
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3arg_OLMO_RLONLY-RL-countdown_6arg__v1 Viewer • Updated 7 days ago • 1.01k • 40
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3arg_OLMO_RLONLY-RL-countdown_6arg-eval_rl Viewer • Updated 7 days ago • 1k • 11
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3arg_OLMO_RLONLY-RL-countdown_6arg-eval_rl Viewer • Updated 7 days ago • 1k • 11
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3arg_OLMO_RLONLY-RL Viewer • Updated 7 days ago • 11.5k • 14
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3arg_OLMO_RLONLY-RL-countdown_4arg__v1 Viewer • Updated 7 days ago • 1.01k • 27
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3arg_OLMO_RLONLY-RL-countdown_4arg-eval_rl Viewer • Updated 7 days ago • 1k • 9
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3arg_OLMO_RLONLY-RL-countdown_4arg-eval_rl Viewer • Updated 7 days ago • 1k • 9
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3arg_OLMO_RLONLY-RL-letter_countdown_5o__v1 Viewer • Updated 7 days ago • 304 • 18
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3arg_OLMO_RLONLY-RL-letter_countdown_5o-eval_rl Viewer • Updated 7 days ago • 300 • 16
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3arg_OLMO_RLONLY-RL-letter_countdown_5o-eval_rl Viewer • Updated 7 days ago • 300 • 16
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3arg_OLMO_RLONLY-RL-countdown_5arg__v1 Viewer • Updated 7 days ago • 1.01k • 26
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3arg_OLMO_RLONLY-RL-countdown_5arg-eval_rl Viewer • Updated 7 days ago • 1k • 14
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3arg_OLMO_RLONLY-RL-countdown_5arg-eval_rl Viewer • Updated 7 days ago • 1k • 14
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3arg_OLMO_RLONLY-RL-letter_countdown_4o__v1 Viewer • Updated 8 days ago • 303 • 17
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3arg_OLMO_RLONLY-RL-letter_countdown_4o-eval_rl Viewer • Updated 8 days ago • 300 • 10