CatkinChen/nethack-ppo-ablation-baseline_curiosity_trans_only Reinforcement Learning • Updated Sep 27
CatkinChen/nethack-ppo-ablation-baseline_curiosity_skill_only Reinforcement Learning • Updated Sep 27
CatkinChen/babyai-classical-ppo-prefinal-experiments-mix_hard_reasoning_early_stopping_target_0.3 Updated Apr 11 • 4
CatkinChen/babyai-classical-ppo-prefinal-experiments-mix_hard_reasoning_early_stopping_something_wild Updated Apr 11 • 5
CatkinChen/BAAI_bge-base-en-v1.5_retrieval_finetuned_2025-04-07_21-26-50 Sentence Similarity • 0.1B • Updated Apr 7 • 7
CatkinChen/BAAI_bge-base-en-v1.5_retrieval_finetuned_2025-04-05_22-01-09 Sentence Similarity • 0.1B • Updated Apr 5 • 7
CatkinChen/BAAI_bge-base-en-v1.5_retrieval_finetuned_2025-04-05_23-00-04 Sentence Similarity • 0.1B • Updated Apr 5 • 7