kaeru39 PRO
ryota39
AI & ML interests
LLM × RL
Recent Activity
liked a model 2 days ago
Qwen/Qwen3.5-35B-A3B liked a model 2 days ago
Qwen/Qwen3.5-2B liked a model 2 days ago
Qwen/Qwen3.5-9BOrganizations
models 19
ryota39/Qwen3-8B-math-RL-ja
8B • Updated
ryota39/Qwen3-8B-math-RL-en
Text Generation • 8B • Updated • 1
ryota39/gemma-2-2b-jpn-it-q8
3B • Updated • 7
ryota39/Tora-12B
Text Generation • 12B • Updated • 3 • 1
ryota39/Tora-7B-v0.1
Text Generation • Updated • 7 • 2
ryota39/mluke-large-lite-reward
Text Classification • 0.6B • Updated • 3
ryota39/retriva-bert-preference-classifier
Text Classification • 1B • Updated • 2
ryota39/Tora-7B-v0.2
Text Generation • 7B • Updated • 1
ryota39/llm-jp-1b-sft-100k-LoRA-dpo-12k
Text Generation • 1B • Updated • 1
ryota39/Phi-3-mini-4k-instruct-dpo
Text Generation • 4B • Updated • 7 • 3
datasets 34
ryota39/gsm8k-ja
Viewer • Updated • 8.79k • 9
ryota39/llmjp-chatbot-arena-v2
Viewer • Updated • 594 • 317
ryota39/aya-ja-evol-inst
Viewer • Updated • 29.1k • 6
ryota39/llm-jp-chatbot-arena-conversations-reformatted
Viewer • Updated • 990 • 11 • 1
ryota39/reviews_and_summaries2
Viewer • Updated • 50 • 27
ryota39/reviews_and_summaries
Viewer • Updated • 50 • 33
ryota39/movie_reviews_local
Viewer • Updated • 50 • 37
ryota39/movie_reviews
Viewer • Updated • 50 • 12
ryota39/wild_chat_ja
Viewer • Updated • 3.49k • 8
ryota39/aya-evol-instruct
Viewer • Updated • 29.2k • 8