kaeru39's picture

kaeru39 PRO

ryota39

·

AI & ML interests

LLM × RL

Recent Activity

liked a model 7 days ago

google/gemma-4-12B-it

liked a dataset 8 days ago

Tevatron/browsecomp-plus

liked a dataset 8 days ago

OpenResearcher/OpenResearcher-Dataset

View all activity

Organizations

Collections 9

View 9 collections

spaces 2

Sake Sonar

Ask questions about sake brewing

ICLR2025 Sonar-v1

Search for ICLR2025 papers using keywords

models 19

ryota39/Qwen3-8B-math-RL-ja

8B • Updated Dec 9, 2025 • 2

ryota39/Qwen3-8B-math-RL-en

Text Generation • 8B • Updated Dec 9, 2025 • 4

ryota39/gemma-2-2b-jpn-it-q8

3B • Updated Feb 22, 2025 • 2

ryota39/Tora-12B

Text Generation • 12B • Updated Nov 25, 2024 • 3 • 1

ryota39/Tora-7B-v0.1

Text Generation • Updated Nov 20, 2024 • 6 • 2

ryota39/mluke-large-lite-reward

Text Classification • 0.6B • Updated Jul 25, 2024 • 3

ryota39/retriva-bert-preference-classifier

Text Classification • 1B • Updated Jul 24, 2024 • 3

ryota39/Tora-7B-v0.2

Text Generation • 7B • Updated Jun 4, 2024 • 5 • 1

ryota39/llm-jp-1b-sft-100k-LoRA-dpo-12k

Text Generation • 1B • Updated May 1, 2024 • 4

ryota39/Phi-3-mini-4k-instruct-dpo

Text Generation • 4B • Updated May 1, 2024 • 13 • 3

datasets 34

ryota39/gsm8k-ja

Viewer • Updated Dec 9, 2025 • 8.79k • 24

ryota39/llmjp-chatbot-arena-v2

Viewer • Updated Jul 11, 2025 • 594 • 7

ryota39/aya-ja-evol-inst

Viewer • Updated May 21, 2025 • 29.1k • 15

ryota39/llm-jp-chatbot-arena-conversations-reformatted

Viewer • Updated May 19, 2025 • 990 • 19 • 1

ryota39/reviews_and_summaries2

Viewer • Updated Apr 26, 2025 • 50 • 7

ryota39/reviews_and_summaries

Viewer • Updated Apr 26, 2025 • 50 • 6

ryota39/movie_reviews_local

Viewer • Updated Apr 26, 2025 • 50 • 10

ryota39/movie_reviews

Viewer • Updated Apr 26, 2025 • 50 • 9

ryota39/wild_chat_ja

Viewer • Updated Jan 23, 2025 • 3.49k • 6

ryota39/aya-evol-instruct

Viewer • Updated Jan 6, 2025 • 29.2k • 10

View 34 datasets