kaeru39 PRO
ryota39
AI & ML interests
LLM × RL
Recent Activity
liked a model 17 days ago
zai-org/GLM-4.5-Air liked a model 18 days ago
Qwen/Qwen3.5-122B-A10B liked a model 28 days ago
Qwen/Qwen3-1.7B-BaseOrganizations
models 19
ryota39/Qwen3-8B-math-RL-ja
8B • Updated • 2
ryota39/Qwen3-8B-math-RL-en
Text Generation • 8B • Updated • 2
ryota39/gemma-2-2b-jpn-it-q8
3B • Updated • 2
ryota39/Tora-12B
Text Generation • 12B • Updated • 11 • 1
ryota39/Tora-7B-v0.1
Text Generation • Updated • 10 • 2
ryota39/mluke-large-lite-reward
Text Classification • 0.6B • Updated • 3
ryota39/retriva-bert-preference-classifier
Text Classification • 1B • Updated • 3
ryota39/Tora-7B-v0.2
Text Generation • 7B • Updated • 6 • 1
ryota39/llm-jp-1b-sft-100k-LoRA-dpo-12k
Text Generation • 1B • Updated • 6
ryota39/Phi-3-mini-4k-instruct-dpo
Text Generation • 4B • Updated • 11 • 3
datasets 34
ryota39/gsm8k-ja
Viewer • Updated • 8.79k • 31
ryota39/llmjp-chatbot-arena-v2
Viewer • Updated • 594 • 5
ryota39/aya-ja-evol-inst
Viewer • Updated • 29.1k • 14
ryota39/llm-jp-chatbot-arena-conversations-reformatted
Viewer • Updated • 990 • 11 • 1
ryota39/reviews_and_summaries2
Viewer • Updated • 50 • 5
ryota39/reviews_and_summaries
Viewer • Updated • 50 • 8
ryota39/movie_reviews_local
Viewer • Updated • 50 • 4
ryota39/movie_reviews
Viewer • Updated • 50 • 15
ryota39/wild_chat_ja
Viewer • Updated • 3.49k • 3
ryota39/aya-evol-instruct
Viewer • Updated • 29.2k • 15