Seungone Kim PRO

seungone

https://seungonekim.github.io/

AI & ML interests

Large Language Models, LLM-as-a-Judge, Reward Model Overoptimization, Personalized Alignment

Recent Activity

upvoted a paper 21 days ago

LLM-as-a-Tutor: Policy-Aware Prompt Adaptation for Non-Verifiable RL

upvoted a paper 30 days ago

RocketSmith: Agentic Additive Manufacturing of High-Powered Rockets

upvoted a paper about 2 months ago

Verus-SpecGym: An Agentic Environment for Evaluating Specification Autoformalization

Papers 43

arxiv:2606.02404

arxiv:2605.26457

arxiv:2605.20668

arxiv:2605.09063

Spaces 2

My Argilla

Test3

Models 1

seungone/skywork-reward-replicate

Text Classification • 8B • Updated Dec 11, 2024 • 7

Datasets 5

seungone/ablation1_math_gpt4o_mini

Viewer • Updated Nov 25, 2024 • 5.56k • 17

seungone/ablation3_math_llama3.1_8b_instruct

Viewer • Updated Nov 25, 2024 • 24.8k • 9

seungone/ablation2_math_llama3.1_8b_instruct

Viewer • Updated Nov 25, 2024 • 5.99k • 12

seungone/ablation1_code_gpt4o_mini

Viewer • Updated Nov 25, 2024 • 10k • 20

seungone/final-math-claude3.5_sonnet-10000

Viewer • Updated Sep 16, 2024 • 10k • 19 • 1