arxiv:2606.02404
Seungone Kim PRO
seungone
AI & ML interests
Large Language Models, LLM-as-a-Judge, Reward Model Overoptimization, Personalized Alignment
Recent Activity
authored a paper 1 day ago
Verus-SpecGym: An Agentic Environment for Evaluating Specification Autoformalization authored a paper 1 day ago
K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts upvoted a paper 5 days ago
K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts