arxiv:2505.13291
Michał Wiliński
MWilinski
AI & ML interests
Machine Learning, Reinforcement Learning
Recent Activity
updated a dataset about 6 hours ago
MWilinski/rlhf-irl-pirate-expert
published a dataset about 6 hours ago
MWilinski/rlhf-irl-pirate-expert
updated a model about 19 hours ago
MWilinski/qwen2.5-3b-dpo-irl