Submitted by Juanxi Tian 21 Auto-Rubric as Reward: From Implicit Preferences to Explicit Multimodal Generative Criteria OpenEnvision 31 2