arxiv:2508.20478
X
Phoebe13
AI & ML interests
None yet
Recent Activity
updated a model 11 days ago
Phoebe13/Video-MTR upvoted a paper 8 months ago
Random Policy Valuation is Enough for LLM Reasoning with Verifiable
Rewards authored a paper 9 months ago
Video-MTR: Reinforced Multi-Turn Reasoning for Long Video UnderstandingOrganizations
None yet