Phoebe13/Video-MTR

Visual Question Answering


Video-MTR

Video-MTR: Reinforced Multi-Turn Reasoning for Long Video Understanding.

This checkpoint extends Qwen2.5-VL-7B-Instruct with a multi-turn frame-retrieval policy trained via PPO, with an 80-frame video input budget.
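
As a rough illustration of what an 80-frame budget means for a long video, the sketch below picks at most 80 evenly spaced frame indices. The function name `sample_frame_indices` and the uniform-sampling strategy are assumptions made for illustration only. The card does not say how Video-MTR selects its frames, and the trained policy retrieves frames over multiple turns rather than in a single uniform pass.

```python
def sample_frame_indices(total_frames: int, budget: int = 80) -> list[int]:
    """Pick up to `budget` frame indices spread evenly across a video.

    Hypothetical helper: uniform sampling is an assumed baseline,
    not the retrieval policy of the checkpoint itself.
    """
    if total_frames <= 0 or budget <= 0:
        return []
    if total_frames <= budget:
        # Short video: every frame fits inside the budget.
        return list(range(total_frames))
    # Split the video into `budget` equal-width segments and
    # take the frame nearest the midpoint of each segment.
    step = total_frames / budget
    return [int(step * i + step / 2) for i in range(budget)]


if __name__ == "__main__":
    indices = sample_frame_indices(total_frames=800)
    print(len(indices), indices[:3], indices[-1])
```

A fixed budget like this keeps the visual token count bounded regardless of video length, which is presumably why a retrieval policy that chooses where to spend those frames matters for long videos.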

References

Paper
Code

Downloads last month: 25

Safetensors

Model size: 8B params

Tensor type: BF16

Model tree for Phoebe13/Video-MTR

Base model: Qwen/Qwen2.5-VL-7B-Instruct

Finetuned (1084): this model


Space using Phoebe13/Video-MTR 1

Paper for Phoebe13/Video-MTR

Video-MTR: Reinforced Multi-Turn Reasoning for Long Video Understanding

Paper • 2508.20478 • Published Aug 28, 2025