AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding
Paper • 2603.28696 • Published • 6
Decompose images into intrinsic components
Generate stereo views from a single image
Generate surface normals from images