-
Towards Scalable Pre-training of Visual Tokenizers for Generation
Paper • 2512.13687 • Published • 108 -
MMGR: Multi-Modal Generative Reasoning
Paper • 2512.14691 • Published • 121 -
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
Paper • 2512.23447 • Published • 99 -
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation
Paper • 2512.23576 • Published • 66
diege
fulandiege
·
AI & ML interests
None yet
Recent Activity
updated a collection 1 day ago
papers updated a collection 6 days ago
papers upvoted a collection 20 days ago
OneVL ModelsOrganizations
None yet