Fine-Tuning Diffusion Models via Intermediate Distribution Shaping Paper • 2510.02692 • Published Mar 3 • 1
Seeing Fast and Slow: Learning the Flow of Time in Videos Paper • 2604.21931 • Published 4 days ago • 17
MultiWorld: Scalable Multi-Agent Multi-View Video World Models Paper • 2604.18564 • Published 7 days ago • 43
UniMesh: Unifying 3D Mesh Understanding and Generation Paper • 2604.17472 • Published 8 days ago • 10
Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation Paper • 2604.18168 • Published 7 days ago • 96
Elucidating the SNR-t Bias of Diffusion Probabilistic Models Paper • 2604.16044 • Published 10 days ago • 73
LeapAlign: Post-Training Flow Matching Models at Any Generation Step by Building Two-Step Trajectories Paper • 2604.15311 • Published 11 days ago • 12
Pseudo-Unification: Entropy Probing Reveals Divergent Information Patterns in Unified Multimodal Models Paper • 2604.10949 • Published 14 days ago • 39
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper • 2604.11804 • Published 14 days ago • 70
LPM 1.0: Video-based Character Performance Model Paper • 2604.07823 • Published 18 days ago • 76
Rethinking the Diffusion Model from a Langevin Perspective Paper • 2604.10465 • Published 15 days ago • 15
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper • 2510.05684 • Published Oct 7, 2025 • 146
Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis Paper • 2603.29620 • Published 27 days ago • 46
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models Paper • 2603.25716 • Published Mar 26 • 156
ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks Paper • 2603.27862 • Published 28 days ago • 31
On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers Paper • 2603.28762 • Published 28 days ago • 25