arxiv:2603.06569
Boqiang Zhang
Cyril666
AI & ML interests
Multi-modal
Large Language Models
Vision-Language-Action Models
Recent Activity
upvoted a paper 1 day ago
Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items liked a dataset 7 days ago
tencent/Penguin-Recap-V upvoted a paper 20 days ago
AIBench: Evaluating Visual-Logical Consistency in Academic Illustration Generation