[144] Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond

2023λ…„ 12μ›” 26일 Β· 2 λΆ„ Β· long8v Β· 

[120] Large-scale Bilingual Language-Image Contrastive Learning

2023λ…„ 6μ›” 19일 Β· 3 λΆ„ Β· long8v Β·