Alibaba | 🍎 Paper Today I Read 🦔

[218] Qwen2.5-VL Technical Report

alibaba MLLM 2025Q2 qwen

[144] Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond

multilingual alibaba 2023Q3 MLLM qwen

[137] mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration

multimodal LLM 2023Q4 alibaba