[218] Qwen2.5-VL Technical Report

November 10, 2025 · 4 min · long8v

[144] Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond

December 26, 2023 · 2 min · long8v

[137] mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration

December 5, 2023 · 3 min · long8v