[218] Qwen2.5-VL Technical Report

2025λ…„ 11μ›” 10일 Β· 3 λΆ„ Β· long8v Β· 

[217] PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding

2025λ…„ 11μ›” 3일 Β· 3 λΆ„ Β· long8v Β· 

[216] Emerging Properties in Unified Multimodal Pretraining

2025λ…„ 9μ›” 4일 Β· 3 λΆ„ Β· long8v Β· 

[211] Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

2025λ…„ 7μ›” 2일 Β· 2 λΆ„ Β· long8v Β· 

[212] MiMo-VL Technical Report

2025λ…„ 7μ›” 2일 Β· 2 λΆ„ Β· long8v Β· 

[210] Weight Ensembling Improves Reasoning in Language Models

2025λ…„ 5μ›” 30일 Β· 2 λΆ„ Β· long8v Β·