[6] Crossing the Format Boundary of Text and Boxes: Towards Unified Vision-Language Modeling

2022λ…„ 1μ›” 18일 Β· 1 λΆ„ Β· long8v Β·