[6] Crossing the Format Boundary of Text and Boxes: Towards Unified Vision-Language Modeling

January 18, 2022 ยท 1 min ยท long8v ยท