[136] Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models

November 28, 2023 ยท 3 min ยท long8v ยท 

[128] Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding

August 21, 2023 ยท 3 min ยท long8v ยท 

[90] Neural Collaborative Graph Machines for Table Structure Recognition

December 22, 2022 ยท 1 min ยท long8v ยท