[136] Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models

November 28, 2023 · 2 min · long8v

[128] Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding

August 21, 2023 · 3 min · long8v

[90] Neural Collaborative Graph Machines for Table Structure Recognition

December 22, 2022 · 1 min · long8v