[136] Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models

November 28, 2023 ยท 3 min ยท long8v ยท 

[47] Recovering the Unbiased Scene Graphs from the Biased Ones

August 5, 2022 ยท 2 min ยท long8v ยท 

[46] ReFormer: The Relational Transformer for Image Captioning

August 3, 2022 ยท 3 min ยท long8v ยท