[136] Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models

November 28, 2023 · 2 min · long8v

[47] Recovering the Unbiased Scene Graphs from the Biased Ones

August 5, 2022 · 1 min · long8v

[46] ReFormer: The Relational Transformer for Image Captioning

August 3, 2022 · 2 min · long8v