[163] What You See is What You Read? Improving Text-Image Alignment Evaluationgoogle NeurIPS 2023Q2 evaluation
[135] Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved with Textmultimodal dataset NeurIPS 2023Q2