[169] Direct Preference Optimization: Your Language Model is Secretly a Reward Model

August 26, 2024 · 1 min · long8v

[163] What You See is What You Read? Improving Text-Image Alignment Evaluation

July 18, 2024 · 2 min · long8v

[135] Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved with Text

November 23, 2023 · 3 min · long8v

[133] DataComp: In search of the next generation of multimodal datasets

October 5, 2023 · 2 min · long8v

[132] Hyperbolic Image-Text Representations

September 26, 2023 · 2 min · long8v

[118] PaLI-X: On Scaling up a Multilingual Vision and Language Model

June 8, 2023 · 4 min · long8v