Google | 🍎 Paper Today I Read 🦔

[209] SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

google RL Berkeley 2025Q1

[195] STaR: Self-Taught Reasoner Bootstrapping Reasoning With Reasoning

2022Q1 google 25min reasoning

[163] What You See is What You Read? Improving Text-Image Alignment Evaluation

google NeurIPS 2023Q2 evaluation

[155] Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings

google evaluation generation 2024Q2

[154] Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment

google XAI evaluation 2024Q2

[139] Davidsonian Scene Graph: Improving Reliability in Fine-Grained Evaluation for Text-to-Image Generation

google 2023Q4 evaluation generation

[128] Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding

ICML google 2022Q3 document

[124] LiT: Zero-Shot Transfer with Locked-image text Tuning

2021Q4 google CLIP

[123] Robust fine-tuning of zero-shot models

OpenAI google CVPR 2022Q3 CLIP domainshift

[118] PaLI-X: On Scaling up a Multilingual Vision and Language Model

multimodal google 2023Q2

[114] MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks

multimodal google 2023Q1

[92] Long-Tail Learning via Logit Adjustment

2020Q3 google 25min imbalance

[73] Simple Open-Vocabulary Object Detection with Vision Transformers

google object detection 2022Q2 25min ECCV OV

[59] MLP-Mixer: An all-MLP Architecture for Vision

backbone 2021Q2 google 25min

[33] Learning to Prompt for Continual Learning

2021Q4 google CVPR continual learning

[30] CoCa: Contrastive Captioners are Image-Text Foundation Models

multimodal backbone google 2022Q2

[23] Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

SSL 2020Q2 google DeepMind

[20] Memorizing Transformers

NLP 2022Q1 google ICLR long

[18] Deep Learning with Differential Privacy

WIP privacy 2016 google

[9] SimCLR: A Simple Framework for Contrastive Learning of Visual Representations

few-shot SSL 2020Q3 ICML google