DeepMind | 🍎 Paper Today I Read 🦔

[194] Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

DeepMind 2024Q3 reasoning

[190] Solving math word problems with process and outcome-based feedback

DeepMind 2022Q4 RL

[134] Asynchronous Methods for Deep Reinforcement Learning

2016 DeepMind RL

[116] Data Distributional Properties Drive Emergent In-Context Learning in Transformers

DeepMind NeurIPS 2022Q2

[111] Perceiver IO: A General Architecture for Structured Inputs & Outputs

multimodal 2021Q2 ICLR DeepMind MTL

[109] 🦩 Flamingo: a Visual Language Model for Few-Shot Learning

multimodal DeepMind LLM

[40] Neural Discrete Representation Learning

DeepMind 2017 generative

[23] Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

SSL 2020Q2 google DeepMind