[194] Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

2025λ…„ 1μ›” 3일 Β· 4 λΆ„ Β· long8v Β· 

[190] Solving math word problems with process and outcome-based feedback

2024λ…„ 12μ›” 16일 Β· 3 λΆ„ Β· long8v Β· 

[134] Asynchronous Methods for Deep Reinforcement Learning

2023λ…„ 10μ›” 18일 Β· 3 λΆ„ Β· long8v Β· 

[116] Data Distributional Properties Drive Emergent In-Context Learning in Transformers

2023λ…„ 5μ›” 22일 Β· 2 λΆ„ Β· long8v Β· 

[111] Perceiver IO: A General Architecture for Structured Inputs & Outputs

2023λ…„ 4μ›” 24일 Β· 2 λΆ„ Β· long8v Β· 

[109] 🦩 Flamingo: a Visual Language Model for Few-Shot Learning

2023λ…„ 4μ›” 10일 Β· 3 λΆ„ Β· long8v Β· 

[40] Neural Discrete Representation Learning

2022λ…„ 7μ›” 30일 Β· 1 λΆ„ Β· long8v Β· 

[23] Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

2022λ…„ 4μ›” 25일 Β· 2 λΆ„ Β· long8v Β·