[193] Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective

2024λ…„ 12μ›” 30일 Β· 3 λΆ„ Β· long8v Β· 

[174] Evaluations for Object Hallucinations

2024λ…„ 9μ›” 2일 Β· 2 λΆ„ Β· long8v Β· 

[100] An Overview of Multi-Task Learning in Deep Neural Networks

2023λ…„ 1μ›” 26일 Β· 1 λΆ„ Β· long8v Β·