[193] Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective

December 30, 2024 · 3 min · long8v

[174] Evaluations for Object Hallucinations

September 2, 2024 · 2 min · long8v

[100] An Overview of Multi-Task Learning in Deep Neural Networks

January 26, 2023 · 2 min · long8v